The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
scala big-data spark apache-spark hadoop analysis python3 text-extraction pyspark digital-humanities dataframe big-data-analytics webarchives network-graphing
-
Updated
Feb 27, 2024 - Scala