Stars
A lexical normalizer for historical spelling variants using a transformer architecture.
Fast and robust date extraction from web pages, with Python or on the command-line
Create a teiCorpus-file from a collection of TEI documents
FairCopy is a word processor for the humanities scholar.
Fix errors in xml document that make it invalid according to TEI P5
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
text corpus "Patientenbriefe" – visualization tools
An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing…
Compute similarity between trees, e.g. dependency trees
Modeling and visualizing physical manuscript collation
Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
RAIS: A IIIF-compliant, 100% open source image server for blazing-fast deep zooming
Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German
DM is an environment for the study and annotation of images and texts. It is a suite of tools, enabling scholars to gather and organize the evidence necessary to support arguments based in digitize…
RBush — a high-performance JavaScript R-tree-based 2D spatial index for points and rectangles
Old, previous version of the edition web app.
YUI-compatible utilities for creating resource combos, e.g. CSS/JavaScript rollups.