- Arlington, Virginia
Starred repositories
Full text geoparsing as a Python library
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80 languages recognition, provide data annotation and synthesis tools, support training and…
Apply different text recognition services to images of handwritten documents.
A simple tool for visually comparing two PDF files
Table structure recognition dataset of the paper: Complicated Table Structure Recognition
Implementation of KatKit as presented at DH2024
Experimenting with IIIF Image and Presentation APIs
A simple Gatsby CETEIcean site with some useful examples for conference workshops.
Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud
An XSLT 2.0 compiler (targeting JavaScript) written in TypeScript.
A compliance analysis tool which enables organizations to more quickly articulate their compliance posture and also generate supporting evidence artifacts
Extracts and formats text annotations from a PDF file
OSCAL (Open Security Controls Assessment Language) on an XProc3 platform
Automatically exclude development dependencies from Apple Time Machine backups
TT2020 is an advanced, open source, hyperrealistic, multilingual typewriter font for a new decade.
Templating library for TEI Publisher's app manager, jinks