Skip to content

Herman Melville's Moby Dick in TEI and then Text-Fabric and then Pandas

License

Notifications You must be signed in to change notification settings

annotation/mobydick

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.

Herman Melville- Moby Dick

This repo contains a single book in different representations:

The conversion TEI to TF is done with programs/tfFromTei.py.

The conversion TF to Pandas is done with export.ipynb.

For an example of how to use the Pandas file, see programs/pandas.ipynb.

See docs for documentation about provenance and encoding.

Requirements

If you want to reproduce or make your own computations, pip-install the following Python modules

pip install text-fabric pandas pyarrow

Also, clone this repo, preferably in your ~/github/annotation directory.

Author

See about for the authors/editors of the data.

Dirk Roorda is the author of the representation in Text-Fabric of the data, and the programs and docs.