Skip to content

mzntaka0/ocra

Repository files navigation

OCR related utils

Features

  • Scrape PDF from Web
  • Extract information of coordinations and descriptions from PDF
  • Convert PDF to image object(png)
  • Make OCR dataset like PyTorch Dataset.

Installation

git clone https://github.com/mzntaka0/ocra.git
cd ocra
python setup.py install

Dependencies

  • poppler-utils(pdftohtml)
  • Python >= 3.6.2

About

some utils related w/ OCR processing

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published