This repository develops an implementation of the Docstrum algorithm presented by O'Gorman (1993).
The source code builds on the work of Chadoliver. The original code is available at https://github.com/chadoliver/cosc428-structor.
The project aims to segment a document image into meaningful components. The target domain is historical machine-printed and handwritten document images.
- Python 2.7
- Packages:
  - numpy
  - cv2
- Pre-processing (optional, for vertical-line removal)
- Blurring (bilateral filtering)
- Otsu's thresholding
- Morphological erosion & dilation
- Smoothing (Averaging)
- Static thresholding
- Nearest-Neighbor Clustering and Docstrum Plot
- Spacing and Orientation Estimation
- Determination of Text-lines
- Structural Block Determination
- Post-processing
- TBD
- TBD
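
The nearest-neighbor clustering and the spacing and orientation estimation steps listed above can be illustrated with numpy alone. The sketch below assumes the connected-component centroids are already available as `(x, y)` pairs. The function names, the default `k`, the bin width, and the angular tolerance are all hypothetical choices made for this example. It also simplifies the spacing estimate to a median over near-orientation pairs, where a histogram peak could be used instead, and it builds a full pairwise distance matrix, which is only practical for a modest number of components.

```python
import numpy as np


def docstrum_pairs(centroids, k=5):
    """Return (distances, angles) to each point's k nearest neighbours.

    Angles are in degrees, folded into [-90, 90) so that a neighbour to
    the left and one to the right of a point give the same angle.
    Plotting these pairs in polar form gives the docstrum plot.
    """
    pts = np.asarray(centroids, dtype=float)
    diff = pts[None, :, :] - pts[:, None, :]   # diff[i, j] = pts[j] - pts[i]
    dist = np.hypot(diff[..., 0], diff[..., 1])
    np.fill_diagonal(dist, np.inf)             # a point is not its own neighbour
    k = min(k, len(pts) - 1)
    idx = np.argsort(dist, axis=1)[:, :k]      # indices of the k closest points
    rows = np.arange(len(pts))[:, None]
    d = dist[rows, idx].ravel()
    ang = np.degrees(np.arctan2(diff[rows, idx, 1],
                                diff[rows, idx, 0])).ravel()
    ang = (ang + 90.0) % 180.0 - 90.0
    return d, ang


def estimate_orientation_and_spacing(d, ang, angle_bin=2.0, tol=15.0):
    """Estimate text orientation (degrees) and within-line spacing.

    Orientation is the centre of the fullest angle-histogram bin.
    Spacing is the median distance of pairs within `tol` degrees of it
    (a simplification assumed for this sketch).
    """
    # Bin edges are offset by half a bin so that 0 degrees is a bin centre.
    edges = np.arange(-90.0 - angle_bin / 2.0, 90.0 + angle_bin, angle_bin)
    hist, edges = np.histogram(ang, bins=edges)
    peak = int(np.argmax(hist))
    orientation = 0.5 * (edges[peak] + edges[peak + 1])

    # Angular difference to the orientation, folded into [0, 90].
    delta = np.abs((ang - orientation + 90.0) % 180.0 - 90.0)
    spacing = float(np.median(d[delta <= tol]))
    return orientation, spacing
```

For example, centroids laid out on horizontal lines yield angles clustered near 0 degrees, and the estimated spacing approximates the gap between adjacent characters. Text-line and block determination would then group components using these two estimates.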
O'Gorman, L., 1993. The document spectrum for page layout analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(11), pp.1162-1173.
@article{o1993document,
title={The document spectrum for page layout analysis},
author={O'Gorman, Lawrence},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
volume={15},
number={11},
pages={1162--1173},
year={1993},
publisher={IEEE}
}
find . -name '.DS_Store' -type f -delete