Skip to main content
Unpublished Paper
On Influence of Line Segmentation in Efficient Word Segmentation in Old Manuscripts
(2012)
  • D. Fernández
  • J. Lladós
  • A. Fornés
  • R. Manmatha, University of Massachusetts - Amherst
Abstract

The objective of this work is to show the importance of a good line segmentation to obtain better results in the segmentation of words of historical documents. We have used the approach developed by Manmatha and Rothfeder [1] to segment words in old handwritten documents. In their work the lines of the documents are extracted using projections. In this work, we have developed an approach to segment lines more efficiently. The new line segmentation algorithm tackles with skewed, touching and noising lines, so it is significantly improves word segmentation. Experiments using Spanish docu- ments from the Marriages Database of the Barcelona Cathedral show that this approach reduces the error rate by more than 20%.

Keywords
  • Segmentation,
  • document and text processing,
  • document analysis,
  • handwriting analysis,
  • heuristics,
  • path-finding
Disciplines
Publication Date
2012
Comments
This is the pre-published version harvested from CIIR.
Citation Information
D. Fernández, J. Lladós, A. Fornés and R. Manmatha. "On Influence of Line Segmentation in Efficient Word Segmentation in Old Manuscripts" (2012)
Available at: http://works.bepress.com/r_manmatha/52/