Skip to main content
Unpublished Paper
A Hidden Markov Model for Alphabet-Soup Word Recognition
(2008)
  • Shaolei Feng
  • Nicholas R. Howe
  • R. Manmatha, University of Massachusetts - Amherst
Abstract

Recent work on the ``alphabet soup'' paradigm has demonstrated effective segmentation-free character-based recognition of cursive handwritten historical text documents. The approach first uses a joint boosting technique to detect potential characters - the alphabet soup. A second stage uses a dynamic programming algorithm to recover the correct sequence of characters. Despite experimental success, the ad hoc dynamic programming method previously lacked theoretical justification. This paper puts the method on a sounder footing by recasting the dynamic programming as inference on an ensemble of hidden Markov models (HMMs). Although some work has questioned the use of score outputs from classifiers like boosting and support vector machines for probability estimates, experiments in this case show good results from treating shifted boosting scores as log probabilities.

Keywords
  • character detection,
  • word recognition,
  • inference models,
  • cursive,
  • historical manuscripts
Disciplines
Publication Date
2008
Comments
This is the pre-published version harvested from CIIR.
Citation Information
Shaolei Feng, Nicholas R. Howe and R. Manmatha. "A Hidden Markov Model for Alphabet-Soup Word Recognition" (2008)
Available at: http://works.bepress.com/r_manmatha/39/