"A Hidden Markov Model for Alphabet-Soup Word Recognition" by Shaolei Feng

Selected Works of R. Manmatha

Follow Contact

Unpublished Paper

A Hidden Markov Model for Alphabet-Soup Word Recognition

(2008)

Shaolei Feng
Nicholas R. Howe
R. Manmatha, University of Massachusetts - Amherst

Download

Abstract

Recent work on the ``alphabet soup'' paradigm has demonstrated effective segmentation-free character-based recognition of cursive handwritten historical text documents. The approach first uses a joint boosting technique to detect potential characters - the alphabet soup. A second stage uses a dynamic programming algorithm to recover the correct sequence of characters. Despite experimental success, the ad hoc dynamic programming method previously lacked theoretical justification. This paper puts the method on a sounder footing by recasting the dynamic programming as inference on an ensemble of hidden Markov models (HMMs). Although some work has questioned the use of score outputs from classifiers like boosting and support vector machines for probability estimates, experiments in this case show good results from treating shifted boosting scores as log probabilities.

Keywords

character detection,
word recognition,
inference models,
cursive,
historical manuscripts

Disciplines

Computer Sciences

Publication Date

2008

Comments

This is the pre-published version harvested from CIIR.

Citation Information

Shaolei Feng, Nicholas R. Howe and R. Manmatha. "A Hidden Markov Model for Alphabet-Soup Word Recognition" (2008)
Available at: http://works.bepress.com/r_manmatha/39/