"A Novel Word Spotting Method Based on Recurrent Neural Networks" by Volkmar Frinken

Selected Works of R. Manmatha

Follow Contact

Unpublished Paper

A Novel Word Spotting Method Based on Recurrent Neural Networks

(2011)

Volkmar Frinken
Andreas Fischer
R. Manmatha, University of Massachusetts - Amherst
Horst Bunke

Download

Abstract

Keyword spotting refers to the process of retrieving all instances of a given keyword from a document. In the present paper, a novel keyword spotting method for handwritten documents is described. It is derived from a neural network based system for unconstrained handwriting recognition. As such it performs template-free spotting, i.e. it is not necessary for a keyword to appear in the training set. The keyword spotting is done using a modification of the CTC Token Passing algorithm in conjunction with a recurrent neural network. We demonstrate that the proposed systems outperforms not only a classical dynamic time warping based approach but also a modern keyword spotting system, based on hidden Markov models. Furthermore, we analyze the performance of the underlying neural networks when using them in a recognition task followed by keyword spotting on the produced transcription. We point out the advantages of keyword spotting when compared to classic text line recognition.

Disciplines

Computer Sciences

Publication Date

2011

Comments

This is the pre-published version harvested from CIIR.

Citation Information

Volkmar Frinken, Andreas Fischer, R. Manmatha and Horst Bunke. "A Novel Word Spotting Method Based on Recurrent Neural Networks" (2011)
Available at: http://works.bepress.com/r_manmatha/44/