Skip to main content
Other
An Integrated, Conditional Model of Information Extraction and Coreference with Application to Citation Matching
Computer Science Department Faculty Publication Series
  • Ben Wellner, University of Massachusetts - Amherst
  • Andrew McCallum, University of Massachusetts - Amherst
  • Fuchun Peng, University of Massachusetts - Amherst
  • Michael Hay, University of Massachusetts - Amherst
Publication Date
2004
Abstract

Although information extraction and coref- erence resolution appear together in many applications, most current systems perform them as independent steps. This paper describes an approach to integrated infer- ence for extraction and coreference based on conditionally-trained undirected graphical models. We discuss the advantages of condi- tional probability training, and of a corefer- ence model structure based on graph parti- tioning. On a data set of research paper cita- tions, we show significant reduction in error by using extraction uncertainty to improve coreference citation matching accuracy, and using coreference to improve the accuracy of the extracted fields.

Disciplines
Comments
This paper was harvested from CiteSeer
Citation Information
Ben Wellner, Andrew McCallum, Fuchun Peng and Michael Hay. "An Integrated, Conditional Model of Information Extraction and Coreference with Application to Citation Matching" (2004)
Available at: http://works.bepress.com/andrew_mccallum/10/