"Relation Extraction with Matrix Factorization and Universal Schemas" by Sebastian Riedel

Selected Works of Benjamin M. Marlin

Follow Contact

Unpublished Paper

Relation Extraction with Matrix Factorization and Universal Schemas

(2013)

Sebastian Riedel
Limin Yao
Andrew McCallum
Benjamin M. Marlin, University of Massachusetts - Amherst

Download

Abstract

Traditional relation extraction predicts relations within some fixed and finite target schema. Machine learning approaches to this task require either manual annotation or, in the case of distant supervision, existing structured sources of the same schema. The need for existing datasets can be avoided by using a universal schema: the union of all involved schemas (surface form predicates as in OpenIE, and relations in the schemas of preexisting databases). This schema has an almost unlimited set of relations (due to surface forms), and supports integration with existing structured data (through the relation types of existing databases). To populate a database of such schema we present matrix factorization models that learn latent feature vectors for entity tuples and relations. We show that such latent models achieve substantially higher accuracy than a traditional classification approach. More importantly, by operating simultaneously on relations observed in text and in preexisting structured DBs such as Freebase, we are able to reason about unstructured and structured data in mutually-supporting ways. By doing so our approach outperforms state-of-the-art distant supervision.

Disciplines

Computer Sciences

Publication Date

2013

Comments

This is the pre-published version harvested from arXiv.

Citation Information

Sebastian Riedel, Limin Yao, Andrew McCallum and Benjamin M. Marlin. "Relation Extraction with Matrix Factorization and Universal Schemas" (2013)
Available at: http://works.bepress.com/benjamin_marlin/1/