Article
Linguistic Structure and Bilingual Informants to Induce Machine Translation of Lesser-Resourced Languages
Institute for Software Research
  • Christian Monson, Carnegie Mellon University
  • Ariadna Font Llitjós
  • Vamshi Ambati, Carnegie Mellon University
  • Lori Levin, Carnegie Mellon University
  • Alon Lavie, Carnegie Mellon University
  • Alison Alvarez, Carnegie Mellon University
  • Roberto Aranovich, Carnegie Mellon University
  • Jaime G. Carbonell, Carnegie Mellon University
  • Robert Frederking, Carnegie Mellon University
  • Erik Peterson, Carnegie Mellon University
  • Katharina Probst
Date of Original Version
1-1-2008
Type
Working Paper
Rights Management
All Rights Reserved
Abstract or Description
Producing machine translation (MT) for the many minority languages in the world is a serious challenge. Minority languages typically have few resources for building MT systems. For many minority languages there is little machine-readable text, few knowledgeable linguists, and little money available for MT development. For these reasons, our research programs on minority language MT have focused on leveraging to the maximum extent two resources that are available for minority languages: linguistic structure and bilingual informants. All natural languages contain linguistic structure. And although the details of that linguistic structure vary from language to language, language universals such as context-free syntactic structure and the paradigmatic structure of inflectional morphology allow us to learn the specific details of a minority language. Similarly, most minority languages possess speakers who are bilingual in the major language of the area. This paper discusses our efforts to utilize linguistic structure and the translation information that bilingual informants can provide in three sub-areas of our rapid development MT program: morphology induction, syntactic transfer rule learning, and refinement of imperfect learned rules.
Citation Information
Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, et al. "Linguistic Structure and Bilingual Informants to Induce Machine Translation of Lesser-Resourced Languages" (2008)
Available at: http://works.bepress.com/jaime_carbonell/41/