Skip to main content
Article
Multiword Expression Filtering for Building Knowledge Maps
ACL Workshop on Multiword Expressions: Integrating Processing (2004)
  • Shailaja Venkatsubramanyan, San Jose State University
  • J. Perez-Carballo, University of California - Los Angeles
Abstract
This paper describes an algorithm that can be used to improve the quality of multiword expressions extracted from documents. We measure multiword expression quality by the "usefulness" of a multiword expression in helping ontologists build knowledge maps that allow users to search a large document corpus. Our stopword based algorithm takes n-grams extracted from documents, and cleans them up to make them more suitable for building knowledge maps. Running our algorithm on large corpora of documents has shown that it helps to increase the percentage of useful terms from 40% to 70% --- with an eight-fold improvement observed in some cases.
Keywords
  • multiword expression,
  • knowledge maps
Publication Date
2004
Publisher Statement
SJSU users: use the following link to login and access the article via SJSU databases
Citation Information
Shailaja Venkatsubramanyan and J. Perez-Carballo. "Multiword Expression Filtering for Building Knowledge Maps" ACL Workshop on Multiword Expressions: Integrating Processing (2004)
Available at: http://works.bepress.com/shailaja_venkatsubramanyan/14/