Article
The Semantic Vector Space Model: Implementation and Evaluation.
Journal of the American Society for Information Science (1997)
  • Geoffrey Liu, San Jose State University
Abstract
This article presents the Semantic Vector Space Model (SVSM), a text representation and searching technique that combines the Vector Space Model (VSM) with heuristic syntax parsing and distributed representation of semantic case structures. In this model, both documents and queries are represented as semantic matrices, and a search mechanism computes the similarity between two semantic matrices to predict relevance. A prototype system was built by modifying the SMART system and using the Xerox Part-Of-Speech (P-O-S) tagger as the pre-processor for indexing. The prototype was used in an experimental study to evaluate the technique in terms of precision, recall, and effectiveness of relevance ranking. The results showed that when documents and queries were too short (typically fewer than 2 lines), the technique was less effective than VSM. With longer documents and queries, however, and especially when original documents were used as queries, the system based on this technique performed significantly better than SMART.
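The abstract does not state how semantic matrices are built or compared, so the following is only an illustrative sketch and not the article's method. It assumes a matrix whose rows are hypothetical case roles (agent, action, patient) and whose columns are index terms, and it assumes a cosine measure over the flattened matrix as the similarity. The names `cosine`, `matrix_cosine`, and `collapse_roles` are invented for this example. The point is to show how a role-aware representation can separate two texts that a plain term vector treats as identical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length weight vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def matrix_cosine(a, b):
    """Assumed matrix similarity: cosine over the flattened role-by-term cells."""
    flat_a = [x for row in a for x in row]
    flat_b = [x for row in b for x in row]
    return cosine(flat_a, flat_b)

def collapse_roles(m):
    """Sum over the role rows to recover a plain VSM-style term vector."""
    return [sum(column) for column in zip(*m)]

# Columns: terms ["dog", "bites", "man"]; rows: roles [agent, action, patient].
# Both the role set and the binary weights are assumptions for illustration.
doc = [
    [1, 0, 0],  # agent: dog
    [0, 1, 0],  # action: bites
    [0, 0, 1],  # patient: man
]
query = [
    [0, 0, 1],  # agent: man
    [0, 1, 0],  # action: bites
    [1, 0, 0],  # patient: dog
]

vsm_score = cosine(collapse_roles(doc), collapse_roles(query))
matrix_score = matrix_cosine(doc, query)
print(f"term-vector score: {vsm_score:.3f}")
print(f"role-matrix score: {matrix_score:.3f}")
```

In this toy case the collapsed term vectors are identical, while the two matrices agree only in the action cell, so the role-matrix score comes out lower than the term-vector score.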
Publication Date
1997
DOI
10.1002/(SICI)1097-4571(199705)48:5<395::AID-ASI3>3.0.CO;2-Q
Citation Information
Geoffrey Liu. "The Semantic Vector Space Model: Implementation and Evaluation." Journal of the American Society for Information Science Vol. 48 Iss. 5 (1997) pp. 395-417
Available at: http://works.bepress.com/geoffrey-liu/11/