Skip to main content
Article
Privacy-Preserving Similarity-Based Text Retrieval
ACM Transactions on Internet Technology
  • Hwee Hwa PANG, Singapore Management University
  • Jialie SHEN, Singapore Management University
  • Ramayya Krishnan, Carnegie Mellon University
Publication Type
Journal Article
Version
publishedVersion
Publication Date
2-2010
Abstract

Users of online services are increasingly wary that their activities could disclose confidential information on their business or personal activities. It would be desirable for an online document service to perform text retrieval for users, while protecting the privacy of their activities. In this article, we introduce a privacy-preserving, similarity-based text retrieval scheme that (a) prevents the server from accurately reconstructing the term composition of queries and documents, and (b) anonymizes the search results from unauthorized observers. At the same time, our scheme preserves the relevance-ranking of the search server, and enables accounting of the number of documents that each user opens. The effectiveness of the scheme is verified empirically with two real text corpora.

Keywords
  • Privacy of search queries,
  • Security in text retrieval,
  • Singular value decomposition
Identifier
10.1145/1667067.1667071
Publisher
ACM
Copyright Owner and License
Authors
Creative Commons License
Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International
Comments

Article No.: 4

Additional URL
http://doi.org/10.1145/1667067.1667071
Citation Information
Hwee Hwa PANG, Jialie SHEN and Ramayya Krishnan. "Privacy-Preserving Similarity-Based Text Retrieval" ACM Transactions on Internet Technology Vol. 10 Iss. 1 (2010) ISSN: 1533-5399
Available at: http://works.bepress.com/hweehwa-pang/61/