Search Engine Coverage of the OAI-PMH Corpus
IEEE Internet Computing
  • Frank McCown, Ph.D., Harding University
  • Xiaoming Liu
  • Michael L. Nelson, Old Dominion University
  • Mohammad Zubair, Old Dominion University
Document Type
Article
Publication Date
3-1-2006
Abstract

Having indexed much of the "surface" Web, search engines are now using various approaches to index the "deep" Web. At the same time, institutional repositories and digital libraries are adopting the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to expose their holdings. The authors harvested nearly 10 million records from OAI-PMH repositories. From these records, they extracted 3.3 million unique resource URLs and then searched for samples drawn from this collection to determine how much of the OAI-PMH corpus the three major search engines had indexed.
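
As a rough illustration of the extraction and sampling steps mentioned in the abstract, the sketch below pulls unique resource URLs out of an OAI-PMH ListRecords response and draws a random sample from them. It is not the authors' code. It assumes the records use unqualified Dublin Core (oai_dc) and that resource URLs appear in dc:identifier elements; the function names, the SAMPLE_RESPONSE document, and the example.org URLs are all made up for the example.

```python
import random
import xml.etree.ElementTree as ET

# Dublin Core element namespace used inside oai_dc metadata.
DC_NS = "{http://purl.org/dc/elements/1.1/}"

# Hypothetical ListRecords response: two records that share one URL,
# plus one identifier that is not a URL at all.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:identifier>http://example.org/a.pdf</dc:identifier>
          <dc:identifier>local-id-42</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header><identifier>oai:example.org:2</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:identifier>http://example.org/a.pdf</dc:identifier>
          <dc:identifier>http://example.org/b.pdf</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""


def extract_resource_urls(list_records_xml):
    """Return the set of unique http(s) URLs found in dc:identifier elements."""
    root = ET.fromstring(list_records_xml.strip())
    urls = set()
    for elem in root.iter(DC_NS + "identifier"):
        text = (elem.text or "").strip()
        # Keep only identifiers that look like web URLs (assumption).
        if text.lower().startswith(("http://", "https://")):
            urls.add(text)
    return urls


def sample_urls(urls, k, seed=0):
    """Draw up to k URLs at random; sorting first keeps the draw reproducible."""
    rng = random.Random(seed)
    pool = sorted(urls)
    return rng.sample(pool, min(k, len(pool)))


if __name__ == "__main__":
    unique = extract_resource_urls(SAMPLE_RESPONSE)
    print(len(unique), "unique URLs")
    print(sample_urls(unique, 1))
```

In a real harvest, each sampled URL would then be checked against a search engine's index; that step depends on the engine's query interface and is left out here.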

Copyright held by
IEEE Computer Society
Citation Information
Frank McCown, Xiaoming Liu, Michael L. Nelson, and Mohammad Zubair. "Search Engine Coverage of the OAI-PMH Corpus." IEEE Internet Computing, Vol. 10, Iss. 2 (2006), pp. 66-73.
Available at: http://works.bepress.com/fmccown/2/