Skip to main content
Unpublished Paper
Bibliometric Impact Measures Leveraging Topic Analysis
(2006)
  • Gideon S. Mann
  • David Mimno
  • Andrew McCallum, University of Massachusetts - Amherst
Abstract
Measurements of the impact and history of research literature provide a useful complement to scientific digital library collections. Bibliometric indicators have been extensively studied, mostly in the context of journals. However, journal-based metrics poorly capture topical distinctions in fast-moving fields, and are increasingly problematic in the context of open-access publishing. Recent developments in latent topic models have produced promising results for automatic sub-field discovery. The fine-grained, faceted topics produced by such models provide a more clear view of the topical divisions of a body of research literature and the interactions between those divisions. We demonstrate the usefulness of topic models to impact measurement by applying a new phrase-based topic discovery model to a collection of 300,000 Computer Science publications, collected by the Rexa automatic citation indexing system.
Keywords
  • Information Storage and Retrieval,
  • Digital Libraries,
  • Document and Text processing,
  • Electronic Publishing,
  • Pattern Recognition,
  • Clustering
Disciplines
Publication Date
2006
Comments
This is the pre-published version harvested from CIIR.
Citation Information
Gideon S. Mann, David Mimno and Andrew McCallum. "Bibliometric Impact Measures Leveraging Topic Analysis" (2006)
Available at: http://works.bepress.com/andrew_mccallum/134/