Skip to main content
Unpublished Paper
Classification Models for New Event Detection
(2004)
  • Girdhar Kumaran
  • James Allan
  • Andrew McCallum, University of Massachusetts - Amherst
Abstract
New event detection (NED) involves monitoring news streams to detect the stories that report on new events. In this paper we explore the application of machine learning classification techniques for this task. We introduce the concept of triangulation with illustrative examples. We develop new features that build on this concept, and the named entities present in a document. The classifiers we developed showed significant and consistent improvement over the baseline vector space model system, on all the collections we tested on. Analysis of the performance of our classifiers suggests the utility of named entities, and the applicability of machine learning techniques to the NED task.
Disciplines
Publication Date
2004
Comments
This is the pre-published version harvested from CIIR.
Citation Information
Girdhar Kumaran, James Allan and Andrew McCallum. "Classification Models for New Event Detection" (2004)
Available at: http://works.bepress.com/andrew_mccallum/42/