|
CMU-CS-04-118
Computer Science Department
School of Computer Science, Carnegie Mellon University
CMU-CS-04-118
New Event Detection with nearest Neighbor,
Support Vector Machines, and Kernel Regression
Jian Zhang, Yiming Yang, Jaime Carbonell
April 2004
This document also appears as Language Technologies Institute
Technical Report CMU-LTI-04-180
CMU-CS-04-118.ps
CMU-CS-04-118.pdf
Keywords: Artificial Intelligence: Learning, Pattern Recognition:
Models-Statistical; Pattern Recognition: Design Methodology-Classifier
design and evaluation; algorithms, novelty detection, new event
detection, nearest neighbor, support, vector machines, kernel
regression
Support Vector Machines have received extensive attention in machine
learning community and have been successfully applied in pattern
recognition and regression problems. Recently, it has also been
proposed to solve novelty detection problems, whose objective
is to detect novel objects from existing instances. New Event
Detection (NED), which can be treated as one special application
of novelty detection, has been a research topic in Topic Detection
and Tracking (TDT) community for several years. However, the
winning technology of NED in the TDT community has remained
to be the nearest neighbor method with suitable
distance metric in the document vector space. In this paper
we investigated Support Vector Machines and kernel regression
(as a smoothed nearest neighbor method) for the NED task, and
compared them to the nearest neighbor method. We conducted a set
of experiments on TDT benchmark collections, and provided analysis
on the failure of SVM for not being able to capture misses.
36 pages
|