The Needles-in-Haystack Problem

  • Authors:
  • Katherine Moreland; Klaus Truemper

  • Affiliations:
  • The MITRE Corporation, McLean, VA 22102, U.S.A.; Department of Computer Science, University of Texas at Dallas, Richardson, TX 75083, U.S.A.

  • Venue:
  • MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
  • Year:
  • 2009

Abstract

We consider the new problem of detecting members of a rare class of data, the needles, that are hidden in a set of records, the haystack. The only information available about the rare class is a single instance of a needle. Members of the needle class are assumed to be similar to one another according to an unknown needle characterization. The goal is to find the needle records hidden in the haystack. This paper describes an algorithm for that task and applies it to several example cases.