Search and Classification of High Dimensional Data

  • Authors:
  • Yuval Rabani

  • Affiliations:
  • -

  • Venue:
  • APPROX '02 Proceedings of the 5th International Workshop on Approximation Algorithms for Combinatorial Optimization
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

Modeling data sets as points in a high dimensional vector space is a trendy theme in modern information retrieval and data mining. Among the numerous drawbacks of this approach is the fact that many of the required processing tasks are computationally hard in high dimension. We survey several algorithmic ideas that have applications to the design and analysis of polynomial time approximation schemes for nearest neighbor search and clustering of high dimensional data. The main lesson from this line of research is that if one is willing to settle for approximate solutions, then high dimensional geometry is easy. Examples are included in the reference list below.