Subsets more representative than random ones

  • Authors:
  • Ilia Nouretdinov

  • Affiliations:
  • Department of Computer Science, Royal Holloway, University of London, Egham, Surrey, England

  • Venue:
  • ICDM'07 Proceedings of the 7th industrial conference on Advances in data mining: theoretical aspects and applications
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

Suppose we have a database that describes a set of objects, and our aim is to find its representative subset of a smaller size. Representativeness here means the measure of quality of prediction when the subset is used instead of the whole set in a typical machine learning procedure. We research how to find a subset that is more representative than a random selection of the same size.