A New Conceptual Clustering Framework

  • Authors:
  • Nina Mishra;Dana Ron;Ram Swaminathan

  • Affiliations:
  • HP Labs, Palo Alto, CA 94304, USA/ Department of Computer Science, Stanford University, Palo Alto, CA 94305, USA. nina.mishra@cs.stanford.edu;Department of EE-Systems, Tel-Aviv University, Tel Aviv 69978, Israel. danar@eng.tau.ac.il;HP Labs, Palo Alto, CA 94304, USA. swaram@hpl.hp.com

  • Venue:
  • Machine Learning
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a new formulation of the conceptual clustering problem where the goal is to explicitly output a collection of simple and meaningful conjunctions of attributes that define the clusters. The formulation differs from previous approaches since the clusters discovered may overlap and also may not cover all the points. In addition, a point may be assigned to a cluster description even if it only satisfies most, and not necessarily all, of the attributes in the conjunction. Connections between this conceptual clustering problem and the maximum edge biclique problem are made. Simple, randomized algorithms are given that discover a collection of approximate conjunctive cluster descriptions in sublinear time.