Mining bi-sets in numerical data

  • Authors:
  • Jérémy Besson;Céline Robardet;Luc De Raedt;Jean-François Boulicaut

  • Affiliations:
  • LIRIS, UMR, CNRS, INSA, Lyon, Villeurbanne, France and UMR, INRA, INSERM, Lyon, cedex 08, France;LIRIS, UMR, CNRS, INSA, Lyon, Villeurbanne, France;Albert-Ludwigs-Universitat Freiburg, Gebaude, Freiburg, Germany;LIRIS, UMR, CNRS, INSA, Lyon, Villeurbanne, France

  • Venue:
  • KDID'06 Proceedings of the 5th international conference on Knowledge discovery in inductive databases
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Thanks to an important research effort the last few years, inductive queries on set patterns and complete solvers which can evaluate them on large 0/1 data sets have been proved extremely useful. However, for many application domains, the raw data is numerical (matrices of real numbers whose dimensions denote objects and properties). Therefore, using efficient 0/1 mining techniques needs for tedious Boolean property encoding phases. This is, e.g., the case, when considering microarray data mining and its impact for knowledge discovery in molecular biology. We consider the possibility to mine directly numerical data to extract collections of relevant bi-sets, i.e., couples of associated sets of objects and attributes which satisfy some user-defined constraints. Not only we propose a new pattern domain but also we introduce a complete solver for computing the so-called numerical bi-sets. Preliminary experimental validation is given.