Similarities in fuzzy data mining: from a cognitive view to real-world applications

  • Authors:
  • Bernadette Bouchon-Meunier;Maria Rifqi;Marie-Jeanne Lesot

  • Affiliations:
  • UPMC, Univ Paris, CNRS, UMR, Paris, France;UPMC, Univ Paris, CNRS, UMR, Paris, France;UPMC, Univ Paris, CNRS, UMR, Paris, France

  • Venue:
  • WCCI'08 Proceedings of the 2008 IEEE world conference on Computational intelligence: research frontiers
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Similarity is a key concept for all attempts to construct humanlike automated systems or assistants to human task solving since they are very natural in the human process of categorization, underlying many natural capabilities such as language understanding, pattern recognition or decision-making. In this paper, we study the use of similarities in data mining, basing our discourse on cognitive approaches of similarity stemming for instance from Tversky's and Rosch's seminal works, among others. We point out a general framework for measures of comparison compatible with these cognitive foundations, and we show that measures of similarity can be involved in all steps of the data mining process. We then focus on fuzzy logic that provides interesting tools for data mining mainly because of its ability to represent imperfect information, which is of crucial importance when databases are complex, large, and contain heterogeneous, imprecise, vague, uncertain or incomplete data. We eventually illustrate our discourse by examples of similarities used in real-world data mining problems.