Fuzzy Tree Mining: Go Soft on Your Nodes

  • Authors:
  • Federico Razo Lopez;Anne Laurent;Pascal Poncelet;Maguelonne Teisseire

  • Affiliations:
  • LIRMM-CNRS UMR5506, Université Montpellier 2, 161 rue Ada, 34392 Montpellier, France;LIRMM-CNRS UMR5506, Université Montpellier 2, 161 rue Ada, 34392 Montpellier, France;LGI2P - EMA, France;LIRMM-CNRS UMR5506, Université Montpellier 2, 161 rue Ada, 34392 Montpellier, France

  • Venue:
  • IFSA '07 Proceedings of the 12th international Fuzzy Systems Association world congress on Foundations of Fuzzy Logic and Soft Computing
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Tree mining consists in discovering the frequent subtrees from a forest of trees. This problem has many application areas. For instance, a huge volume of data available from the Internet is now described by trees (e.g. XML). Still, for several documents dealing with the same topic, this description is not always the same. It is thus necessary to mine a common structure in order to query these documents. Biology is another field where data may be described by means of trees. The problem of mining trees has now been addressed for several years, leading to well-known algorithms. However, these algorithms can hardly deal with real data in a soft manner. Indeed, they consider a subtree as fully includedin the super-tree. This means that all the nodes must appear. In this paper, we extend this definition to fuzzy inclusion based on the idea that a tree is included to a certain degree within another one, this fuzzy degree being correlated to the number of matching nodes.