Discovering interesting information in XML data with association rules

  • Authors:
  • Daniele Braga;Alessandro Campi;Stefano Ceri;Mika Klemettinen;PierLuca Lanzi

  • Affiliations:
  • Politecnico di Milano, P.za L. da Vinci 32, I-20133, Milano, Italy;Politecnico di Milano, P.za L. da Vinci 32, I-20133, Milano, Italy;Politecnico di Milano, P.za L. da Vinci 32, I-20133, Milano, Italy;Nokia Research Center, Finland;Politecnico di Milano, P.za L. da Vinci 32, I-20133, Milano, Italy

  • Venue:
  • Proceedings of the 2003 ACM symposium on Applied computing
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data mining algorithms are designed to extract interesting information from large amounts of data. They usually assume that source data are in relational (tabular) from. However, the recent success of XML as a standard to represent semi-structured data and the increasing amount of data available in XML pose new challenges to the data mining community. In this paper we introduce association rules for XML data. To accomplish this, we propose a new operator, based on XPath and inspired by the syntax of XQuery, which allows us to express complex mining tasks, compactly and intuitively. The operator can indifferently (and simultaneously) target both the content and the structure of the data, since the distinction in XML is slight.