Symbolic methodology for numeric data mining

  • Authors:
  • Boris Kovalerchuk;Evgenii Vityaev

  • Affiliations:
  • (Correspd. Tel.: +1 509 963 1438/ Fax: +1 509 963 1449/ E-mail: borisk@cwu.edu) Central Washington University, Ellensburg, WA 98926-7520, USA;Institute of Mathematics, Russian Academy of Science, Novosibirsk, 630090, Russia

  • Venue:
  • Intelligent Data Analysis - Philosophies and Methodologies for Knowledge Discovery
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Currently statistical and artificial neural network methods dominate in data mining applications. Alternative relational (symbolic) data mining methods have shown their effectiveness in robotics, drug design, and other areas. Neural networks and decision tree methods have serious limitations in capturing relations that may have a variety of forms. Learning systems based on symbolic first-order logic (FOL) representations capture relations naturally. The learned regularities are understandable directly in domain terms that help to build a domain theory. This paper describes relational data mining methodology and develops it further for numeric data such as financial and spatial data. This includes (1) comparing the attribute-value representation with the relational representation, (2) defining a new concept of joint relational representations, (3) a process of their use, and the Discovery algorithm. This methodology handles uniformly the numerical and interval forecasting tasks as well as classification tasks. It is shown that Relational Data Mining (RDM) can handle multiple constrains, initial rules and background knowledge very naturally to reduce the search space in contrast with attribute-based data mining. Theoretical concepts are illustrated with examples from financial and image processing domains.