Case studies: Commercial, multiple mining tasks systems: Kepler and Descartes

  • Authors:
  • Stefan Wrobel;Gennady Andrienko;Natalia Andrienko;Andrea Lüthje

  • Affiliations:
  • Professor of Computer Science, Otto-von-Guericke-Universität Magdeburg, Germany;Researcher, Autonomous Intelligent Systems Institute, Sankt Augustin, Germany;Researcher, Autonomous Intelligent Systems Institute, Sankt Augustin, Germany;Managing Director, Dialogis GmbH, Bonn, Germany

  • Venue:
  • Handbook of data mining and knowledge discovery
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Kepler is an extensible data mining platform that supports the entire knowledge discovery process from data access and preparation to analysis and visualization. One of its particular strengths is its open plug-in architecture, which allows third-party developers to easily integrate analysis tools and to import formats or preprocessing operators without the need to re-implement existing software. A large number of popular analysis algorithms can be used as Kepler plug-ins, including such classics as regression, decision trees, association rules, and clustering, as well as instance-based methods, Bayesian approaches, and subgroup discovery. Furthermore, Kepler is able to work with data that is stored in more than one table. Foreign links can be defined and used by several analysis techniques. For most of the tasks mentioned above, both single-relational and multirelational plug-ins are available. Kepler is scriptable, and thus, a good workbench for analysts and developers. Kepler employs a Java client and features a three-tier architecture that links to relational databases. The architecture allows specialized vertical data mining solutions to be constructed in domains such as marketing/finance, electronic commerce, and science/engineering. The analysis of geographically referenced data is now also possible through a link to the Descartes interactive geographical data exploration environment.