Towards a general framework for data mining

Authors:
Sašo Džeroski
Affiliations:
Jožef Stefan Institute, Ljubljana, Slovenia
Venue:
KDID'06 Proceedings of the 5th international conference on Knowledge discovery in inductive databases
Year:
2006

Citing 48
Cited 11

Foundations of logic programming; (2nd extended ed.)

Foundations of logic programming; (2nd extended ed.)
Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
A database perspective on knowledge discovery

Communications of the ACM
Clausal Discovery

Machine Learning - special issue on inductive logic programming
From data mining to knowledge discovery: an overview

Advances in knowledge discovery and data mining
Principles of data mining

Principles of data mining
A polynomial time computable metric between point sets

Acta Informatica
The Haskell: The Craft of Functional Programming

The Haskell: The Craft of Functional Programming
Feature Extraction, Construction and Selection: A Data Mining Perspective

Feature Extraction, Construction and Selection: A Data Mining Perspective
Multi-Objective Optimization Using Evolutionary Algorithms

Multi-Objective Optimization Using Evolutionary Algorithms
Relational Data Mining

Relational Data Mining
Inductive Logic Programming: Techniques and Applications

Inductive Logic Programming: Techniques and Applications
Data Structures and Algorithms

Data Structures and Algorithms
Levelwise Search and Borders of Theories in KnowledgeDiscovery

Data Mining and Knowledge Discovery
Discovery of frequent DATALOG patterns

Data Mining and Knowledge Discovery
Feature construction with Inductive Logic Programming: A Study of Quantitative Predictions of Biological Activity Aided by Structural Attributes

Data Mining and Knowledge Discovery
Bump hunting in high-dimensional data

Statistics and Computing
A perspective view and survey of meta-learning

Artificial Intelligence Review
Building Decision Trees with Constraints

Data Mining and Knowledge Discovery
Top-Down Induction of Clustering Trees

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Clustering with Instance-level Constraints

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
The 3W Model and Algebra for Unified Data Mining

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Data Mining as Constraint Logic Programming

Computational Logic: Logic Programming and Beyond, Essays in Honour of Robert A. Kowalski, Part II
A perspective on inductive databases

ACM SIGKDD Explorations Newsletter
Data mining tasks and methods: Subgroup discovery: deviation analysis

Handbook of data mining and knowledge discovery
Logic and Learning

Logic and Learning
A survey of kernels for structured data

ACM SIGKDD Explorations Newsletter
Kernel Methods for Pattern Analysis

Kernel Methods for Pattern Analysis
Summary from the KDD-03 panel: data mining: the next 10 years

ACM SIGKDD Explorations Newsletter
Subgroup Discovery with CN2-SD

The Journal of Machine Learning Research
Models for machine learning and data mining in functional programming

Journal of Functional Programming
Clustering Aggregation

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Data Mining: Concepts and Techniques

Data Mining: Concepts and Techniques
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
Expressive power of an algebra for data mining

ACM Transactions on Database Systems (TODS)
Constraint-Based Mining and Inductive Databases: European Workshop on Inductive Databases and Constraint Based Mining, Hinterzarten, Germany, March 11-13, ... / Lecture Notes in Artificial Intelligence)

Constraint-Based Mining and Inductive Databases: European Workshop on Inductive Databases and Constraint Based Mining, Hinterzarten, Germany, March 11-13, ... / Lecture Notes in Artificial Intelligence)
What are the grand challenges for data mining?: KDD-2006 panel report

ACM SIGKDD Explorations Newsletter
Mining optimal decision trees from itemset lattices

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A Wavelet Tour of Signal Processing, Third Edition: The Sparse Way

A Wavelet Tour of Signal Processing, Third Edition: The Sparse Way
Integrating pattern mining in relational databases

PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Kernels on lists and sets over relational algebra: an application to classification of protein fingerprints

PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Interestingness is not a dichotomy: introducing softness in constrained pattern mining

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
A survey on condensed representations for frequent sets

Proceedings of the 2004 European conference on Constraint-Based Mining and Inductive Databases
Inductive queries on polynomial equations

Proceedings of the 2004 European conference on Constraint-Based Mining and Inductive Databases
Data mining in inductive databases

KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases
Inductive databases in the relational model: the data as the bridge

KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases
Constraint based induction of multi-objective regression trees

KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases
Learning predictive clustering rules

KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases

Towards an Ontology of Data Mining Investigations

DS '09 Proceedings of the 12th International Conference on Discovery Science
Workflow construction for service-oriented knowledge discovery

ISoLA'10 Proceedings of the 4th international conference on Leveraging applications of formal methods, verification, and validation - Volume Part I
A relational view of pattern discovery

DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
Inductive databases and constraint-based data mining

ICFCA'11 Proceedings of the 9th international conference on Formal concept analysis
An information theoretic framework for data mining

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Predicting structured outputs k-nearest neighbours method

DS'11 Proceedings of the 14th international conference on Discovery science
Towards an algebraic framework for querying inductive databases

DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
A unified framework for heterogeneous patterns

Information Systems
On analyzing process compliance in skin cancer treatment: an experience report from the evidence-based medical compliance cluster (EBMC2)

CAiSE'12 Proceedings of the 24th international conference on Advanced Information Systems Engineering
Tree ensembles for predicting structured outputs

Pattern Recognition
Learning with configurable operators and RL-based heuristics

NFMCP'12 Proceedings of the First international conference on New Frontiers in Mining Complex Patterns

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we address the ambitious task of formulating a general framework for data mining. We discuss the requirements that such a framework should fulfill: It should elegantly handle different types of data, different data mining tasks, and different types of patterns/models. We also discuss data mining languages and what they should support: this includes the design and implementation of data mining algorithms, as well as their composition into nontrivial multistep knowledge discovery scenarios relevant for practical application. We proceed by laying out some basic concepts, starting with (structured) data and generalizations (e.g., patterns and models) and continuing with data mining tasks and basic components of data mining algorithms (i.e., refinement operators, distances, features and kernels). We next discuss how to use these concepts to formulate constraint-based data mining tasks and design generic data mining algorithms. We finally discuss how these components would fit in the overall framework and in particular into a language for data mining and knowledge discovery.