An introduction to database systems: vol. I (4th ed.)
An introduction to database systems: vol. I (4th ed.)
Molecular feature mining in HIV data
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
A polynomial time computable metric between point sets
Acta Informatica
An Extension to SQL for Mining Association Rules
Data Mining and Knowledge Discovery
MSQL: A Query Language for Database Mining
Data Mining and Knowledge Discovery
Learning Logical Definitions from Relations
Machine Learning
Data Mining with SQL Server 2005
Data Mining with SQL Server 2005
Inductive databases in the relational model: the data as the bridge
KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases
SINDBAD and SiQL: An Inductive Database and Query Language in the Relational Model
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
A product control system using the cellular data system
CIMMACS'09 Proceedings of the 8th WSEAS International Conference on Computational intelligence, man-machine systems and cybernetics
Towards an algebraic framework for querying inductive databases
DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
Hi-index | 0.00 |
In the demonstration, we will present the concepts and an implementation of an inductive database -- as proposed by Imielinski and Mannila -- in the relational model. The goal is to support all steps of the knowledge discovery process, from pre-processing via data mining to post-processing, on the basis of queries to a database system. The query language SIQL (structured inductive query language), an SQL extension, offers query primitives for feature selection, discretization, pattern mining, clustering, instance-based learning and rule induction. A prototype system processing such queries was implemented as part of the SINDBAD (structured inductive database development) project. Key concepts of this system, among others, are the closure of operators and distances between objects. To support the analysis of multi-relational data, we incorporated multi-relational distance measures based on set distances and recursive descent. The inclusion of rule-based classification models made it necessary to extend the data model and the software architecture significantly. The prototype is applied to three different applications: gene expression analysis, gene regulation prediction and structure-activity relationships (SARs) of small molecules.