Theories for mutagenicity: a study in first-order and feature-based induction
Artificial Intelligence - Special volume on empirical methods
Explora: a multipattern and multistrategy discovery assistant
Advances in knowledge discovery and data mining
Bump hunting in high-dimensional data
Statistics and Computing
An Algorithm for Multi-relational Discovery of Subgroups
PKDD '97 Proceedings of the First European Symposium on Principles of Data Mining and Knowledge Discovery
Transformation-Based Learning Using Multirelational Aggregation
ILP '01 Proceedings of the 11th International Conference on Inductive Logic Programming
Subgroup Discovery with CN2-SD
The Journal of Machine Learning Research
Propositionalization-based relational subgroup discovery with RSD
Machine Learning
The Bump Hunting Method Using the Genetic Algorithm with the Extreme-Value Statistics
IEICE - Transactions on Information and Systems
Logical and Relational Learning: From ILP to MRDM (Cognitive Technologies)
Logical and Relational Learning: From ILP to MRDM (Cognitive Technologies)
The Journal of Machine Learning Research
kFOIL: learning simple relational kernels
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
On subgroup discovery in numerical domains
Data Mining and Knowledge Discovery
Fast Subgroup Discovery for Continuous Target Concepts
ISMIS '09 Proceedings of the 18th International Symposium on Foundations of Intelligent Systems
nFOIL: integrating Naïve Bayes and FOIL
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Fast learning of relational kernels
Machine Learning
Gaussian logic for predictive classification
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Hi-index | 0.00 |
We propose an approach to subgroup discovery in relational databases containing numerical attributes. The approach is based on detecting bumps in histograms constructed from substitution sets resulting from matching a first-order query against the input relational database. The approach is evaluated on seven data sets, discovering interpretable subgroups. The subgroups' rate of survival from the training split to the testing split varies among the experimental data sets, but at least on three of them it is very high.