Multiple comparison procedures
Multiple comparison procedures
Mining association rules between sets of items in large databases
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Finding interesting rules from large sets of discovered association rules
CIKM '94 Proceedings of the third international conference on Information and knowledge management
Mining quantitative association rules in large relational tables
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Dynamic itemset counting and implication rules for market basket data
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Explora: a multipattern and multistrategy discovery assistant
Advances in knowledge discovery and data mining
Exploratory mining and pruning optimizations of constrained associations rules
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Efficiently mining long patterns from databases
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
A framework for measuring changes in data characteristics
PODS '99 Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Efficient mining of emerging patterns: discovering trends and differences
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Pruning and summarizing the discovered associations
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Detecting change in categorical data: mining contrast sets
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Levelwise Search and Borders of Theories in KnowledgeDiscovery
Data Mining and Knowledge Discovery
Beyond Market Baskets: Generalizing Association Rules to Dependence Rules
Data Mining and Knowledge Discovery
What Makes Patterns Interesting in Knowledge Discovery Systems
IEEE Transactions on Knowledge and Data Engineering
Finding Interesting Patterns Using User Expectations
IEEE Transactions on Knowledge and Data Engineering
Pincer Search: A New Algorithm for Discovering the Maximum Frequent Set
EDBT '98 Proceedings of the 6th International Conference on Extending Database Technology: Advances in Database Technology
Mining Surprising Patterns Using Temporal Description Length
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Fast Algorithms for Mining Association Rules in Large Databases
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Constraint-Based Rule Mining in Large, Dense Databases
ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Post-analysis of learned rules
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
Detecting Temporal Change in Event Sequences: An Application to Demographic Data
PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
Association Rules for Expressing Gradual Dependencies
PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
Collusion in the U.S. crop insurance program: applied data mining
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
On detecting differences between groups
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Mining Minimal Distinguishing Subsequence Patterns with Gap Constraints
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
A framework to support multiple query optimization for complex mining tasks
MDM '05 Proceedings of the 6th international workshop on Multimedia data mining: mining integrated media and complex data
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Identifying bridging rules between conceptual clusters
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
A probabilistic classifier system and its application in data mining
Evolutionary Computation
World Wide Web
Mining minimal distinguishing subsequence patterns with gap constraints
Knowledge and Information Systems
Using metarules to organize and group discovered association rules
Data Mining and Knowledge Discovery
Discovering Significant Patterns
Machine Learning
Mining statistically important equivalence classes and delta-discriminative emerging patterns
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Statistical change detection for multi-dimensional data
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Empirical likelihood confidence intervals for differences between two datasets with missing data
Pattern Recognition Letters
Cost-based query optimization for complex pattern mining on multiple databases
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Mining significant graph patterns by leap search
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
The Journal of Machine Learning Research
Data & Knowledge Engineering
Contrast Set Mining for Distinguishing Between Similar Diseases
AIME '07 Proceedings of the 11th conference on Artificial Intelligence in Medicine
Supporting Factors in Descriptive Analysis of Brain Ischaemia
AIME '07 Proceedings of the 11th conference on Artificial Intelligence in Medicine
Measures of Ruleset Quality Capable to Represent Uncertain Validity
ECSQARU '07 Proceedings of the 9th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty
Estimating confidence intervals for structural differences between contrast groups with missing data
Expert Systems with Applications: An International Journal
Mining class-bridge rules based on rough sets
Expert Systems with Applications: An International Journal
CSM-SD: Methodology for contrast set mining through subgroup discovery
Journal of Biomedical Informatics
Association Analysis Techniques for Bioinformatics Problems
BICoB '09 Proceedings of the 1st International Conference on Bioinformatics and Computational Biology
Measures of ruleset quality for general rules extraction methods
International Journal of Approximate Reasoning
Improved Comprehensibility and Reliability of Explanations via Restricted Halfspace Discretization
MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
The Needles-in-Haystack Problem
MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Cluster-grouping: from subgroup discovery to clustering
Machine Learning
Measuring the uncertainty of differences for contrasting groups
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Diverging patterns: discovering significant frequency change dissimilarities in large databases
Proceedings of the 18th ACM conference on Information and knowledge management
Interestingness of Association Rules Using Symmetrical Tau and Logistic Regression
AI '09 Proceedings of the 22nd Australasian Joint Conference on Advances in Artificial Intelligence
Engineering Applications of Artificial Intelligence
Causal difference detection using Bayesian networks
PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
Contrast set mining through subgroup discovery applied to brain ischaemina data
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
IR interface for contrasting multiple news sites
AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
A concise representation of association rules using minimal predictive rules
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Adverse drug reaction mining in pharmacovigilance data using formal concept analysis
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part III
Intelligent Data Analysis
IEEE Transactions on Fuzzy Systems
Interestingness measures for association rules based on statistical validity
Knowledge-Based Systems
Secure top-k subgroup discovery
PSDML'10 Proceedings of the international ECML/PKDD conference on Privacy and security issues in data mining and machine learning
Using constraints to generate and explore higher order discriminative patterns
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
COSINE: a vertical group difference approach to contrast set mining
Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
Direct local pattern sampling by efficient two-step random procedures
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
A multi-objective evolutionary approach for subgroup discovery
HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
GENCCS: a correlated group difference approach to contrast set mining
MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
Contrasting correlations by an efficient double-clique condition
MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
Constrained logistic regression for discriminative pattern mining
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Multiple hypothesis testing in pattern discovery
DS'11 Proceedings of the 14th international conference on Discovery science
CLAP: Collaborative pattern mining for distributed information systems
Decision Support Systems
Controlling false positives in association rule mining
Proceedings of the VLDB Endowment
PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Improving data quality by source analysis
Journal of Data and Information Quality (JDIQ)
Difference detection between two contrast sets
DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Mining bridging rules between conceptual clusters
Applied Intelligence
An algorithm for extracting rare concepts with concise intents
ICFCA'10 Proceedings of the 8th international conference on Formal Concept Analysis
MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Pruning derivative partial rules during impact rule discovery
PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Inductive querying for discovering subgroups and clusters
Proceedings of the 2004 European conference on Constraint-Based Mining and Inductive Databases
An algorithm for mining implicit itemset pairs based on differences of correlations
DS'05 Proceedings of the 8th international conference on Discovery Science
Multi-class correlated pattern mining
KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases
Expert Systems with Applications: An International Journal
Hunting for fraudsters in random forests
HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part I
An enhanced relevance criterion for more concise supervised pattern discovery
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Top-N minimization approach for indicative correlation change mining
MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
Contrast mining from interesting subgroups
Bisociative Knowledge Discovery
A pattern mining based integrative framework for biomarker discovery
Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
Explaining subgroups through ontologies
PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
A bayesian scoring technique for mining predictive and non-spurious rules
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Direct out-of-memory distributed parallel frequent pattern mining
Proceedings of the 2nd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
A study of subgroup discovery approaches for defect prediction
Information and Software Technology
Hi-index | 0.00 |
A fundamental task in data analysis is understanding the differences between several contrasting groups. These groups can represent different classes of objects, such as male or female students, or the same group over time, e.g. freshman students in 1993 through 1998. We present the problem of mining contrast sets: conjunctions of attributes and values that differ meaningfully in their distribution across groups. We provide a search algorithm for mining contrast sets with pruning rules that drastically reduce the computational complexity. Once the contrast sets are found, we post-process the results to present a subset that are surprising to the user given what we have already shown. We explicitly control the probability of Type I error (false positives) and guarantee a maximum error rate for the entire analysis by using Bonferroni corrections.