Multiple Comparisons in Induction Algorithms

Authors:
David D. Jensen;Paul R. Cohen
Affiliations:
Experimental Knowledge Systems Laboratory, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610 USA. jensen@cs.umass.edu;Experimental Knowledge Systems Laboratory, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610 USA. cohen@cs.umass.edu
Venue:
Machine Learning
Year:
2000

Citing 25
Cited 50

Randomization tests

Randomization tests
Multivariate analysis of variance for behavioural scientists

Multivariate analysis of variance for behavioural scientists
Simplifying decision trees

International Journal of Man-Machine Studies - Special Issue: Knowledge Acquisition for Knowledge-based Systems. Part 5
Decision trees and multi-valued attributes

Machine intelligence 11
Inferring decision trees using the minimum description length principle

Information and Computation
Learnability and the Vapnik-Chervonenkis dimension

Journal of the ACM (JACM)
An ounce of knowledge is worth a ton of data: quantitative studies of the trade-off between expertise and data based on statistically well-founded empirical induction

Proceedings of the sixth international workshop on Machine learning
Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems

Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems
Statistical significance in inductive learning

ECAI '92 Proceedings of the 10th European conference on Artificial intelligence
Neural networks and the bias/variance dilemma

Neural Computation
Overfitting Avoidance as Bias

Machine Learning
Induction with randomization testing: decision-oriented analysis of large data sets

Induction with randomization testing: decision-oriented analysis of large data sets
The Importance of Attribute Selection Measures in Decision Tree Induction

Machine Learning
Superstitious learning and induction

Artificial Intelligence Review
Empirical methods for artificial intelligence

Empirical methods for artificial intelligence
Overfitting and undercomputing in machine learning

ACM Computing Surveys (CSUR)
On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach

Data Mining and Knowledge Discovery
An Empirical Comparison of Pruning Methods for Decision Tree Induction

Machine Learning
An Empirical Comparison of Selection Measures for Decision-Tree Induction

Machine Learning
Induction of Decision Trees

Machine Learning
The Effects of Training Set Size on Decision Tree Complexity

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Improved use of continuous attributes in C4.5

Journal of Artificial Intelligence Research
Oversearching and layered search in empirical learning

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Lookahead and pathology in decision tree induction

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
A study of cross-validation and bootstrap for accuracy estimation and model selection

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

Coordinating agent activities in knowledge discovery processes

WACC '99 Proceedings of the international joint conference on Work activities coordination and collaboration
Learning quantitative knowledge for multiagent coordination

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Understanding the crucial differences between classification and discovery of association rules: a position paper

ACM SIGKDD Explorations Newsletter
Molecular feature mining in HIV data

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
A Survey of Methods for Scaling Up Inductive Algorithms

Data Mining and Knowledge Discovery
The Role of Occam's Razor in Knowledge Discovery

Data Mining and Knowledge Discovery
Discovering Interesting Patterns for Investment Decision Making with GLOWER: A Genetic Learner Overlaid with Entropy Reduction

Data Mining and Knowledge Discovery
A Statistical Theory for Quantitative Association Rules

Journal of Intelligent Information Systems
Model Complexity and Algorithm Selection in Classification

DS '02 Proceedings of the 5th International Conference on Discovery Science
Worst-Case Analysis of Rule Discovery

DS '01 Proceedings of the 4th International Conference on Discovery Science
Knowledge discovery in databases: the purpose, necessity, and challenges

Handbook of data mining and knowledge discovery
Data mining tasks and methods: scalability

Handbook of data mining and knowledge discovery
Knowledge evaluation: statistical evaluations

Handbook of data mining and knowledge discovery
Machine learning

Handbook of data mining and knowledge discovery
Industry: telecommunications network diagnosis

Handbook of data mining and knowledge discovery
Data snooping, dredging and fishing: the dark side of data mining a SIGKDD99 panel report

ACM SIGKDD Explorations Newsletter
Aggregation-based feature invention and relational concept classes

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Learning relational probability trees

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
An intelligent system for customer targeting: a data mining approach

Decision Support Systems
On the discovery of significant statistical quantitative rules

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Redundancy based feature selection for microarray data

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Worst Case and a Distribution-Based Case Analyses of Sampling for Rule Discovery Based on Generality and Accuracy

Applied Intelligence
Efficient Feature Selection via Analysis of Relevance and Redundancy

The Journal of Machine Learning Research
Ordering and Finding the Best of K > 2 Supervised Learning Algorithms

IEEE Transactions on Pattern Analysis and Machine Intelligence
Feature subset selection bias for classification learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
Segmenting Customers from Population to Individuals: Does 1-to-1 Keep Your Customers Forever?

IEEE Transactions on Knowledge and Data Engineering
A Dichotomic Search Algorithm for Mining and Learning in Domain-Specific Logics

Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences
Discovering Significant Patterns

Machine Learning
Argument based machine learning

Artificial Intelligence
On the chance accuracies of large collections of classifiers

Proceedings of the 25th international conference on Machine learning
A Statistical Approach to Incremental Induction of First-Order Hierarchical Knowledge Bases

ILP '08 Proceedings of the 18th international conference on Inductive Logic Programming
A Survey on Statistical Pattern Feature Extraction

ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
Active Feature-Value Acquisition

Management Science
Learning when training data are costly: the effect of class distribution on tree induction

Journal of Artificial Intelligence Research
Process-oriented estimation of generalization error

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Factors affecting automated syndromic surveillance

Artificial Intelligence in Medicine
Autocorrelation and linkage cause bias in evaluation of relational learners

ILP'02 Proceedings of the 12th international conference on Inductive logic programming
Robust weighted kernel logistic regression in imbalanced and rare events data

Computational Statistics & Data Analysis
Behavior based record linkage

Proceedings of the VLDB Endowment
Why is rule learning optimistic and how to correct it

ECML'06 Proceedings of the 17th European conference on Machine Learning
Beware the null hypothesis: critical value tables for evaluating classifiers

ECML'05 Proceedings of the 16th European conference on Machine Learning
A survey on feature extraction for pattern recognition

Artificial Intelligence Review
Generalised bottom-up pruning: A model level combination of decision trees

Expert Systems with Applications: An International Journal
Less biased measurement of feature selection benefits

SLSFS'05 Proceedings of the 2005 international conference on Subspace, Latent Structure and Feature Selection
Enhanced spatiotemporal relational probability trees and forests

Data Mining and Knowledge Discovery
Evolutionary computation for supervised learning

Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
GA-TVRC-Het: genetic algorithm enhanced time varying relational classifier for evolving heterogeneous networks

Data Mining and Knowledge Discovery
Machine learning for targeted display advertising: transfer learning in action

Machine Learning

Abstract

A single mechanism is responsible for three pathologies of induction algorithms: attribute selection errors, overfitting, and oversearching. In each pathology, induction algorithms compare multiple items based on scores from an evaluation function and select the item with the maximum score. We call this a multiple comparison procedure (MCP). We analyze the statistical properties of MCPs and show how failure to adjust for these properties leads to the pathologies. We also discuss approaches that can control pathological behavior, including Bonferroni adjustment, randomization testing, and cross-validation.
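
As an illustration of the effect the abstract describes, the sketch below simulates a multiple comparison procedure in which none of the compared items has any real relationship to the target. It is not taken from the paper: the function names are hypothetical, the scores are assumed to be independent, and each item's p-value is assumed to be uniform on [0, 1] under the null hypothesis. Under those assumptions, selecting the best of 10 items at a per-item significance level of 0.05 produces a false positive roughly 40% of the time, while a Bonferroni-adjusted per-item level of 0.05 / 10 keeps the overall rate below 0.05.

```python
import random


def family_error_rate(alpha_per_item, n_items):
    """Probability that at least one of n independent null items
    passes a test run at significance level alpha_per_item."""
    return 1.0 - (1.0 - alpha_per_item) ** n_items


def bonferroni_alpha(alpha_family, n_items):
    """Per-item level that bounds the overall error rate by alpha_family."""
    return alpha_family / n_items


def simulate_best_item_error(n_items, alpha_per_item, trials=20000, seed=0):
    """Monte Carlo estimate of how often the best-scoring null item
    looks significant (assumes independent, uniform null p-values)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # Selecting the maximum score corresponds to taking the smallest p-value.
        best_p = min(rng.random() for _ in range(n_items))
        if best_p < alpha_per_item:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    n = 10
    unadjusted = 0.05
    adjusted = bonferroni_alpha(0.05, n)
    print("analytic, unadjusted:", family_error_rate(unadjusted, n))
    print("simulated, unadjusted:", simulate_best_item_error(n, unadjusted))
    print("analytic, Bonferroni:", family_error_rate(adjusted, n))
    print("simulated, Bonferroni:", simulate_best_item_error(n, adjusted))
```

Real evaluation scores in induction algorithms are often correlated, in which case the Bonferroni level is conservative; randomization testing and cross-validation, the other approaches named in the abstract, do not rely on the independence assumption made here.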