Randomization tests
Multivariate analysis of variance for behavioural scientists
Multivariate analysis of variance for behavioural scientists
International Journal of Man-Machine Studies - Special Issue: Knowledge Acquisition for Knowledge-based Systems. Part 5
Decision trees and multi-valued attributes
Machine intelligence 11
Inferring decision trees using the minimum description length principle
Information and Computation
Learnability and the Vapnik-Chervonenkis dimension
Journal of the ACM (JACM)
Proceedings of the sixth international workshop on Machine learning
Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems
Statistical significance in inductive learning
ECAI '92 Proceedings of the 10th European conference on Artificial intelligence
Neural networks and the bias/variance dilemma
Neural Computation
Machine Learning
Induction with randomization testing: decision-oriented analysis of large data sets
Induction with randomization testing: decision-oriented analysis of large data sets
Superstitious learning and induction
Artificial Intelligence Review
Empirical methods for artificial intelligence
Empirical methods for artificial intelligence
Overfitting and undercomputing in machine learning
ACM Computing Surveys (CSUR)
On Comparing Classifiers: Pitfalls toAvoid and a Recommended Approach
Data Mining and Knowledge Discovery
Machine Learning
The Effects of Training Set Size on Decision Tree Complexity
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Improved use of continuous attributes in C4.5
Journal of Artificial Intelligence Research
Oversearching and layered search in empirical learning
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Lookahead and pathology in decision tree induction
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
A study of cross-validation and bootstrap for accuracy estimation and model selection
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Coordinating agent activities in knowledge discovery processes
WACC '99 Proceedings of the international joint conference on Work activities coordination and collaboration
Learning quantitative knowledge for multiagent coordination
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
ACM SIGKDD Explorations Newsletter
Molecular feature mining in HIV data
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
A Survey of Methods for Scaling Up Inductive Algorithms
Data Mining and Knowledge Discovery
The Role of Occam‘s Razor in Knowledge Discovery
Data Mining and Knowledge Discovery
Data Mining and Knowledge Discovery
A Statistical Theory for Quantitative Association Rules
Journal of Intelligent Information Systems
Model Complexity and Algorithm Selection in Classification
DS '02 Proceedings of the 5th International Conference on Discovery Science
Worst-Case Analysis of Rule Discovery
DS '01 Proceedings of the 4th International Conference on Discovery Science
Knowledge discovery in databases: the purpose, necessity, and challenges
Handbook of data mining and knowledge discovery
Data mining tasks and methods: scalability
Handbook of data mining and knowledge discovery
Knowledge evaluation: statistical evaluations
Handbook of data mining and knowledge discovery
Handbook of data mining and knowledge discovery
Industry: telecommunications network diagnosis
Handbook of data mining and knowledge discovery
Data snooping, dredging and fishing: the dark side of data mining a SIGKDD99 panel report
ACM SIGKDD Explorations Newsletter
Aggregation-based feature invention and relational concept classes
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Learning relational probability trees
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
An intelligent system for customer targeting: a data mining approach
Decision Support Systems
On the discovery of significant statistical quantitative rules
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Redundancy based feature selection for microarray data
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Efficient Feature Selection via Analysis of Relevance and Redundancy
The Journal of Machine Learning Research
Ordering and Finding the Best of K2 Supervised Learning Algorithms
IEEE Transactions on Pattern Analysis and Machine Intelligence
Feature subset selection bias for classification learning
ICML '06 Proceedings of the 23rd international conference on Machine learning
Segmenting Customers from Population to Individuals: Does 1-to-1 Keep Your Customers Forever?
IEEE Transactions on Knowledge and Data Engineering
A Dichotomic Search Algorithm for Mining and Learning in Domain-Specific Logics
Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences
Discovering Significant Patterns
Machine Learning
Argument based machine learning
Artificial Intelligence
On the chance accuracies of large collections of classifiers
Proceedings of the 25th international conference on Machine learning
A Statistical Approach to Incremental Induction of First-Order Hierarchical Knowledge Bases
ILP '08 Proceedings of the 18th international conference on Inductive Logic Programming
A Survey on Statistical Pattern Feature Extraction
ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
Active Feature-Value Acquisition
Management Science
Learning when training data are costly: the effect of class distribution on tree induction
Journal of Artificial Intelligence Research
Process-oriented estimation of generalization error
IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
Factors affecting automated syndromic surveillance
Artificial Intelligence in Medicine
Factors affecting automated syndromic surveillance
Artificial Intelligence in Medicine
Autocorrelation and linkage cause bias in evaluation of relational learners
ILP'02 Proceedings of the 12th international conference on Inductive logic programming
Robust weighted kernel logistic regression in imbalanced and rare events data
Computational Statistics & Data Analysis
Proceedings of the VLDB Endowment
Why is rule learning optimistic and how to correct it
ECML'06 Proceedings of the 17th European conference on Machine Learning
Beware the null hypothesis: critical value tables for evaluating classifiers
ECML'05 Proceedings of the 16th European conference on Machine Learning
A survey on feature extraction for pattern recognition
Artificial Intelligence Review
Generalised bottom-up pruning: A model level combination of decision trees
Expert Systems with Applications: An International Journal
Less biased measurement of feature selection benefits
SLSFS'05 Proceedings of the 2005 international conference on Subspace, Latent Structure and Feature Selection
A Dichotomic Search Algorithm for Mining and Learning in Domain-Specific Logics
Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences
Enhanced spatiotemporal relational probability trees and forests
Data Mining and Knowledge Discovery
Evolutionary computation for supervised learning
Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Data Mining and Knowledge Discovery
Hi-index | 0.00 |
A single mechanism is responsible for three pathologies ofinduction algorithms: attribute selection errors, overfitting, andoversearching. In each pathology, induction algorithms comparemultiple items based on scores from an evaluation function andselect the item with the maximum score. We call this amultiple comparison procedure (MCP). We analyze thestatistical properties of MCPs and show how failure to adjustfor these properties leads to the pathologies. We also discussapproaches that can control pathological behavior, includingBonferroni adjustment, randomization testing, andcross-validation.