Knowledge evaluation: statistical evaluations

Authors:
David D. Jensen
Affiliations:
Research Assistant, Professor of Computer Science, University of Massachusetts, Amherst
Venue:
Handbook of data mining and knowledge discovery
Year:
2002

Citing 28
Cited 0

Randomization tests

Randomization tests
Simplifying decision trees

International Journal of Man-Machine Studies - Special Issue: Knowledge Acquisition for Knowledge-based Systems. Part 5
Decision trees and multi-valued attributes

Machine intelligence 11
An ounce of knowledge is worth a ton of data: quantitative studies of the trade-off between expertise and data based on statistically well-founded empirical induction

Proceedings of the sixth international workshop on Machine learning
Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems

Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems
Instance-Based Learning Algorithms

Machine Learning
Statistical significance in inductive learning

ECAI '92 Proceedings of the 10th European conference on Artificial intelligence
C4.5: programs for machine learning

C4.5: programs for machine learning
Overfitting Avoidance as Bias

Machine Learning
Induction with randomization testing: decision-oriented analysis of large data sets

Induction with randomization testing: decision-oriented analysis of large data sets
Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Learning decision lists using homogeneous rules

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Empirical methods for artificial intelligence

Empirical methods for artificial intelligence
Overfitting and undercomputing in machine learning

ACM Computing Surveys (CSUR)
Wrappers for feature subset selection

Artificial Intelligence - Special issue on relevance
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss

Machine Learning - Special issue on learning with probabilistic representations
Glossary of Terms

Machine Learning - Special issue on applications of machine learning and the knowledge discovery process
Multiple Comparisons in Induction Algorithms

Machine Learning
Statistical Themes and Lessons for Data Mining

Data Mining and Knowledge Discovery
On Comparing Classifiers: Pitfalls toAvoid and a Recommended Approach

Data Mining and Knowledge Discovery
Learning Logical Definitions from Relations

Machine Learning
The CN2 Induction Algorithm

Machine Learning
The Effects of Training Set Size on Decision Tree Complexity

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Using a Permutation Test for Attribute Selection in Decision Trees

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
The Case against Accuracy Estimation for Comparing Induction Algorithms

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Oversearching and layered search in empirical learning

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Lookahead and pathology in decision tree induction

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
A study of cross-validation and bootstrap for accuracy estimation and model selection

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

Quantified Score

Hi-index	0.00

Visualization

Abstract

This article introduces basic features of error estimators, including bias, variance, and loss functions. It outlines the logic behind classical hypothesis tests and explains the special challenges faced by knowledge discovery algorithms that search large model spaces. It discusses the statistical effects of multiple comparison procedures (MCPs), and several methods to adjust for those effects, including mathematical adjustments, cross-validation, and randomization tests. Finally, it outlines the basic concepts behind overfitting reduction and pruning.