C4.5: programs for machine learning
C4.5: programs for machine learning
Data mining: practical machine learning tools and techniques with Java implementations
Data mining: practical machine learning tools and techniques with Java implementations
Model selection for probabilistic clustering using cross-validatedlikelihood
Statistics and Computing
Making Better Use of Global Discretization
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Proportional k-Interval Discretization for Naive-Bayes Classifiers
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
Inference for the Generalization Error
Machine Learning
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Rule-based active sampling for learning to rank
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
treeKL: A distance between high dimension empirical distributions
Pattern Recognition Letters
UniDis: a universal discretization technique
Journal of Intelligent Information Systems
Hi-index | 0.00 |
This paper presents an unsupervised discretization method that performs density estimation for univariate data. The subintervals that the discretization produces can be used as the bins of a histogram. Histograms are a very simple and broadly understood means for displaying data, and our method automatically adapts bin widths to the data. It uses the log-likelihood as the scoring function to select cut points and the cross-validated log-likelihood to select the number of intervals. We compare this method with equal-width discretization where we also select the number of bins using the cross-validated log-likelihood and with equal-frequency discretization.