C4.5: programs for machine learning
C4.5: programs for machine learning
Classification by feature partitioning
Machine Learning
Machine Learning
Artificial Intelligence Review - Special issue on lazy learning
Artificial Intelligence Review - Special issue on lazy learning
Predictive data mining: a practical guide
Predictive data mining: a practical guide
Machine Learning
On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality
Data Mining and Knowledge Discovery
Systems for Knowledge Discovery in Databases
IEEE Transactions on Knowledge and Data Engineering
Locally Weighted Projection Regression: Incremental Real Time Learning in High Dimensional Space
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
An overview of regression techniques for knowledge discovery
The Knowledge Engineering Review
Rule-based machine learning methods for functional prediction
Journal of Artificial Intelligence Research
Further experimental evidence against the utility of Occam's razor
Journal of Artificial Intelligence Research
Particle swarm classification: A survey and positioning
Pattern Recognition
Hi-index | 0.00 |
A new instance-based learning method is presented for regression problems with high-dimensional data. As an instance-based approach, the conventional method, KNN, is very popular for classification. Although KNN performs well on classification tasks, it does not perform as well on regression problems. We have developed a new instance-based method, called Regression by Partitioning Feature Projections (RPFP) which is designed to meet the requirement for a lazy method that achieves high levels of accuracy on regression problems. RPFP gives better performance than well-known eager approaches found in machine learning and statistics such as MARS, rule-based regression, and regression tree induction systems. The most important property of RPFP is that it is a projection-based approach that can handle interactions. We show that it outperforms existing eager or lazy approaches on many domains when there are many missing values in the training data.