Lazy decision trees

Authors:
Jerome H. Friedman
Affiliations:
Statistics Department and Stanford Linear Accelerator Center, Stanford University, Stanford, CA
Venue:
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
Year:
1996


Abstract

Lazy learning algorithms, exemplified by nearest-neighbor algorithms, do not induce a concise hypothesis from a given training set; the inductive process is delayed until a test instance is given. Algorithms for constructing decision trees, such as C4.5, ID3, and CART, create a single "best" decision tree during the training phase, and this tree is then used to classify test instances. The tests at the nodes of the constructed tree are good on average, but there may be better tests for classifying a specific instance. We propose a lazy decision tree algorithm, LAZYDT, that conceptually constructs the "best" decision tree for each test instance. In practice, only the path followed by the test instance needs to be constructed, and a caching scheme makes the algorithm fast. The algorithm is robust with respect to missing values without resorting to the complicated methods usually seen in induction of decision trees. Experiments on real and artificial problems are presented.
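
The idea of building only the path that a given test instance would follow can be illustrated with a minimal sketch. It is not the LAZYDT implementation: the function names (`entropy`, `lazy_classify`) are hypothetical, features are assumed to be discrete, the split score is simply the class entropy of the branch the test instance falls into (a simplification, not necessarily the paper's criterion), and no caching is done. Features whose value is missing in the test instance are never tested, which is one simple way such a classifier can sidestep missing values.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a non-empty list of class labels."""
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())


def lazy_classify(train, test, missing=None):
    """Classify `test` by growing only the path it would follow.

    train: non-empty list of (feature_tuple, label) pairs, discrete features.
    test:  feature tuple; entries equal to `missing` are never tested.
    Illustrative sketch only, not the authors' LAZYDT algorithm.
    """
    if not train:
        raise ValueError("training set must not be empty")
    rows = list(train)
    # Only features that are known for this test instance may be tested.
    unused = [i for i, v in enumerate(test) if v is not missing]
    while unused:
        if len({y for _, y in rows}) == 1:
            break  # the current node is pure
        best = None
        for i in unused:
            # Keep the training instances that go down the same branch as `test`.
            branch = [(x, y) for x, y in rows if x[i] == test[i]]
            if not branch or len(branch) == len(rows):
                continue  # the test is empty or does not separate anything
            score = entropy([y for _, y in branch])
            if best is None or score < best[0]:
                best = (score, i, branch)
        if best is None:
            break  # no remaining test is useful for this instance
        _, chosen, rows = best
        unused.remove(chosen)
    # Predict the majority class among the instances left on the path.
    return Counter(y for _, y in rows).most_common(1)[0][0]
```

For example, with training pairs such as `(("rain", "mild"), "yes")`, a call like `lazy_classify(train, ("rain", None))` tests only the first feature, because the second one is missing in the test instance.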