Optimal Partitioning for Classification and Regression Trees

Authors:
Philip A. Chou
Affiliations:
-
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
1991

Citing 16
Cited 28

Algorithms for clustering data

Algorithms for clustering data
Inferring decision trees using the minimum description length principle

Information and Computation
Conversion of Limited-Entry Decision Tables to Optimal Computer Programs II: minimum storage requirement

Journal of the ACM (JACM)
Optimizing decision trees through heuristically guided search

Communications of the ACM
The synthetic approach to decision table conversion

Communications of the ACM
Information theory applied to the conversion of decision tables to computer programs

Communications of the ACM
Conversion of limited-entry decision tables to computer programs—a proposed modification to Pollack's algorithm

Communications of the ACM
Conversion of limited-entry decision tables to computer programs

Communications of the ACM
A procedure for converting logic table conditions into an efficient sequence of test instructions

Communications of the ACM
Linear Prediction of Speech

Linear Prediction of Speech
Computers and Intractability: A Guide to the Theory of NP-Completeness

Computers and Intractability: A Guide to the Theory of NP-Completeness
The CN2 Induction Algorithm

Machine Learning
An Empirical Comparison of Selection Measures for Decision-Tree Induction

Machine Learning
Induction of Decision Trees

Machine Learning
Induction over large data bases

Induction over large data bases
Applications of information theory to pattern recognition and the design of decision trees and trellises

Applications of information theory to pattern recognition and the design of decision trees and trellises

An Active Testing Model for Tracking Roads in Satellite Images

IEEE Transactions on Pattern Analysis and Machine Intelligence
Geometric decision trees for optical character recognition (extended abstract)

SCG '97 Proceedings of the thirteenth annual symposium on Computational geometry
A Deterministic Annealing Approach for Parsimonious Design of Piecewise Regression Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
Statistical Pattern Recognition: A Review

IEEE Transactions on Pattern Analysis and Machine Intelligence
Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey

Data Mining and Knowledge Discovery
Partitioning Nominal Attributes in Decision Trees

Data Mining and Knowledge Discovery
Discretization: An Enabling Technique

Data Mining and Knowledge Discovery
The Application of Semantic Classification Trees to Natural Language Understanding

IEEE Transactions on Pattern Analysis and Machine Intelligence
A General Measure of Rule Interestingness

PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
A pylonic decision-tree language model with optimal question selection

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Data structures for maintaining set partitions

Random Structures & Algorithms
Augmentation-based learning: combining observations and user edits for programming-by-demonstration

Proceedings of the 11th international conference on Intelligent user interfaces
Modelling lexical redundancy for machine translation

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
A note on data structures for maintaining bipartitions

Journal of Discrete Algorithms
A prediction model for success of services in e-commerce using decision tree: E-customer's attitude towards online service

Expert Systems with Applications: An International Journal
Reducing decision tree fragmentation through attribute value grouping: A comparative study

Intelligent Data Analysis
An Integrated Approach for Modeling Learning Patterns of Students in Web-Based Instruction: A Cognitive Style Perspective

ACM Transactions on Computer-Human Interaction (TOCHI)
Productivity improvement of manufacturing SMEs via technology innovation in Korea

AIKED'08 Proceedings of the 7th WSEAS International Conference on Artificial intelligence, knowledge engineering and data bases
Intelligent approach for effective management of governmental funds for small and medium enterprises

Expert Systems with Applications: An International Journal
Applying text and data mining techniques to forecasting the trend of petitions filed to e-People

Expert Systems with Applications: An International Journal
Inter mode selection for H.264/AVC using time-efficient learning-theoretic algorithms

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
How to design and utilize online customer center to support new product concept generation

Expert Systems with Applications: An International Journal
Social correlates of turn-taking style

Computer Speech and Language
A tone-modeling technique using a quantized F0 context to improve tone correctness in average-voice-based speech synthesis

Speech Communication
Internet Auction Fraud Detection Using Social Network Analysis and Classification Tree Approaches

International Journal of Electronic Commerce
On considering uncertainty and alternatives in low-level vision

UAI'93 Proceedings of the Ninth international conference on Uncertainty in artificial intelligence
Design of convergent product concepts based on functionality: An association rule mining and decision tree approach

Expert Systems with Applications: An International Journal
Evolving decision trees with beam search-based initialization and lexicographic multi-objective evaluation

Information Sciences: an International Journal

Quantified Score

Hi-index	0.15

Visualization

Abstract

An iterative algorithm that finds a locally optimal partition for an arbitrary loss function, in time linear in N for each iteration is presented. The algorithm is a K-means-like clustering algorithm that uses as its distance measure a generalization of Kullback's information divergence. Moreover, it is proven that the globally optimal partition must satisfy a nearest neighbour condition using divergence as the distance measure. These results generalize similar results of L. Breiman et al. (1984) to an arbitrary number of classes or regression variables and to an arbitrary number of bills. Experimental results on a text-to-speech example are provided and additional applications of the algorithm, including the design of variable combinations, surrogate splits, composite nodes, and decision graphs, are suggested.