On the Complexity of Optimal Multisplitting

Authors:
Tapio Elomaa;Juho Rousu
Affiliations:
-;-
Venue:
ISMIS '00 Proceedings of the 12th International Symposium on Foundations of Intelligent Systems
Year:
2000

Citing 10
Cited 0

A linear-time algorithm for concave one-dimensional dynamic programming

Information Processing Letters
A Distance-Based Attribute Selection Measure for Decision Tree Induction

Machine Learning
Elements of information theory

Elements of information theory
Dynamic programming with convexity, concavity and sparsity

Theoretical Computer Science - Selected papers of the Combinatorial Pattern Matching School
On the Handling of Continuous-Valued Attributes in Decision Tree Generation

Machine Learning
General and Efficient Multisplitting of Numerical Attributes

Machine Learning
Induction of Decision Trees

Machine Learning
On Fast and Simple Algorithms for Finding Maximal Subarrays and Applications in Learning Theory

EuroCOLT '97 Proceedings of the Third European Conference on Computational Learning Theory
Generalizing Boundary Points

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Speeding Up the Search for Optimal Partitions

PKDD '99 Proceedings of the Third European Conference on Principles of Data Mining and Knowledge Discovery

Quantified Score

Hi-index	0.01

Visualization

Abstract

Dynamic programming has been studied extensively, e.g., in computational geometry and string matching. It has recently found a new application in the optimal multisplitting of numerical attribute value domains.We reflect the results obtained earlier to this problem and study whether they help to shed a new light on the inherent complexity of this time-critical subtask of machine learning and data mining programs. The concept of monotonicity has come up in earlier research. It helps to explain the different asymptotic time requirements of optimal multisplitting with respect to different attribute evaluation functions. As case studies we examine Training Set Error and Average Class Entropy functions. The former has a linear-time optimization algorithm, while the latter--like most well-known attribute evaluation functions--takes a quadratic time to optimize. It is shown that neither of them fulfills the strict monotonicity condition, but computing optimal Training Set Error values can be decomposed into monotone subproblems.