MC-tree: Improving Bayesian anytime classification

Authors:
Philipp Kranen;Stephan Günnemann;Sergej Fries;Thomas Seidl
Affiliations:
Data management and data exploration group, RWTH Aachen University, Germany;Data management and data exploration group, RWTH Aachen University, Germany;Data management and data exploration group, RWTH Aachen University, Germany;Data management and data exploration group, RWTH Aachen University, Germany
Venue:
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Year:
2010

Citing 14
Cited 2

The EM algorithm for graphical association models with missing data

Computational Statistics & Data Analysis - Special issue dedicated to Toma´sˇ Havra´nek
A Tutorial on Support Vector Machines for Pattern Recognition

Data Mining and Knowledge Discovery
R-trees: a dynamic index structure for spatial searching

SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Induction of Decision Trees

Machine Learning
Anytime Interval-Valued Outputs for Kernel Machines: Fast Support Vector Machine Classification via Distance Geometry

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
Anytime Classification Using the Nearest Neighbor Algorithm with Applications to Stream Mining

ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Indexing density models for incremental learning and anytime classification on data streams

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Any time induction of decision trees: an iterative improvement approach

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Harnessing the strengths of anytime algorithms for constant data streams

Data Mining and Knowledge Discovery
Self-Adaptive Anytime Stream Clustering

ICDM '09 Proceedings of the 2009 Ninth IEEE International Conference on Data Mining
Estimating continuous distributions in Bayesian classifiers

UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence
Naive bayes classifiers that perform well with continuous variables

AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence

Precise anytime clustering of noisy sensor data with logarithmic complexity

Proceedings of the Fifth International Workshop on Knowledge Discovery from Sensor Data
BT*: an advanced algorithm for anytime classification

SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management

Quantified Score

Hi-index	0.00

Visualization

Abstract

In scientific databases large amounts of data are collected to create knowledge repositories for deriving new insights or planning further experiments. These databases can be used to train classifiers that later categorize new data tuples. However, the large amounts of data might yield a time consuming classification process, e.g. for nearest neighbors or kernel density estimators. Anytime classifiers bypass this drawback by being interruptible at any time while the quality of the result improves with higher time allowances. Interruptible classifiers are especially useful when newly arriving data has to be classified on demand, e.g. during a running experiment. A statistical approach to anytime classification has recently been proposed using Bayes classification on kernel density estimates. In this paper we present a novel data structure called MC-Tree (Multi-Class Tree) that significantly improves Bayesian anytime classification. The tree stores a hierarchy of mixture densities that represent objects from several classes. Data transformations are used during tree construction to optimize the condition of the tree with respect to multiple classes. Anytime classification is achieved through novel query dependent model refinement approaches that take the entropy of the current mixture components into account. We show in experimental evaluation that the MC-Tree outperforms previous approaches in terms of anytime classification accuracy.