A Cascade Multiple Classifier System for Document Categorization

Authors:
Jian-Wu Xu;Vartika Singh;Venu Govindaraju;Depankar Neogi
Affiliations:
Copanion Inc., Andover, MA 01810, USA;Center for Unified Biometrics and Sensors, University at Buffalo, USA;Center for Unified Biometrics and Sensors, University at Buffalo, USA;Copanion Inc., Andover, MA 01810, USA
Venue:
MCS '09 Proceedings of the 8th International Workshop on Multiple Classifier Systems
Year:
2009

Abstract

This paper presents a novel cascade multiple classifier system (MCS) for document image classification. It consists of two classifiers that use different feature sets. The preceding classifier uses image features, learns the physical representation of the document, and outputs a set of candidate class labels for the second classifier. The succeeding classifier is a hierarchical classification model based on textual features. The candidate label set from the first classifier selects the subtrees of the hierarchical tree that the second classifier searches to derive a final classification decision. Hence, the cascade reduces the computational complexity and improves the classification accuracy of the second classifier. We test the proposed cascade MCS on a large-scale set of tax documents. The experimental results show improved classification performance over the individual classifiers.
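
The cascade described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the label hierarchy, the score dictionaries, the top-k candidate rule, and every function name below are hypothetical stand-ins, since the abstract does not specify the features, the classifiers, or how the candidate set is chosen. The sketch only shows the control flow, in which stage 1 proposes candidate labels and stage 2 descends the hierarchy while skipping subtrees that contain no candidate.

```python
from typing import Callable, Dict, List, Set

# Hypothetical label hierarchy (parent -> children); leaves are document classes.
HIERARCHY: Dict[str, List[str]] = {
    "root": ["income", "deduction"],
    "income": ["W-2", "1099-INT"],
    "deduction": ["1098", "charity-receipt"],
}


def top_k_candidates(image_scores: Dict[str, float], k: int) -> List[str]:
    """Stage 1 (assumed rule): keep the k labels the image classifier scores highest."""
    return sorted(image_scores, key=image_scores.get, reverse=True)[:k]


def leaves_under(node: str, hierarchy: Dict[str, List[str]]) -> Set[str]:
    """Return all leaf labels in the subtree rooted at node."""
    children = hierarchy.get(node, [])
    if not children:
        return {node}
    leaves: Set[str] = set()
    for child in children:
        leaves |= leaves_under(child, hierarchy)
    return leaves


def cascade_classify(
    image_scores: Dict[str, float],
    text_score: Callable[[str], float],
    k: int,
    hierarchy: Dict[str, List[str]] = HIERARCHY,
    root: str = "root",
) -> str:
    """Stage 2: descend the tree using text scores, visiting only candidate subtrees."""
    candidates = set(top_k_candidates(image_scores, k))
    if not candidates & leaves_under(root, hierarchy):
        raise ValueError("no candidate label appears in the hierarchy")
    node = root
    while hierarchy.get(node):
        # Prune children whose subtree holds no stage-1 candidate.
        viable = [c for c in hierarchy[node] if leaves_under(c, hierarchy) & candidates]
        node = max(viable, key=text_score)
    return node


# Made-up scores for one document.
image_scores = {"W-2": 0.5, "1099-INT": 0.1, "1098": 0.3, "charity-receipt": 0.1}
text_scores = {"income": 0.2, "deduction": 0.8, "W-2": 0.9,
               "1099-INT": 0.1, "1098": 0.3, "charity-receipt": 0.7}

print(cascade_classify(image_scores, text_scores.get, k=2))  # prints 1098
```

With k=2 the candidates are "W-2" and "1098", so under "deduction" the text-based stage never scores "charity-receipt", even though its made-up text score is higher. This is the pruning effect the abstract credits with reducing the second classifier's search.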