A Cascade Multiple Classifier System for Document Categorization

  • Authors:
  • Jian-Wu Xu;Vartika Singh;Venu Govindaraju;Depankar Neogi

  • Affiliations:
  • Copanion Inc., Andover, USA MA 01810;Center for Unified Biometrics and Sensors, University at Buffalo, USA;Center for Unified Biometrics and Sensors, University at Buffalo, USA;Copanion Inc., Andover, USA MA 01810

  • Venue:
  • MCS '09 Proceedings of the 8th International Workshop on Multiple Classifier Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

A novel cascade multiple classifier system (MCS) for document image classification is presented in the paper. It consists of two different classifiers with different feature sets. The proceeding classifier uses image features, learns physical representation of the document, and outputs a set of candidate class labels for the second classifier. The succeeding classifier is a hierarchical classification model based on textual features. The candidate labels set from the first classifier provides subtrees for the second classifier to search in the hierarchical tree and derive a final classification decision. Hence, it reduces the computational complexity and improves classification accuracy for the second classifier. We test the proposed cascade MCS on a large scale set of tax document classification. The experimental results show improvement of classification performance over individual classifiers.