Combining multiple classifiers for text categorization

  • Authors:
  • Khalid Al-Kofahi;Alex Tyrrell;Arun Vachher;Tim Travers;Peter Jackson

  • Affiliations:
  • Thomson Legal & Regulatory, Rochester, NY;Thomson Legal & Regulatory, Rochester, NY;Thomson Legal & Regulatory, Rochester, NY;Thomson Legal & Regulatory, Rochester, NY;Thomson Legal & Regulatory, Eagan, MN

  • Venue:
  • Proceedings of the tenth international conference on Information and knowledge management
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

A major problem facing online information services is how to index and supplement large document collections with respect to a rich set of categories. We focus upon the routing of case law summaries to various secondary law volumes in which they should be cited. Given the large number ( 13,000) of closely related categories, this is a challenging task that is unlikely to succumb to a single algorithmic solution. Our fully implemented and recently deployed system shows that a superior classification engine for this task can be constructed from a combination of classifiers. The multi-classifier approach helps us leverage all the relevant textual features and meta data, and appears to generalize to related classification tasks.