Meta-learning Models for Automatic Textual Document Categorization

  • Authors:
  • Kwok-Yin Lai;Wai Lam

  • Affiliations:
  • -;-

  • Venue:
  • PAKDD '01 Proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

We investigate two meta-model approaches for the task of automatic textual document categorization. The first approach is the linear combination approach. Based on the idea of distilling the characteristics of how we estimate the merits of each component algorithm, we propose three different strategies for the linear combination approach. The linear combination approach makes use of limited knowledge in the training document set. To address this limitation, we propose the second meta-model approach, called Meta-learning Using Document Feature characteristics (MUDOF), which employs a meta-learning phase using document feature characteristics. Document feature characteristics, derived from the training document set, capture some inherent properties of a particular category. Extensive experiments have been conducted on a real-world document collection and satisfactory performance is obtained.