Text classification for a large-scale taxonomy using dynamically mixed local and global models for a node

  • Authors:
  • Heung-Seon Oh;Yoonjung Choi;Sung-Hyon Myaeng

  • Affiliations:
  • Department of Computer Science, Korea Advanced Institute of Science and Technology;Department of Computer Science, Korea Advanced Institute of Science and Technology;Department of Computer Science, Korea Advanced Institute of Science and Technology

  • Venue:
  • ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Hierarchical text classification for a large-scale Web taxonomy is challenging because the number of categories hierarchically organized is large and the training data for deep categories are usually sparse. It's been shown that a narrow-down approach involving a search of the taxonomical tree is an effective method for the problem. A recent study showed that both local and global information for a node is useful for further improvement. This paper introduces two methods for mixing local and global models dynamically for individual nodes and shows they improve classification effectiveness by 5% and 30%, respectively, over and above the state-of-art method.