Automatic scientific text classification using local patterns: KDD CUP 2002 (task 1)

  • Authors:
  • Moustafa M. Ghanem;Yike Guo;Huma Lodhi;Yong Zhang

  • Affiliations:
  • Imperial College of Science Technology & Medicine, London, UK;Imperial College of Science Technology & Medicine, London, UK;Imperial College of Science Technology & Medicine, London, UK;Imperial College of Science Technology & Medicine, London, UK

  • Venue:
  • ACM SIGKDD Explorations Newsletter
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we describe our approach for addressing Task 1 in the KDD CUP 2002 competition. The approach is based on developing and using an improved automatic feature selection method in conjunction with traditional classifiers. The feature selection method used is based on capturing frequently occurring keyword combinations (or motifs) within short segments of the text of a document and has proved to produce more accurate classification results than approaches relying solely on using keyword-based features.