Using bag-of-concepts to improve the performance of support vector machines in text categorization

  • Authors: Magnus Sahlgren; Rickard Cöster
  • Affiliations: SICS, Kista, Sweden; SICS, Kista, Sweden
  • Venue: COLING '04 Proceedings of the 20th International Conference on Computational Linguistics
  • Year: 2004

Abstract

This paper investigates the use of concept-based representations for text categorization. We introduce a new approach to creating concept-based text representations and apply it to a standard text categorization collection. The representations are used as input to a Support Vector Machine classifier, and the results show that, for certain categories, concept-based representations constitute a viable supplement to word-based ones. We also demonstrate how the performance of the Support Vector Machine can be improved by combining representations.
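
To make the idea of combining representations concrete, here is a minimal sketch of one way a word-based and a concept-based document vector could be built and joined. The abstract does not say how the concept vectors are produced or how the two representations are combined, so everything below is an assumption for illustration: the toy `concept_vectors` table, the summing of per-word concept vectors, the simple concatenation in `combined_representation`, and all function names are hypothetical.

```python
from collections import Counter


def bag_of_words(tokens, vocabulary):
    """Word-based representation: one raw count per vocabulary word."""
    counts = Counter(tokens)
    return [float(counts[word]) for word in vocabulary]


def bag_of_concepts(tokens, concept_vectors, dim):
    """Concept-based representation: sum of the concept vectors of the
    document's words. Words without a concept vector are skipped.
    (Summation is an assumption; the abstract does not specify it.)"""
    doc_vector = [0.0] * dim
    for token in tokens:
        vector = concept_vectors.get(token)
        if vector is None:
            continue
        for i, value in enumerate(vector):
            doc_vector[i] += value
    return doc_vector


def combined_representation(tokens, vocabulary, concept_vectors, dim):
    """Concatenate the two representations into one feature vector
    (concatenation is an assumed combination strategy)."""
    return (bag_of_words(tokens, vocabulary)
            + bag_of_concepts(tokens, concept_vectors, dim))


# Hypothetical toy data: three vocabulary words, two concept dimensions.
vocabulary = ["bank", "loan", "river"]
concept_vectors = {
    "bank": [1.0, 0.5],
    "loan": [1.0, 0.0],
    "river": [0.0, 1.0],
}

features = combined_representation(
    ["bank", "loan", "bank"], vocabulary, concept_vectors, dim=2
)
print(features)  # [2.0, 1.0, 0.0, 3.0, 1.0]
```

A vector like `features` could then be passed, one per document, to any Support Vector Machine implementation; the classifier itself is left out of the sketch.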