Extraction of representative keywords considering co-occurrence in positive documents

  • Authors:
  • Byeong-Man Kim;Qing Li;KwangHo Lee;Bo-Yeong Kang

  • Affiliations:
  • Kumoh National Institute of Technology, Korea;,Kumoh National Institute of Technology, Korea;Mokpo National University, Korea;Information and Communications University, Korea

  • Venue:
  • FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In linear text classification, user feedback is usually used to tune up the representative keywords (RK) for a certain class. Despite some algorithms (e.g. Rocchio) deal well with user positive and negative feedback to adjust the RKs, few researches have investigated how to adjust RKs only based on a small positive responses which is a popular case in the real-world application (e.g. users tend to click their interested URL). In this work, we describe a method of extracting representative keywords for a user from a small set of his positive feedback documents. Experiments on the Reuters-21578 collection illustrate that our approach is better than other two famous methods (Rocchio and Widrow-Hoff) with 24.8% and 14.5% improvement, respectively.