Semi-supervised Collaborative Text Classification

  • Authors:
  • Rong Jin;Ming Wu;Rahul Sukthankar

  • Affiliations:
  • Michigan State University, East Lansing MI 48823, USA;Michigan State University, East Lansing MI 48823, USA;Intel Research Pittsburgh and Carnegie Mellon University, USA

  • Venue:
  • ECML '07 Proceedings of the 18th European conference on Machine Learning
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Most text categorization methods require text content of documents that is often difficult to obtain. We consider "Collaborative Text Categorization", where each document is represented by the feedback from a large number of users. Our study focuses on the semi-supervised case in which one key challenge is that a significant number of users have not rated any labeled document. To address this problem, we examine several semi-supervised learning methods and our empirical study shows that collaborative text categorization is more effective than content-based text categorization and the manifold regularization is more effective than other state-of-the-art semi-supervised learning methods.