Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis

  • Authors:
  • Songbo Tan;Xueqi Cheng;Yuefen Wang;Hongbo Xu

  • Affiliations:
  • Key Laboratory of Network, Institute of Computing Technology, China;Key Laboratory of Network, Institute of Computing Technology, China;Information Center, Chinese Academy of Geological Sciences, China;Key Laboratory of Network, Institute of Computing Technology, China

  • Venue:
  • ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the community of sentiment analysis, supervised learning techniques have been shown to perform very well. When transferred to another domain, however, a supervised sentiment classifier often performs extremely bad. This is so-called domain-transfer problem. In this work, we attempt to attack this problem by making the maximum use of both the old-domain data and the unlabeled new-domain data. To leverage knowledge from the old-domain data, we proposed an effective measure, i.e., Frequently Co-occurring Entropy (FCE), to pick out generalizable features that occur frequently in both domains and have similar occurring probability. To gain knowledge from the new-domain data, we proposed Adapted Naïve Bayes (ANB), a weighted transfer version of Naive Bayes Classifier. The experimental results indicate that proposed approach could improve the performance of base classifier dramatically, and even provide much better performance than the transfer-learning baseline, i.e. the Naïve Bayes Transfer Classifier (NTBC).