Co-training for cross-lingual sentiment classification

Authors:
Xiaojun Wan
Affiliations:
Peking University, Beijing, China
Venue:
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Year:
2009

Citing 27
Cited 51

Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Text Classification from Labeled and Unlabeled Documents using EM

Machine Learning - Special issue on information retrieval
Transductive Inference for Text Classification using Support Vector Machines

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Email classification with co-training

CASCON '01 Proceedings of the 2001 conference of the Centre for Advanced Studies on Collaborative research
Opinion observer: analyzing and comparing opinions on the Web

WWW '05 Proceedings of the 14th international conference on World Wide Web
Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Applying co-training methods to statistical parsing

NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Weakly supervised natural language learning without redundant views

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
An EM Based Training Algorithm for Cross-Language Text Categorization

WI '05 Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence
Sentiment Classification for Movie Reviews in Chinese by Improved Semantic Oriented Approach

HICSS '06 Proceedings of the 39th Annual Hawaii International Conference on System Sciences - Volume 03
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Bootstrapping POS taggers using unlabelled data

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Ensemble methods for unsupervised WSD

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Deeper sentiment analysis using machine translation technology

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Determining the sentiment of opinions

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Recognizing contextual polarity in phrase-level sentiment analysis

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Co-clustering based classification for out-of-domain documents

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A two-stage approach to domain adaptation for statistical classifiers

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Can chinese web pages be classified with english data source?

Proceedings of the 17th international conference on World Wide Web
Topic-bridged PLSA for cross-domain text classification

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Multilingual subjectivity analysis using machine translation

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Using bilingual knowledge and ensemble techniques for unsupervised Chinese sentiment analysis

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Transferring naive bayes classifiers for text classification

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Domain adaptation for statistical classifiers

Journal of Artificial Intelligence Research
Using emoticons to reduce dependency in machine learning techniques for sentiment classification

ACLstudent '05 Proceedings of the ACL Student Research Workshop
Cross language text categorization by acquiring multilingual domain models from comparable corpora

ParaText '05 Proceedings of the ACL Workshop on Building and Using Parallel Texts

Employing personal/impersonal views in supervised and semi-supervised sentiment classification

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Evaluating multilanguage-comparability of subjectivity analysis systems

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Cross-language text classification using structural correspondence learning

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Cross lingual adaptation: an experiment on sentiment classifications

ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
Sentiment translation through lexicon induction

ACLstudent '10 Proceedings of the ACL 2010 Student Research Workshop
Holistic sentiment analysis across languages: multilingual supervised latent Dirichlet allocation

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Cross language text classification by model translation and semi-supervised learning

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Multilingual subjectivity: are more languages better?

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Sentiment classification and polarity shifting

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Document sentiment classification by exploring description model of topical terms

Computer Speech and Language
Using information from the target language to improve crosslingual text classification

IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
A vector space model for subjectivity classification in Urdu aided by co-training

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Sentiment translation through multi-edge graphs

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Active deep networks for semi-supervised sentiment classification

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Latent sentiment model for weakly-supervised cross-lingual sentiment classification

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Joint bilingual sentiment classification with unlabeled parallel corpora

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Is machine translation ripe for cross-lingual sentiment classification?

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Cross-language web page classification via dual knowledge transfer using nonnegative matrix tri-factorization

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Cross-lingual sentiment classification via bi-view non-negative matrix tri-factorization

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
Collaborative data cleaning for sentiment classification with noisy training corpus

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
Knowledge transfer across multilingual corpora via latent topics

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
Cross-Lingual Adaptation Using Structural Correspondence Learning

ACM Transactions on Intelligent Systems and Technology (TIST)
Sentiment analysis with a multilingual pipeline

WISE'11 Proceedings of the 12th international conference on Web information system engineering
Language-independent sentiment classification using three common words

Proceedings of the 20th ACM international conference on Information and knowledge management
Bilingual co-training for sentiment classification of chinese product reviews

Computational Linguistics
Creating sentiment dictionaries via triangulation

WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
Developing robust models for favourability analysis

WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
Instance level transfer learning for cross lingual opinion analysis

WASSA '11 Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis
Generating syntactic tree templates for feature-based opinion mining

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
Cross-lingual text classification with model translation and document translation

Proceedings of the 50th Annual Southeast Regional Conference
Semi-supervised learning for imbalanced sentiment classification

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Learning to identify review spam

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Active learning for cross language text categorization

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Transverse subjectivity classification

Proceedings of the First International Workshop on Issues of Sentiment Discovery and Opinion Mining
Creating sentiment dictionaries via triangulation

Decision Support Systems
On developing robust models for favourability analysis: Model choice, feature sets and imbalanced data

Decision Support Systems
Cross-lingual genre classification

EACL '12 Proceedings of the Student Research Workshop at the 13th Conference of the European Chapter of the Association for Computational Linguistics
Cross-lingual mixture model for sentiment classification

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Active learning for imbalanced sentiment classification

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Using similes to extract basic sentiments across languages

WISM'12 Proceedings of the 2012 international conference on Web Information Systems and Mining
Lost in translation: viability of machine translation for cross language sentiment analysis

CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
Combining supervised and unsupervised polarity classification for non-english reviews

CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
Sentiment polarity detection in Spanish reviews combining supervised and unsupervised approaches

Expert Systems with Applications: An International Journal
Cross-lingual web spam classification

Proceedings of the 22nd international conference on World Wide Web companion
Co-training over domain-independent and domain-dependent features for sentiment analysis of an online cancer support community

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Cross-lingual polarity detection with machine translation

Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining
Adaptive co-training SVM for sentiment classification on tweets

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Preface: Computational approaches to subjectivity and sentiment analysis: Present and envisaged methods and applications

Computer Speech and Language
Sense-level subjectivity in a multilingual setting

Computer Speech and Language
Comparative experiments using supervised learning and machine translation for multilingual sentiment analysis

Computer Speech and Language
Fuzzy deep belief networks for semi-supervised sentiment classification

Neurocomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

The lack of Chinese sentiment corpora limits the research progress on Chinese sentiment classification. However, there are many freely available English sentiment corpora on the Web. This paper focuses on the problem of cross-lingual sentiment classification, which leverages an available English corpus for Chinese sentiment classification by using the English corpus as training data. Machine translation services are used for eliminating the language gap between the training set and test set, and English features and Chinese features are considered as two independent views of the classification problem. We propose a cotraining approach to making use of unlabeled Chinese data. Experimental results show the effectiveness of the proposed approach, which can outperform the standard inductive classifiers and the transductive classifiers.