Class-based n-gram models of natural language
Computational Linguistics
Text Classification from Labeled and Unlabeled Documents using EM
Machine Learning - Special issue on information retrieval
Constrained K-means Clustering with Background Knowledge
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
An Information-Theoretic Definition of Similarity
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
The Journal of Machine Learning Research
Automatic retrieval and clustering of similar words
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Distributional clustering of English words
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Mining and summarizing customer reviews
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Measures of distributional similarity
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Opinion observer: analyzing and comparing opinions on the Web
WWW '05 Proceedings of the 14th international conference on World Wide Web
Measuring semantic similarity in the taxonomy of WordNet
ACSC '05 Proceedings of the Twenty-eighth Australasian conference on Computer Science - Volume 38
Extracting knowledge from evaluative text
Proceedings of the 3rd international conference on Knowledge capture
A web-based kernel function for measuring the similarity of short text snippets
Proceedings of the 15th international conference on World Wide Web
Pattern Recognition and Machine Learning (Information Science and Statistics)
Pattern Recognition and Machine Learning (Information Science and Statistics)
Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)
Novel association measures using web search with double checking
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Extracting product features and opinions from reviews
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Topic sentiment mixture: modeling facets and opinions in weblogs
Proceedings of the 16th international conference on World Wide Web
Measuring semantic similarity between words using web search engines
Proceedings of the 16th international conference on World Wide Web
A Graph Modeling of Semantic Similarity between Words
ICSC '07 Proceedings of the International Conference on Semantic Computing
Modeling online reviews with multi-grain topic models
Proceedings of the 17th international conference on World Wide Web
Opinion Mining and Sentiment Analysis
Foundations and Trends in Information Retrieval
Incorporating domain knowledge into topic modeling via Dirichlet Forest priors
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
OpinionMiner: a novel machine learning system for web opinion mining and extraction
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Topic identification for fine-grained opinion analysis
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Graph-based word clustering using a web search engine
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
A study on similarity and relatedness using distributional and WordNet-based approaches
NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Using information content to evaluate semantic similarity in a taxonomy
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Product feature categorization with multilevel latent semantic association
Proceedings of the 18th ACM conference on Information and knowledge management
Extracting opinions, opinion holders, and topics expressed in online news media text
SST '06 Proceedings of the Workshop on Sentiment and Subjectivity in Text
Phrase clustering for discriminative learning
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Web-scale distributional similarity and entity set expansion
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Information content measures of semantic similarity perform better without sense-tagged text
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Grouping product features using semi-supervised learning with soft-constraints
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Using pointwise mutual information to identify implicit features in customer reviews
ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
An upgrading feature-based opinion mining model on vietnamese product reviews
AMT'11 Proceedings of the 7th international conference on Active media technology
Diversifying search results of controversial queries
Proceedings of the 20th ACM international conference on Information and knowledge management
Expert Systems with Applications: An International Journal
An aspect-lexicon creation and evaluation tool for sentiment analysis researchers
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
A comparison study of clustering models for online review sentiment analysis
WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
Hi-index | 0.01 |
In sentiment analysis of product reviews, one important problem is to produce a summary of opinions based on product features/attributes (also called aspects). However, for the same feature, people can express it with many different words or phrases. To produce a useful summary, these words and phrases, which are domain synonyms, need to be grouped under the same feature group. Although several methods have been proposed to extract product features from reviews, limited work has been done on clustering or grouping of synonym features. This paper focuses on this task. Classic methods for solving this problem are based on unsupervised learning using some forms of distributional similarity. However, we found that these methods do not do well. We then model it as a semi-supervised learning problem. Lexical characteristics of the problem are exploited to automatically identify some labeled examples. Empirical evaluation shows that the proposed method outperforms existing state-of-the-art methods by a large margin.