Extracting and ranking product features in opinion documents

Authors:
Lei Zhang; Bing Liu; Suk Hwan Lim; Eamonn O'Brien-Strain
Affiliations:
University of Illinois at Chicago; University of Illinois at Chicago; Hewlett-Packard Labs; Hewlett-Packard Labs
Venue:
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Year:
2010

Abstract

An important task of opinion mining is to extract people's opinions on features of an entity. For example, the sentence "I love the GPS function of Motorola Droid" expresses a positive opinion on the "GPS function" of the Motorola phone; "GPS function" is the feature. This paper focuses on mining features. Double propagation is a state-of-the-art technique for this problem. It works well for medium-size corpora, but for large and small corpora it can result in low precision and low recall. To deal with these two problems, two improvements based on part-whole and "no" patterns are introduced to increase recall. Feature ranking is then applied to the extracted feature candidates to improve the precision of the top-ranked candidates. Feature candidates are ranked by feature importance, which is determined by two factors: feature relevance and feature frequency. The problem is formulated as a bipartite graph, and the well-known web page ranking algorithm HITS is used to find important features and rank them high. Experiments on diverse real-life datasets show promising results.
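
The ranking step described in the abstract can be illustrated with a minimal sketch. It assumes a bipartite graph whose edges link feature indicators (for example an opinion word, a part-whole pattern, or a "no" pattern) to the feature candidates they extract, runs HITS on that graph, and treats the authority score of a candidate as its relevance. The function names, the toy edges, and the way relevance is combined with frequency (authority score times the log of one plus the frequency) are assumptions made for illustration; the abstract does not specify them.

```python
import math
from collections import defaultdict


def hits_bipartite(edges, iterations=50):
    """Run HITS on a bipartite graph given as (indicator, feature) pairs.

    Indicators act as hubs and feature candidates as authorities.
    Returns (hub_scores, authority_scores), each L2-normalized.
    """
    edges = sorted(set(edges))  # drop duplicate links, fix iteration order
    hub = {i: 1.0 for i, _ in edges}
    auth = {f: 1.0 for _, f in edges}

    for _ in range(iterations):
        # Authority update: sum of hub scores of the linking indicators.
        new_auth = defaultdict(float)
        for i, f in edges:
            new_auth[f] += hub[i]
        norm = math.sqrt(sum(v * v for v in new_auth.values())) or 1.0
        auth = {f: v / norm for f, v in new_auth.items()}

        # Hub update: sum of authority scores of the linked candidates.
        new_hub = defaultdict(float)
        for i, f in edges:
            new_hub[i] += auth[f]
        norm = math.sqrt(sum(v * v for v in new_hub.values())) or 1.0
        hub = {i: v / norm for i, v in new_hub.items()}

    return hub, auth


def rank_features(edges, freq):
    """Rank candidates by relevance (authority) and frequency.

    The combination auth * log(1 + freq) is an illustrative assumption.
    """
    _, auth = hits_bipartite(edges)
    scores = {f: a * math.log(1 + freq.get(f, 0)) for f, a in auth.items()}
    return sorted(scores, key=scores.get, reverse=True), scores


# Hypothetical toy data: which indicators extracted which candidates.
toy_edges = [
    ("opinion:great", "screen"),
    ("part-whole:phone", "screen"),
    ("no-pattern", "screen"),
    ("opinion:great", "battery"),
    ("part-whole:phone", "battery"),
    ("opinion:great", "day"),
]
toy_freq = {"screen": 10, "battery": 8, "day": 9}

ranking, scores = rank_features(toy_edges, toy_freq)
print(ranking)
```

In this toy graph, "day" is frequent but is linked to only one indicator, so its low authority score pushes it below "battery" despite the higher frequency. This mirrors the idea that frequency alone should not decide the ranking.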