Selecting a characteristic set of reviews

Authors:
Theodoros Lappas;Mark Crovella;Evimaria Terzi
Affiliations:
Boston University, Boston, MA, USA;Boston University, Boston, MA, USA;Boston University, Boston, MA, Greece
Venue:
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2012

Citing 16
Cited 1

Computers and Intractability: A Guide to the Theory of NP-Completeness

Computers and Intractability: A Guide to the Theory of NP-Completeness
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Movie review mining and summarization

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Utility scoring of product reviews

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Designing novel review ranking systems: predicting the usefulness and impact of reviews

Proceedings of the ninth international conference on Electronic commerce
Opinion spam and analysis

WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
A holistic lexicon-based approach to opinion mining

WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Gather customer concerns from online product reviews - A text summarization approach

Expert Systems with Applications: An International Journal
Modeling and Predicting the Helpfulness of Online Reviews

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
A Wavelet Tour of Signal Processing, Third Edition: The Sparse Way

A Wavelet Tour of Signal Processing, Third Edition: The Sparse Way
Multi-facet Rating of Product Reviews

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Automatically assessing review helpfulness

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Mining user reviews: from specification to summarization

ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
Exploiting social context for review quality prediction

Proceedings of the 19th international conference on World wide web
Efficient confident search in large review corpora

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Selecting a comprehensive set of reviews

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining

Using micro-reviews to select an efficient set of reviews

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

Online reviews provide consumers with valuable information that guides their decisions on a variety of fronts: from entertainment and shopping to medical services. Although the proliferation of online reviews gives insights about different aspects of a product, it can also prove a serious drawback: consumers cannot and will not read thousands of reviews before making a purchase decision. This need to extract useful information from large review corpora has spawned considerable prior work, but so far all have drawbacks. Review summarization (generating statistical descriptions of review sets) sacrifices the immediacy and narrative structure of reviews. Likewise, review selection (identifying a subset of 'helpful' or 'important' reviews) leads to redundant or non-representative summaries. In this paper, we fill the gap between existing review-summarization and review-selection methods by selecting a small subset of reviews that together preserve the statistical properties of the entire review corpus. We formalize this task as a combinatorial optimization problem and show that it NP-hard both tosolve and approximate. We also design effective algorithms that prove to work well in practice. Our experiments with real review corpora on different types of products demonstrate the utility of our methods, and our user studies indicate that our methods provide a better summary than prior approaches.