Improving the quality of predictions using textual information in online user reviews

Authors:
Gayatree Ganu;Yogesh Kakodkar;AméLie Marian
Affiliations:
Department of Computer Science, Rutgers University, 110 Frelinghuysen Road, Piscataway, NJ 08854, United States;Department of Computer Science, Rutgers University, 110 Frelinghuysen Road, Piscataway, NJ 08854, United States;Department of Computer Science, Rutgers University, 110 Frelinghuysen Road, Piscataway, NJ 08854, United States
Venue:
Information Systems
Year:
2013

Citing 24
Cited 1

GroupLens: an open architecture for collaborative filtering of netnews

CSCW '94 Proceedings of the 1994 ACM conference on Computer supported cooperative work
Unsupervised document classification using sequential information maximization

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Amazon.com Recommendations: Item-to-Item Collaborative Filtering

IEEE Internet Computing
A Similarity-Based Soft Clustering Algorithm for Documents

DASFAA '01 Proceedings of the 7th International Conference on Database Systems for Advanced Applications
Unsupervised Image Clustering Using the Information Bottleneck Method

Proceedings of the 24th DAGM Symposium on Pattern Recognition
Mining the peanut gallery: opinion extraction and semantic classification of product reviews

WWW '03 Proceedings of the 12th international conference on World Wide Web
Evaluating collaborative filtering recommender systems

ACM Transactions on Information Systems (TOIS)
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
A support vector method for multivariate performance measures

ICML '05 Proceedings of the 22nd international conference on Machine learning
Rapid Identification of Column Heterogeneity

ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Show me the money!: deriving the pricing power of product features by mining consumer reviews

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Lessons from the Netflix prize challenge

ACM SIGKDD Explorations Newsletter - Special issue on visual analytics
Factorization meets the neighborhood: a multifaceted collaborative filtering model

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights

ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Matrix factorization and neighbor based algorithms for the netflix prize problem

Proceedings of the 2008 ACM conference on Recommender systems
Rated aspect summarization of short comments

Proceedings of the 18th international conference on World wide web
Matrix Factorization Techniques for Recommender Systems

Computer
A study of cross-validation and bootstrap for accuracy estimation and model selection

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Combining predictions for accurate recommender systems

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
An unsupervised aspect-sentiment model for online reviews

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Performance of recommender algorithms on top-n recommendation tasks

Proceedings of the fourth ACM conference on Recommender systems
Learning Attribute-to-Feature Mappings for Cold-Start Recommendations

ICDM '10 Proceedings of the 2010 IEEE International Conference on Data Mining
Collaborative topic modeling for recommending scientific articles

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Empirical analysis of predictive algorithms for collaborative filtering

UAI'98 Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence

Clustering a very large number of textual unstructured customers' reviews in english

AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Online reviews are often accessed by users deciding to buy a product, see a movie, or go to a restaurant. However, most reviews are written in a free-text format, usually with very scant structured metadata information and are therefore difficult for computers to understand, analyze, and aggregate. Users then face the daunting task of accessing and reading a large quantity of reviews to discover potentially useful information. We identified topical and sentiment information from free-form text reviews, and use this knowledge to improve user experience in accessing reviews. Specifically, we focus on improving recommendation accuracy in a restaurant review scenario. We propose methods to derive a text-based rating from the body of the reviews. We then group similar users together using soft clustering techniques based on the topics and sentiments that appear in the reviews. Our results show that using textual information results in better review score predictions than those derived from the coarse numerical star ratings given by the users. In addition, we use our techniques to make fine-grained predictions of user sentiments towards the individual topics covered in reviews with good accuracy.