Hidden factors and hidden topics: understanding rating dimensions with review text

Authors:
Julian McAuley;Jure Leskovec
Affiliations:
Stanford, Stanford, CA, USA;Stanford, Stanford, CA, USA
Venue:
Proceedings of the 7th ACM conference on Recommender systems
Year:
2013

Citing 23
Cited 1

Amazon.com Recommendations: Item-to-Item Collaborative Filtering

IEEE Internet Computing
Latent dirichlet allocation

The Journal of Machine Learning Research
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Determining the sentiment of opinions

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Extracting product features and opinions from reviews

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Opinion spam and analysis

WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Modeling online reviews with multi-grain topic models

Proceedings of the 17th international conference on World Wide Web
Graphical Models, Exponential Families, and Variational Inference

Foundations and Trends® in Machine Learning
Rated aspect summarization of short comments

Proceedings of the 18th international conference on World wide web
Multi-facet Rating of Product Reviews

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Matrix Factorization Techniques for Recommender Systems

Computer
Sentiment summarization: evaluating and learning user preferences

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Joint sentiment/topic model for sentiment analysis

Proceedings of the 18th ACM conference on Information and knowledge management
Latent aspect rating analysis on review text data: a rating regression approach

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
An unsupervised aspect-sentiment model for online reviews

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Jointly modeling aspects and opinions with a MaxEnt-LDA hybrid

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Aspect and sentiment unification model for online review analysis

Proceedings of the fourth ACM international conference on Web search and data mining
Collaborative topic modeling for recommending scientific articles

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Multi-aspect Sentiment Analysis with Topic Models

ICDMW '11 Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops
Finding a needle in a haystack of reviews: cold start context-based hotel recommender system

Proceedings of the sixth ACM conference on Recommender systems
Learning Attitudes and Attributes from Multi-aspect Reviews

ICDM '12 Proceedings of the 2012 IEEE 12th International Conference on Data Mining
From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews

Proceedings of the 22nd international conference on World Wide Web
Do social explanations work?: studying and modeling the effects of social explanations in recommender systems

Proceedings of the 22nd international conference on World Wide Web

CoBaFi: collaborative bayesian filtering

Proceedings of the 23rd international conference on World wide web

Quantified Score

Hi-index	0.00

Visualization

Abstract

In order to recommend products to users we must ultimately predict how a user will respond to a new product. To do so we must uncover the implicit tastes of each user as well as the properties of each product. For example, in order to predict whether a user will enjoy Harry Potter, it helps to identify that the book is about wizards, as well as the user's level of interest in wizardry. User feedback is required to discover these latent product and user dimensions. Such feedback often comes in the form of a numeric rating accompanied by review text. However, traditional methods often discard review text, which makes user and product latent dimensions difficult to interpret, since they ignore the very text that justifies a user's rating. In this paper, we aim to combine latent rating dimensions (such as those of latent-factor recommender systems) with latent review topics (such as those learned by topic models like LDA). Our approach has several advantages. Firstly, we obtain highly interpretable textual labels for latent rating dimensions, which helps us to `justify' ratings with text. Secondly, our approach more accurately predicts product ratings by harnessing the information present in review text; this is especially true for new products and users, who may have too few ratings to model their latent factors, yet may still provide substantial information from the text of even a single review. Thirdly, our discovered topics can be used to facilitate other tasks such as automated genre discovery, and to identify useful and representative reviews.