Real-time helpfulness prediction based on voter opinions

Authors:
Richong Zhang;Thomas Tran;Yongyi Mao
Affiliations:
School of Information Technology and Engineering, University of Ottawa, 800 King Edward Avenue, Ottawa, K1N6N5, Canada;School of Information Technology and Engineering, University of Ottawa, 800 King Edward Avenue, Ottawa, K1N6N5, Canada;School of Information Technology and Engineering, University of Ottawa, 800 King Edward Avenue, Ottawa, K1N6N5, Canada
Venue:
Concurrency and Computation: Practice & Experience
Year:
2012

Citing 22
Cited 1

Probabilistic latent semantic indexing

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
A Tutorial on Support Vector Machines for Pattern Recognition

Data Mining and Knowledge Discovery
Latent dirichlet allocation

The Journal of Machine Learning Research
Predicting the semantic orientation of adjectives

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Dynamic topic models

ICML '06 Proceedings of the 23rd international conference on Machine learning
Movie review mining and summarization

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Utility scoring of product reviews

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval

ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Modeling hidden topics on document manifold

Proceedings of the 17th ACM conference on Information and knowledge management
Multi-aspect expertise matching for review assignment

Proceedings of the 17th ACM conference on Information and knowledge management
The Effect of On-Line Consumer Reviews on Consumer Purchasing Intention: The Moderating Role of Involvement

International Journal of Electronic Commerce
On-line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Modeling and Predicting the Helpfulness of Online Reviews

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Rated aspect summarization of short comments

Proceedings of the 18th international conference on World wide web
Automatically assessing the post quality in online discussions on software

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Automatically assessing review helpfulness

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Review recommendation with graphical model and EM algorithm

Proceedings of the 19th international conference on World wide web
Online multiscale dynamic topic models

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
An information gain-based approach for recommending useful product reviews

Knowledge and Information Systems

Managing Web 2.0 Content

Concurrency and Computation: Practice & Experience

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper studies the problem of designing real-time helpfulness prediction algorithms. Instead of following the conventional route, in which the fraction of positive votes is used as the measure of helpfulness, we give ‘helpfulness’ a naturally sensible and mathematically precise definition, namely, as the probability that a user will vote ‘helpful’ on the user-generated content. Building on this definition, we introduce a principled methodology to helpfulness prediction, in which the prediction problem is naturally formulated as an optimization problem. Under this proposed methodology, we first develop a batch (off-line) algorithm. Experiments on data from Amazon.com suggest that our proposed model in fact outperforms the previously reported prediction algorithm, support vector regression. In some circumstances, an online algorithm that can update the model as additional data arrive is required. In light of this, we proposed an online algorithm that incrementally updates the parameters of the model. Finally, an efficient hybrid algorithm is provided to increase the convergence rate and prediction precision. The final two algorithms are tested on real-life user-generated contents, and experimental results illustrate that the hybrid approach efficiently processes incoming data and generates reliable helpfulness predictions for users. Copyright © 2011 John Wiley & Sons, Ltd.