This paper studies the problem of designing real-time helpfulness prediction algorithms. The conventional approach uses the fraction of positive votes as the measure of helpfulness. We instead give ‘helpfulness’ a natural and mathematically precise definition: the probability that a user will vote ‘helpful’ on a piece of user-generated content. Building on this definition, we introduce a principled methodology for helpfulness prediction in which the prediction problem is formulated as an optimization problem. Under this methodology, we first develop a batch (off-line) algorithm. Experiments on data from Amazon.com suggest that the proposed model outperforms support vector regression, the previously reported prediction algorithm. Some settings require an online algorithm that can update the model as additional data arrive, so we also propose an online algorithm that incrementally updates the model parameters. Finally, we provide an efficient hybrid algorithm that improves the convergence rate and prediction precision. The online and hybrid algorithms are tested on real-life user-generated content, and the experimental results indicate that the hybrid approach processes incoming data efficiently and generates reliable helpfulness predictions for users. Copyright © 2011 John Wiley & Sons, Ltd.
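To make the probabilistic definition concrete, the following is a minimal sketch of how helpfulness prediction might be posed as an optimization problem. It is not the paper's model: the logistic form, the feature vectors, the binomial likelihood, the learning rates, and all function names (`predict`, `fit_batch`, `online_update`) are illustrative assumptions. Each item is assumed to carry a feature vector plus counts of helpful and total votes; the batch routine minimizes the negative log-likelihood over all items, and the online routine takes one gradient step when a new item's votes arrive.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict(w, x):
    """Estimated probability that a user votes 'helpful' (assumed logistic model)."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)))


def neg_log_likelihood(w, data):
    """Binomial negative log-likelihood; data holds (features, helpful, total)."""
    total = 0.0
    for x, h, n in data:
        p = min(max(predict(w, x), 1e-12), 1.0 - 1e-12)
        total -= h * math.log(p) + (n - h) * math.log(1.0 - p)
    return total


def fit_batch(data, dim, iterations=1000, lr=1.0):
    """Batch (off-line) fit: full-gradient descent on the negative log-likelihood."""
    w = [0.0] * dim
    votes = sum(n for _, _, n in data)
    for _ in range(iterations):
        grad = [0.0] * dim
        for x, h, n in data:
            residual = h - n * predict(w, x)  # observed minus expected helpful votes
            for j, xj in enumerate(x):
                grad[j] += residual * xj
        # Step along the log-likelihood gradient, scaled by the total vote count.
        w = [wj + lr * gj / votes for wj, gj in zip(w, grad)]
    return w


def online_update(w, x, h, n, lr=0.05):
    """Online variant: one incremental gradient step for a newly arrived item."""
    residual = h - n * predict(w, x)
    return [wj + lr * residual * xj for wj, xj in zip(w, x)]


# Hypothetical toy data: (bias + one feature, helpful votes, total votes).
toy = [([1.0, 1.0], 9, 10), ([1.0, 0.0], 5, 10), ([1.0, -1.0], 1, 10)]
weights = fit_batch(toy, dim=2)
weights = online_update(weights, [1.0, 0.0], 10, 10)
```

Under these assumptions, an item with few votes contributes little to the objective, whereas a raw fraction of positive votes would treat one helpful vote out of one the same as a hundred out of a hundred.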