A practical approach to feature selection
ML92 Proceedings of the ninth international workshop on Machine learning
Estimating attributes: analysis and extensions of RELIEF
ECML-94 Proceedings of the European conference on machine learning on Machine Learning
Wrappers for feature subset selection
Artificial Intelligence - Special issue on relevance
A vector space model for automatic indexing
Communications of the ACM
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Overcoming the Myopia of Inductive Learning Algorithms with RELIEFF
Applied Intelligence
Characterization of Classification Algorithms
EPIA '95 Proceedings of the 7th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
An adaptation of Relief for attribute estimation in regression
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Theoretical and Empirical Analysis of ReliefF and RReliefF
Machine Learning
An introduction to variable and feature selection
The Journal of Machine Learning Research
An extensive empirical study of feature selection metrics for text classification
The Journal of Machine Learning Research
Benchmarking Attribute Selection Techniques for Discrete Class Data Mining
IEEE Transactions on Knowledge and Data Engineering
EBizPort: collecting and analyzing business intelligence information
Journal of the American Society for Information Science and Technology
YALE: rapid prototyping for complex data mining tasks
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Utility scoring of product reviews
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Journal of Management Information Systems
Designing novel review ranking systems: predicting the usefulness and impact of reviews
Proceedings of the ninth international conference on Electronic commerce
Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums
ACM Transactions on Information Systems (TOIS)
Weighing Stars: Aggregating Online Product Reviews for Intelligent E-commerce Applications
IEEE Intelligent Systems
Opinion Mining and Sentiment Analysis
Foundations and Trends in Information Retrieval
Web page classification: Features and algorithms
ACM Computing Surveys (CSUR)
An Entropy-Based Model for Discovering the Usefulness of Online Product Reviews
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
'Helpfulness' in online communities: a measure of message quality
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
How opinions are received by online communities: a case study on amazon.com helpfulness votes
Proceedings of the 18th international conference on World wide web
Multi-facet Rating of Product Reviews
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Text Mining: Classification, Clustering, and Applications
Text Mining: Classification, Clustering, and Applications
Automatically assessing review helpfulness
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Proceedings of the 2010 ACM conference on Computer supported cooperative work
Exploiting social context for review quality prediction
Proceedings of the 19th international conference on World wide web
A quality-aware model for sales prediction using reviews
Proceedings of the 19th international conference on World wide web
A model for evaluating the quality of user-created documents
AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
IEEE Intelligent Systems
From frequency to meaning: vector space models of semantics
Journal of Artificial Intelligence Research
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
Optimal Search-Based Gene Subset Selection for Gene Array Cancer Classification
IEEE Transactions on Information Technology in Biomedicine
Hi-index | 0.00 |
Within the emerging context of Web 2.0 social media, online customer reviews are playing an increasingly important role in disseminating information, facilitating trust, and promoting commerce in the e-marketplace. The sheer volume of customer reviews on the web produces information overload for readers. Developing a system that can automatically identify the most helpful reviews would be valuable to businesses that are interested in gathering informative and meaningful customer feedback. Because the target variable---review helpfulness---is continuous, common feature selection techniques from text classification cannot be applied. In this article, we propose and investigate a text mining model, enhanced using the Regressional ReliefF (RReliefF) feature selection method, for predicting the helpfulness of online reviews from Amazon.com. We find that RReliefF significantly outperforms two popular dimension reduction methods. This study is the first to investigate and compare different dimension reduction techniques in the context of applying text regression for predicting online review helpfulness. Another contribution is that our analysis of the keywords selected by RReliefF reveals meaningful feature groupings.