Spotting fake reviewer groups in consumer reviews

Authors:
Arjun Mukherjee;Bing Liu;Natalie Glance
Affiliations:
University of Illinois at Chicago, Chicago, IL, USA;University of Illinois at Chicago, Chicago, IL, USA;Google Inc., Pittsburgh, PA, USA
Venue:
Proceedings of the 21st international conference on World Wide Web
Year:
2012

Citing 32
Cited 17

Making large-scale support vector machine learning practical

Advances in kernel methods
Authoritative sources in a hyperlinked environment

Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Scientific Computing

Scientific Computing
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
The Sybil Attack

IPTPS '01 Revised Papers from the First International Workshop on Peer-to-Peer Systems
Optimizing search engines using clickthrough data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
An efficient boosting algorithm for combining preferences

The Journal of Machine Learning Research
MailRank: using ranking for spam detection

Proceedings of the 14th ACM international conference on Information and knowledge management
Learning to rank using gradient descent

ICML '05 Proceedings of the 22nd international conference on Machine learning
Reality mining: sensing complex social systems

Personal and Ubiquitous Computing
Topical TrustRank: using topicality to combat web spam

Proceedings of the 15th international conference on World Wide Web
Detecting spam web pages through content analysis

Proceedings of the 15th international conference on World Wide Web
Utility scoring of product reviews

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
A reference collection for web spam

ACM SIGIR Forum
Spam double-funnel: connecting web spammers with advertisers

Proceedings of the 16th international conference on World Wide Web
Combating spam in tagging systems

AIRWeb '07 Proceedings of the 3rd international workshop on Adversarial information retrieval on the web
Mining behavioral groups in large wireless LANs

Proceedings of the 13th annual ACM international conference on Mobile computing and networking
Opinion spam and analysis

WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Community Discovery Based on Social Actors' Interests and Social Relationships

SKG '08 Proceedings of the 2008 Fourth International Conference on Semantics, Knowledge and Grid
Web spam identification through language model analysis

Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web
Social spam detection

Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web
Detecting spammers and content promoters in online video social networks

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Detecting spam blogs: a machine learning approach

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Automatically assessing review helpfulness

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Learning to Rank for Information Retrieval

Foundations and Trends in Information Retrieval
Detecting product review spammers using rating behaviors

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Finding unusual review patterns using unexpected rules

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Detecting group review spam

Proceedings of the 20th international conference companion on World wide web
Adversarial Web Search

Foundations and Trends in Information Retrieval
Finding deceptive opinion spam by any stretch of the imagination

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Review Graph Based Online Store Review Spammer Detection

ICDM '11 Proceedings of the 2011 IEEE 11th International Conference on Data Mining
Learning to identify review spam

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three

Review spam detection via temporal pattern discovery

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
A generic approach to generate opinion lists of phrases for opinion mining applications

Proceedings of the First International Workshop on Issues of Sentiment Discovery and Opinion Mining
Modeling review comments

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Mining coherent anomaly collections on web data

Proceedings of the 21st ACM international conference on Information and knowledge management
Simultaneously detecting fake reviews and review spammers using factor graph model

Proceedings of the 5th Annual ACM Web Science Conference
Spotting opinion spammers using behavioral footprints

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Why people hate your app: making sense of user feedback in a mobile app store

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Iolaus: securing online content rating systems

Proceedings of the 22nd international conference on World Wide Web
The best answers? think twice: online detection of commercial campaigns in the CQA forums

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Battling the internet water army: detection of hidden paid posters

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Uncovering collusive spammers in Chinese review websites

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Potential Power and Problems in Sentiment Mining of Social Media

International Journal of Strategic Decision Sciences
Detecting collusive spammers in online review communities

Proceedings of the sixth workshop on Ph.D. students in information and knowledge management
A study of manipulative and authentic negative reviews

Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication
Demographics, weather and online reviews: a study of restaurant recommendations

Proceedings of the 23rd international conference on World wide web
Opinion Bias Detection with Social Preference Learning in Social Data

International Journal on Semantic Web & Information Systems
External validity of sentiment mining reports: Can current methods identify demographic biases, event biases, and manipulation of reviews?

Decision Support Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Opinionated social media such as product reviews are now widely used by individuals and organizations for their decision making. However, due to the reason of profit or fame, people try to game the system by opinion spamming (e.g., writing fake reviews) to promote or demote some target products. For reviews to reflect genuine user experiences and opinions, such spam reviews should be detected. Prior works on opinion spam focused on detecting fake reviews and individual fake reviewers. However, a fake reviewer group (a group of reviewers who work collaboratively to write fake reviews) is even more damaging as they can take total control of the sentiment on the target product due to its size. This paper studies spam detection in the collaborative setting, i.e., to discover fake reviewer groups. The proposed method first uses a frequent itemset mining method to find a set of candidate groups. It then uses several behavioral models derived from the collusion phenomenon among fake reviewers and relation models based on the relationships among groups, individual reviewers, and products they reviewed to detect fake reviewer groups. Additionally, we also built a labeled dataset of fake reviewer groups. Although labeling individual fake reviews and reviewers is very hard, to our surprise labeling fake reviewer groups is much easier. We also note that the proposed technique departs from the traditional supervised learning approach for spam detection because of the inherent nature of our problem which makes the classic supervised learning approach less effective. Experimental results show that the proposed method outperforms multiple strong baselines including the state-of-the-art supervised classification, regression, and learning to rank algorithms.