Identify Online Store Review Spammers via Social Review Graph

Authors:
Guan Wang;Sihong Xie;Bing Liu;Philip S. Yu
Affiliations:
University of Illinois at Chicago;University of Illinois at Chicago;University of Illinois at Chicago;University of Illinois at Chicago
Venue:
ACM Transactions on Intelligent Systems and Technology (TIST)
Year:
2012

Citing 15
Cited 1

Authoritative sources in a hyperlinked environment

Journal of the ACM (JACM)
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Bias and controversy: beyond the statistical deviation

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Extracting product features and opinions from reviews

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Netprobe: a fast and scalable system for fraud detection in online auction networks

Proceedings of the 16th international conference on World Wide Web
Opinion spam and analysis

WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Truth Discovery with Multiple Conflicting Information Providers on the Web

IEEE Transactions on Knowledge and Data Engineering
Bias and Controversy in Evaluation Systems

IEEE Transactions on Knowledge and Data Engineering
Opinion Mining and Sentiment Analysis

Foundations and Trends in Information Retrieval
SNARE: a link analytic system for graph labeling and risk detection

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Detecting product review spammers using rating behaviors

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Finding unusual review patterns using unexpected rules

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Detecting group review spam

Proceedings of the 20th international conference companion on World wide web
Distortion as a validation criterion in the identification of suspicious reviews

Proceedings of the First Workshop on Social Media Analytics
Finding deceptive opinion spam by any stretch of the imagination

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1

Uncovering collusive spammers in Chinese review websites

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

Online shopping reviews provide valuable information for customers to compare the quality of products, store services, and many other aspects of future purchases. However, spammers are joining this community trying to mislead consumers by writing fake or unfair reviews to confuse the consumers. Previous attempts have used reviewers’ behaviors such as text similarity and rating patterns, to detect spammers. These studies are able to identify certain types of spammers, for instance, those who post many similar reviews about one target. However, in reality, there are other kinds of spammers who can manipulate their behaviors to act just like normal reviewers, and thus cannot be detected by the available techniques. In this article, we propose a novel concept of review graph to capture the relationships among all reviewers, reviews and stores that the reviewers have reviewed as a heterogeneous graph. We explore how interactions between nodes in this graph could reveal the cause of spam and propose an iterative computation model to identify suspicious reviewers. In the review graph, we have three kinds of nodes, namely, reviewer, review, and store. We capture their relationships by introducing three fundamental concepts, the trustiness of reviewers, the honesty of reviews, and the reliability of stores, and identifying their interrelationships: a reviewer is more trustworthy if the person has written more honesty reviews; a store is more reliable if it has more positive reviews from trustworthy reviewers; and a review is more honest if many other honest reviews support it. This is the first time such intricate relationships have been identified for spam detection and captured in a graph model. We further develop an effective computation method based on the proposed graph model. Different from any existing approaches, we do not use an review text information. Our model is thus complementary to existing approaches and able to find more difficult and subtle spamming activities, which are agreed upon by human judges after they evaluate our results.