Review spam detection via time series pattern discovery

  • Authors:
  • Sihong Xie;Guan Wang;Shuyang Lin;Philip S. Yu

  • Affiliations:
  • University of Illioins at Chicago, Chicago, IL, USA;University of Illioins at Chicago, Chicago, IL, USA;University of Illioins at Chicago, Chicago, IL, USA;University of Illioins at Chicago, Chicago, IL, USA & King Abdulaziz University Jeddah, Saudi Arabia

  • Venue:
  • Proceedings of the 21st international conference companion on World Wide Web
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Online reviews play a crucial role in today's electronic commerce. Due to the pervasive spam reviews, customers can be misled to buy low-quality products, while decent stores can be defamed by malicious reviews. We observe that, in reality, a great portion ( 90% in the data we study) of the reviewers write only one review (singleton review). These reviews are so enormous in number that they can almost determine a store's rating and impression. However, existing methods ignore these reviewers. To address this problem, we observe that the normal reviewers' arrival pattern is stable and uncorrelated to their rating pattern temporally. In contrast, spam attacks are usually bursty and either positively or negatively correlated to the rating. Thus, we propose to detect such attacks via unusually correlated temporal patterns. We identify and construct multidimensional time series based on aggregate statistics, in order to depict and mine such correlation. Experimental results show that the proposed method is effective in detecting singleton review attacks. We discover that singleton review is a significant source of spam reviews and largely affects the ratings of online stores.