Evaluative texts on the Web have become a valuable source of opinions on products, services, events, individuals, etc. Recently, many researchers have studied such opinion sources as product reviews, forum posts, and blogs. However, existing research has focused on classifying and summarizing opinions using natural language processing and data mining techniques. An important issue that has been neglected so far is opinion spam, or the trustworthiness of online opinions. In this paper, we study this issue in the context of product reviews, which are opinion-rich and widely used by consumers and product manufacturers. In the past two years, several startup companies that aggregate opinions from product reviews have also appeared. It is thus high time to study spam in reviews. To the best of our knowledge, there is still no published study on this topic, although Web spam and email spam have been investigated extensively. We will see that opinion spam is quite different from Web spam and email spam, and thus requires different detection techniques. Based on an analysis of 5.8 million reviews and 2.14 million reviewers from amazon.com, we show that opinion spam in reviews is widespread. This paper analyzes such spam activities and presents some novel techniques to detect them.