A few bad votes too many?: towards robust ranking in social media

Authors:
Jiang Bian;Yandong Liu;Eugene Agichtein;Hongyuan Zha
Affiliations:
Georgia Institute of Technology;Emory University;Emory University;Georgia Institute of Technology
Venue:
AIRWeb '08 Proceedings of the 4th international workshop on Adversarial information retrieval on the web
Year:
2008

Citing 16
Cited 21

Optimizing search engines using clickthrough data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Implicit feedback for inferring user preference: a bibliography

ACM SIGIR Forum
Accurately interpreting clickthrough data as implicit feedback

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Finding similar questions in large question and answer archives

Proceedings of the 14th ACM international conference on Information and knowledge management
An analysis of the AskMSR question-answering system

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Improving web search ranking by incorporating user behavior information

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
A framework to predict the quality of answers with non-textual features

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Lies and propaganda: detecting spam users in collaborative filtering

Proceedings of the 12th international conference on Intelligent user interfaces
Internet-scale collection of human-reviewed data

Proceedings of the 16th international conference on World Wide Web
Detectives: detecting coalition hit inflation attacks in advertising networks streams

Proceedings of the 16th international conference on World Wide Web
A regression framework for learning ranking functions using relative relevance judgments

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Fighting Spam on Social Web Sites: A Survey of Approaches and Future Challenges

IEEE Internet Computing
Finding high-quality content in social media

WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Finding the right facts in the crowd: factoid question answering over social media

Proceedings of the 17th international conference on World Wide Web
Minimally invasive randomization for collecting unbiased preferences from clickthrough logs

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Click fraud resistant methods for learning click-through rates

WINE'05 Proceedings of the First international conference on Internet and Network Economics

Social spam detection

Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web
Ranking community answers by modeling question-answer relationships via analogical reasoning

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Model for Voter Scoring and Best Answer Selection in Community Q&A Services

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Incorporating robustness into web ranking evaluation

Proceedings of the 18th ACM conference on Information and knowledge management
A brief survey of computational approaches in social computing

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
The participation payoff: challenges and opportunities for multimedia access in networked communities

Proceedings of the international conference on Multimedia information retrieval
Measurement and analysis of an online content voting network: a case study of Digg

Proceedings of the 19th international conference on World wide web
SpotRank: a robust voting system for social news websites

Proceedings of the 4th workshop on Information credibility
Analysing multimedia content in social networking environments

Proceedings of the 2010 ACM workshop on Social, adaptive and personalized multimedia interaction and access
Automatic image semantic interpretation using social action and tagging data

Multimedia Tools and Applications
Automatically discovering quality-assured consensual knowledge in social web

Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
Adversarial Web Search

Foundations and Trends in Information Retrieval
A community question-answering refinement system

Proceedings of the 22nd ACM conference on Hypertext and hypermedia
Is vote effective? an empirical study of community deliberation in social webs

WISS'10 Proceedings of the 2010 international conference on Web information systems engineering
Mobile content contribution and retrieval: An exploratory study using the uses and gratifications paradigm

Information Processing and Management: an International Journal
Using media related user profiles to personalize multimedia access over social networks

SBNMA '11 Proceedings of the 2011 ACM workshop on Social and behavioural networked media access
Assessing the quality of textual features in social media

Information Processing and Management: an International Journal
Discerning actuality in backstage: comprehensible contextual aging

EC-TEL'12 Proceedings of the 7th European conference on Technology Enhanced Learning
Beyond pollution and taste: A tag-based strategy to increase download quality in P2P file sharing systems

Computer Communications
Bayesian vote weighting in crowdsourcing systems

ICCCI'12 Proceedings of the 4th international conference on Computational Collective Intelligence: technologies and applications - Volume Part I
The best answers? think twice: online detection of commercial campaigns in the CQA forums

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

Online social media draws heavily on active reader participation, such as voting or rating of news stories, articles, or responses to a question. This user feedback is invaluable for ranking, filtering, and retrieving high quality content - tasks that are crucial with the explosive amount of social content on the web. Unfortunately, as social media moves into the mainstream and gains in popularity, the quality of the user feedback degrades. Some of this is due to noise, but, increasingly, a small fraction of malicious users are trying to "game the system" by selectively promoting or demoting content for profit, or fun. Hence, an effective ranking of social media content must be robust to noise in the user interactions, and in particular to vote spam. We describe a machine learning based ranking framework for social media that integrates user interactions and content relevance, and demonstrate its effec- tiveness for answer retrieval in a popular community question answering portal. We consider several vote spam attacks, and introduce a method of training our ranker to increase its robustness to some common forms of vote spam attacks. The results of our large-scale experimental evaluation show that our ranker is signifcicantly more robust to vote spam compared to a state-of-the-art baseline as well as the ranker not explicitly trained to handle malicious interactions.