Privacy-aware spam detection in social bookmarking systems

Authors:
Beate Navarro Bullock;Hana Lerch;Alexander Roßnagel;Andreas Hotho;Gerd Stumme
Affiliations:
University of Kassel, Kassel, Germany;University of Kassel, Design Kassel, Germany;University of Kassel, Design Kassel, Germany;University of Würzburg, Würzburg, Germany;University of Kassel, Kassel, Germany
Venue:
i-KNOW '11 Proceedings of the 11th International Conference on Knowledge Management and Knowledge Technologies
Year:
2011

Abstract

With the increased popularity of Web 2.0 services in recent years, data privacy has become a major concern for users. The more personal data users reveal, the more difficult it becomes to control its disclosure on the web. However, for Web 2.0 service providers, the data provided by users is a valuable source for offering effective, personalised data mining services. One major application is the detection of spam in social bookmarking systems: to prevent a decline in content quality, providers need to identify spammers and exclude them from the system. They thereby face a conflict of interests: on the one hand, they need to identify spammers based on the information they collect about users; on the other hand, they need to respect privacy concerns and process as little personal data as possible. It would therefore be of great help for system developers and users to know which personal data are needed for spam detection and which can be ignored. In this paper we address these questions by presenting a data-privacy-aware feature engineering approach. It consists of designing features for spam classification and evaluating them according to both performance and privacy conditions. Experiments using data from the social bookmarking system BibSonomy show that the two conditions need not exclude each other.