Syntactic clustering of the Web
Selected papers from the sixth international conference on World Wide Web
Improved algorithms for topic distillation in a hyperlinked environment
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Trawling the Web for emerging cyber-communities
WWW '99 Proceedings of the eighth international conference on World Wide Web
Authoritative sources in a hyperlinked environment
Journal of the ACM (JACM)
The stochastic approach for link-structure analysis (SALSA) and the TKC effect
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
A comparison of techniques to find mirrored hosts on the WWW
Journal of the American Society for Information Science
Proceedings of the 10th international conference on World Wide Web
Finding authorities and hubs from link structures on the World Wide Web
Proceedings of the 10th international conference on World Wide Web
Improvement of HITS-based algorithms on web documents
Proceedings of the 11th international conference on World Wide Web
Mining the Web: Discovering Knowledge from HyperText Data
Mining the Web: Discovering Knowledge from HyperText Data
Learning to Probabilistically Identify Authoritative Documents
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
The connectivity sonar: detecting site functionality by structural patterns
Proceedings of the fourteenth ACM conference on Hypertext and hypermedia
Spam, damn spam, and statistics: using statistical analysis to locate spam web pages
Proceedings of the 7th International Workshop on the Web and Databases: colocated with ACM SIGMOD/PODS 2004
Combating web spam with trustrank
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Hyperlink analysis on the world wide web
Proceedings of the sixteenth ACM conference on Hypertext and hypermedia
MailRank: using ranking for spam detection
Proceedings of the 14th ACM international conference on Information and knowledge management
Topical TrustRank: using topicality to combat web spam
Proceedings of the 15th international conference on World Wide Web
Site level noise removal for search engines
Proceedings of the 15th international conference on World Wide Web
Detecting spam web pages through content analysis
Proceedings of the 15th international conference on World Wide Web
Detecting semantic cloaking on the web
Proceedings of the 15th international conference on World Wide Web
Undue influence: eliminating the impact of link plagiarism on web search rankings
Proceedings of the 2006 ACM symposium on Applied computing
Topical link analysis for web search
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Link spam detection based on mass estimation
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Multi-level Link Structure Analysis Technqiue for Detecting Link Farm Spam Pages
WI-IATW '06 Proceedings of the 2006 IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology
Web searching, search engines and Information Retrieval
Information Services and Use
Spam double-funnel: connecting web spammers with advertisers
Proceedings of the 16th international conference on World Wide Web
Extraction and classification of dense communities in the web
Proceedings of the 16th international conference on World Wide Web
Improving web spam classification using rank-time features
AIRWeb '07 Proceedings of the 3rd international workshop on Adversarial information retrieval on the web
Improving web spam classifiers using link structure
AIRWeb '07 Proceedings of the 3rd international workshop on Adversarial information retrieval on the web
Transductive link spam detection
AIRWeb '07 Proceedings of the 3rd international workshop on Adversarial information retrieval on the web
Using spam farm to boost PageRank
AIRWeb '07 Proceedings of the 3rd international workshop on Adversarial information retrieval on the web
Extracting link spam using biased random walks from spam seed sets
AIRWeb '07 Proceedings of the 3rd international workshop on Adversarial information retrieval on the web
A large-scale study of link spam detection by graph algorithms
AIRWeb '07 Proceedings of the 3rd international workshop on Adversarial information retrieval on the web
Measuring similarity to detect qualified links
AIRWeb '07 Proceedings of the 3rd international workshop on Adversarial information retrieval on the web
Know your neighbors: web spam detection using the web topology
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Countering web spam with credibility-based link analysis
Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
Link analysis for Web spam detection
ACM Transactions on the Web (TWEB)
Detecting splogs via temporal dynamics using self-similarity analysis
ACM Transactions on the Web (TWEB)
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
DirichletRank: Solving the zero-one gap problem of PageRank
ACM Transactions on Information Systems (TOIS)
Larger is better: seed selection in link-based anti-spamming algorithms
Proceedings of the 17th international conference on World Wide Web
Web Structure Mining by Isolated Stars
Algorithms and Models for the Web-Graph
Cleaning search results using term distance features
AIRWeb '08 Proceedings of the 4th international workshop on Adversarial information retrieval on the web
Predicting web spam with HTTP session information
Proceedings of the 17th ACM conference on Information and knowledge management
The Metadata Triumvirate: Social Annotations, Anchor Texts and Search Queries
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Extraction and classification of dense implicit communities in the Web graph
ACM Transactions on the Web (TWEB)
Web Structure Mining by Isolated Cliques
IEICE - Transactions on Information and Systems
Improvements of HITS Algorithms for Spam Links
IEICE - Transactions on Information and Systems
Looking into the past to better classify web spam
Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web
Web spam filtering in internet archives
Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web
Nullification test collections for web spam and SEO
Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web
Link spam target detection using page farms
ACM Transactions on Knowledge Discovery from Data (TKDD)
Large human communication networks: patterns and a utility-driven generator
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Detecting spam blogs: a machine learning approach
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Nonlinear static-rank computation
Proceedings of the 18th ACM conference on Information and knowledge management
Proceedings of the 18th ACM conference on Information and knowledge management
Exploiting bidirectional links: making spamming detection easier
Proceedings of the 18th ACM conference on Information and knowledge management
Automatic seed set expansion for trust propagation based anti-spamming algorithms
Proceedings of the eleventh international workshop on Web information and data management
An axiomatic approach to personalized ranking systems
Journal of the ACM (JACM)
Foundations and Trends in Information Retrieval
The Journal of Machine Learning Research
Modeling the web as a hypergraph to compute page reputation
Information Systems
Improvements of HITS algorithms for spam links
APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Crowdsourcing human-based computation
Proceedings of the 6th Nordic Conference on Human-Computer Interaction: Extending Boundaries
Temporal query log profiling to improve web search ranking
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Mining useful time graph patterns on extensively discussed topics on the web
DASFAA'10 Proceedings of the 15th international conference on Database systems for advanced applications
Let web spammers expose themselves
Proceedings of the fourth ACM international conference on Web search and data mining
Removing web spam links from search engine results
Journal in Computer Virology
The dark side of the Internet: Attacks, costs and responses
Information Systems
Detecting spam blogs from blog search results
Information Processing and Management: an International Journal
Foundations and Trends in Information Retrieval
deSEO: combating search-result poisoning
SEC'11 Proceedings of the 20th USENIX conference on Security
SURF: detecting and measuring search poisoning
Proceedings of the 18th ACM conference on Computer and communications security
Thwarting the nigritude ultramarine: learning to identify link spam
ECML'05 Proceedings of the 16th European conference on Machine Learning
Understanding and combating link farming in the twitter social network
Proceedings of the 21st international conference on World Wide Web
Survey on web spam detection: principles and algorithms
ACM SIGKDD Explorations Newsletter
Fighting against web spam: a novel propagation method based on click-through data
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Analysis and detection of web spam by means of web content
IRFC'12 Proceedings of the 5th conference on Multidisciplinary Information Retrieval
Detecting Fake Medical Web Sites Using Recursive Trust Labeling
ACM Transactions on Information Systems (TOIS)
RAID'12 Proceedings of the 15th international conference on Research in Attacks, Intrusions, and Defenses
A Self-Supervised Approach to Comment Spam Detection Based on Content Analysis
International Journal of Information Security and Privacy
Automatic seed set expansion for trust propagation based anti-spam algorithms
Information Sciences: an International Journal
Transforming graph data for statistical relational learning
Journal of Artificial Intelligence Research
Combating Web spam through trust-distrust propagation with confidence
Pattern Recognition Letters
SAAD, a content based Web Spam Analyzer and Detector
Journal of Systems and Software
Take this personally: pollution attacks on personalized services
SEC'13 Proceedings of the 22nd USENIX conference on Security
Campaign extraction from social media
ACM Transactions on Intelligent Systems and Technology (TIST) - Special Section on Intelligent Mobile Knowledge Discovery and Management Systems and Special Issue on Social Web Mining
Compact representation of Web graphs with extended functionality
Information Systems
How to Improve Your Search Engine Ranking: Myths and Reality
ACM Transactions on the Web (TWEB)
Hi-index | 0.00 |
With the increasing importance of search in guiding today's web traffic, more and more effort has been spent to create search engine spam. Since link analysis is one of the most important factors in current commercial search engines' ranking systems, new kinds of spam aiming at links have appeared. Building link farms is one technique that can deteriorate link-based ranking algorithms. In this paper, we present algorithms for detecting these link farms automatically by first generating a seed set based on the common link set between incoming and outgoing links of Web pages and then expanding it. Links between identified pages are re-weighted, providing a modified web graph to use in ranking page importance. Experimental results show that we can identify most link farm spam pages and the final ranking results are improved for almost all tested queries.