Detecting nepotistic links by language model disagreement

  • Authors:
  • András A. Benczúr;István Bíró;Károly Csalogány;Máté Uher

  • Affiliations:
  • Hungarian Academy of Sciences (MTA SZTAKI) and Eötvös University, Budapest;Hungarian Academy of Sciences (MTA SZTAKI) and Eötvös University, Budapest;Hungarian Academy of Sciences (MTA SZTAKI) and Eötvös University, Budapest;Hungarian Academy of Sciences (MTA SZTAKI) and Eötvös University, Budapest

  • Venue:
  • Proceedings of the 15th international conference on World Wide Web
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this short note we demonstrate the applicability of hyperlink downweighting by means of language model disagreement. The method filters out hyperlinks with no relevance to the target page without the need of white and blacklists or human interaction. We fight various forms of nepotism such as common maintainers, ads, link exchanges or misused affiliate programs. Our method is tested on a 31 M page crawl of the .de domain with a manually classified 1000-page random sample.