Automatic categorization of web sites based on source types

  • Authors:
  • Shourya Roy;Sachindra Joshi;Raghu Krishnapuram

  • Affiliations:
  • IBM India Research Lab, New Delhi, INDIA;IBM India Research Lab, New Delhi, INDIA;IBM India Research Lab, New Delhi, INDIA

  • Venue:
  • Proceedings of the fifteenth ACM conference on Hypertext and hypermedia
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

An important issue with the Web is verification of the accuracy, currency and authenticity of the information associated with Web sites. One way to address this problem is to identify the "source" or "sponsor" of the Web site. However, source identification is non-trivial because the source of a Web site cannot always be determined by the URL or content of the site. In this paper, we propose a method for source identification that uses various types of inbound, outbound and internal interactions that arise due to hyperlinks between and within Web sites.