Profile-Based Focused Crawler for Social Media-Sharing Websites

  • Authors:
  • Zhiyong Zhang;Olfa Nasraoui

  • Venue:
  • ICTAI '08 Proceedings of the 2008 20th IEEE International Conference on Tools with Artificial Intelligence - Volume 01
  • Year:
  • 2008

Abstract

In this paper, we present a novel profile-based focused crawling system for the increasingly popular social media-sharing websites. The system treats users' profiles as ranking criteria that guide the crawling process. Each user's profile is divided into two parts: an internal part, which comes from the user's own contribution, and an external part, which comes from the user's social contacts. To extract data from a social media-sharing website efficiently and effectively for focused crawling, we first developed a path-string-based page-classification method that identifies list pages, detail pages, and profile pages.
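
The abstract does not specify how the internal and external profile parts are combined or how candidate pages are scored, so the following is only a minimal illustrative sketch of profile-guided ranking, not the authors' method. It assumes profiles and pages are represented as bags of tags, that the two profile parts are blended with a hypothetical weight `alpha`, and that candidates are ordered by cosine similarity to the blended profile. All function names, the example URLs, and the value of `alpha` are assumptions made for illustration.

```python
import heapq
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-weight mappings."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def combine_profile(internal, external, alpha=0.7):
    """Blend the internal and external profile parts.

    alpha is a hypothetical weight (not from the paper): the share given
    to the user's own contribution versus that of social contacts.
    """
    merged = Counter()
    for term, weight in internal.items():
        merged[term] += alpha * weight
    for term, weight in external.items():
        merged[term] += (1 - alpha) * weight
    return merged


def rank_frontier(profile, candidates):
    """Return candidate URLs ordered from most to least similar to the profile."""
    # heapq is a min-heap, so similarities are negated to pop the best first.
    heap = [(-cosine(profile, Counter(tags)), url) for url, tags in candidates.items()]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[1] for _ in range(len(heap))]


# Illustrative data only: tags from the user's own uploads and from contacts.
internal = Counter({"jazz": 3, "guitar": 1})
external = Counter({"rock": 2, "guitar": 1})
candidates = {
    "/v/1": ["jazz", "guitar"],
    "/v/2": ["cooking"],
    "/v/3": ["rock"],
}

profile = combine_profile(internal, external)
order = rank_frontier(profile, candidates)
```

In this sketch, a page matching only the contacts' interests (`/v/3`) still ranks above an unrelated page (`/v/2`), which illustrates why an external profile part can widen the crawl beyond the user's own contribution.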