Crawlets: Agents for High Performance Web Search Engines

  • Authors:
  • Prasannaa Thati;Po-Hao Chang;Gul Agha

  • Venue:
  • MA '01 Proceedings of the 5th International Conference on Mobile Agents
  • Year:
  • 2001

Abstract

Among the reasons for the unsatisfactory performance of today's search engines are their centralized approach to web crawling and the lack of explicit support from web servers. We propose a modification to conventional crawling in which a search engine uploads simple agents, called crawlets, to web sites. A crawlet crawls the pages at a site locally and sends a compact summary back to the search engine. This not only reduces bandwidth requirements and network latencies, but also parallelizes crawling across sites. Crawlets also provide an effective means of achieving the performance gains of personalized web servers, and can make up for the lack of cooperation from conventional web servers. The specialized nature of crawlets allows simple solutions to security and resource control problems, and reduces the software requirements at participating web sites. In fact, we propose an implementation that requires no changes to web servers, only the installation of a few (active) web pages at host sites.
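
As a rough illustration of the idea of crawling a site locally and returning only a compact summary, the following is a minimal sketch and not the paper's implementation. The abstract does not specify a summary format or transport, so everything here is an assumption: the site is modeled as an in-memory mapping from paths to HTML, the summary holds a content digest and a few frequent terms per page, and the names `crawl_locally`, `pack_summary`, and `unpack_summary` are hypothetical.

```python
import hashlib
import json
import re
import zlib
from collections import Counter
from html.parser import HTMLParser


class _PageParser(HTMLParser):
    """Collects text fragments and href targets from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_data(self, data):
        self.text.append(data)


def crawl_locally(site, root="/index.html", top_terms=3):
    """Walk the pages reachable from `root` inside `site` (dict: path -> HTML).

    Returns dict: path -> {"digest": ..., "terms": [...]}.
    This summary layout is an assumption made for illustration.
    """
    summary = {}
    pending = [root]
    while pending:
        path = pending.pop()
        if path in summary or path not in site:
            continue  # already visited, or the link points outside the site
        parser = _PageParser()
        parser.feed(site[path])
        words = re.findall(r"[a-z]+", " ".join(parser.text).lower())
        counts = Counter(words)
        # Most frequent first; ties broken alphabetically for determinism.
        ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
        summary[path] = {
            "digest": hashlib.sha1(site[path].encode("utf-8")).hexdigest(),
            "terms": [word for word, _ in ranked[:top_terms]],
        }
        pending.extend(parser.links)
    return summary


def pack_summary(summary):
    """Serialize and compress the summary before it leaves the site."""
    return zlib.compress(json.dumps(summary, sort_keys=True).encode("utf-8"))


def unpack_summary(blob):
    """Inverse of pack_summary, as the search engine side might run it."""
    return json.loads(zlib.decompress(blob).decode("utf-8"))
```

In this sketch only the packed summary would cross the network, while the page bodies stay at the host site; a real crawlet would also need the security and resource controls the abstract mentions, which are omitted here.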