Implementation of a web robot and statistics on the Korean web

  • Authors:
  • Sung Jin Kim;Sang Ho Lee

  • Affiliations:
  • School of Computing, Soongsil University, Seoul, Korea;School of Computing, Soongsil University, Seoul, Korea

  • Venue:
  • HSI'03 Proceedings of the 2nd international conference on Human.society@internet
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

A web robot is a program that downloads and stores web pages. Implementation issues of web robots have been studied widely and various web statistics are reported in the literature. First, this paper describes the overall architecture of our robot and the implementation decisions on several important issues. Second, we show empirical statistics on approximately 73 million Korean web pages. We also identify what factors of web pages could affect the page changes. The factors may be used for the selection of web pages to be updated incrementally.