Automatic information extraction from web pages

  • Authors:
  • Budi Rahardjo; Roland H. C. Yap

  • Affiliations:
  • National Univ. of Singapore, Republic of Singapore; National Univ. of Singapore, Republic of Singapore

  • Venue:
  • Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 2001

Abstract

Many web pages have implicit structure. In this paper, we show that it is feasible to extract data from web pages automatically using approximate matching techniques. The approach can be applied to generate wrappers automatically, to report or display differences between web pages, and to monitor web pages for changes.
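
The abstract does not say which approximate matching technique the authors use, so the following is only an illustrative sketch, not their method. It assumes that two pages generated from the same template can be flattened into token sequences and aligned, so that the tokens that fail to match are the candidate data fields. The alignment here comes from Python's standard difflib module as a stand-in, and the names tokenize and varying_fields are hypothetical.

```python
from difflib import SequenceMatcher
from html.parser import HTMLParser


class _Tokenizer(HTMLParser):
    """Flatten HTML into a list of tokens: '<tag>', '</tag>', or text."""

    def __init__(self):
        super().__init__()
        self.tokens = []

    def handle_starttag(self, tag, attrs):
        # Attributes are ignored in this sketch; only the tag name is kept.
        self.tokens.append(f"<{tag}>")

    def handle_endtag(self, tag):
        self.tokens.append(f"</{tag}>")

    def handle_data(self, data):
        text = data.strip()
        if text:  # skip whitespace-only runs between tags
            self.tokens.append(text)


def tokenize(html):
    """Hypothetical helper: turn an HTML string into a token list."""
    parser = _Tokenizer()
    parser.feed(html)
    parser.close()
    return parser.tokens


def varying_fields(page_a, page_b):
    """Align two pages and return (old, new) token-list pairs that differ.

    Tokens shared by both pages are treated as template structure;
    the unmatched spans are reported as candidate data or changes.
    """
    a, b = tokenize(page_a), tokenize(page_b)
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    return [
        (a[i1:i2], b[j1:j2])
        for op, i1, i2, j1, j2 in matcher.get_opcodes()
        if op != "equal"
    ]


if __name__ == "__main__":
    old = "<ul><li>Price: <b>10</b></li><li>Stock: <b>yes</b></li></ul>"
    new = "<ul><li>Price: <b>12</b></li><li>Stock: <b>yes</b></li></ul>"
    for before, after in varying_fields(old, new):
        print(before, "->", after)
```

Comparing two snapshots of the same URL in this way corresponds to the change-monitoring use case, while comparing two different pages built from one template hints at how the varying positions could seed a wrapper. A practical system would likely need a more tolerant matcher than an exact token comparison.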