Automatic wrapper generation for metasearch using ordered tree structured patterns

  • Authors:
  • Kazuhide Aikou;Yusuke Suzuki;Takayoshi Shoudai;Tetsuhiro Miyahara

  • Affiliations:
  • Department of Informatics, Kyushu University, Kasuga, Japan;Department of Informatics, Kyushu University, Kasuga, Japan;Department of Informatics, Kyushu University, Kasuga, Japan;Faculty of Information Sciences, Hiroshima City University, Hiroshima, Japan

  • Venue:
  • AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

A wrapper is a program which extracts data from a web site and reorganizes them in a database Wrapper generation from web sites is a key technique in realizing such a metasearch system We present a new method of automatic wrapper generation for metasearch using our efficient learning algorithm for term trees Term trees are ordered tree structured patterns with structured variables, which represent structural features common to tree structured data such as HTML files.