Mechanisms of knowledge evolution for web information extraction

  • Authors: Carsten Müller
  • Affiliations: SAP AG, Walldorf, Germany
  • Venue: Proceedings of the 2005 international conference on Federation over the Web
  • Year: 2005

Abstract

The knowledge needed for Web information extraction can, under certain assumptions, be characterized as the knowledge held by the wrappers used to extract the semantics of documents. The evolution of this knowledge can be divided into two phases: the initial learning of the wrappers and their later maintenance. This paper focuses only on the initial learning phase. Based on the LExIKON system, it explains the principal structure of learning algorithms for island wrappers.