Mechanisms of knowledge evolution for web information extraction

Authors:
Carsten Müller
Affiliations:
SAP AG, Walldorf, Germany
Venue:
Proceedings of the 2005 international conference on Federation over the Web
Year:
2005

Citing 14
Cited 0

Learning Information Extraction Rules for Semi-Structured and Free Text

Machine Learning - Special issue on natural language learning
Regression testing for wrapper maintenance

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Token-Templates and Logic Programs for Intelligent Web Search

Journal of Intelligent Information Systems - Special issue on methodologies for intelligent information systems
Wrapper induction: efficiency and expressiveness

Artificial Intelligence - Special issue on Intelligent internet systems
Inductive Inference: Theory and Methods

ACM Computing Surveys (CSUR)
A linear space algorithm for computing maximal common subsequences

Communications of the ACM
Hierarchical Wrapper Induction for Semistructured Information Sources

Autonomous Agents and Multi-Agent Systems
Ontology Learning for the Semantic Web

IEEE Intelligent Systems
Learning Approaches to Wrapper Induction

Proceedings of the Fourteenth International Florida Artificial Intelligence Research Society Conference
Consistency Queries in Information Extraction

ALT '02 Proceedings of the 13th International Conference on Algorithmic Learning Theory
A Unifying Approach to HTML Wrapper Representation and Learning

DS '00 Proceedings of the Third International Conference on Discovery Science
Advanced elementary formal systems

Theoretical Computer Science - Selected papers in honour of Setsuo Arikawa
Wrapper maintenance: a machine learning approach

Journal of Artificial Intelligence Research
Research in the theory of inductive inference by GDR mathematicians-A survey

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

The knowledge that is needed in Web information extraction can, under certain assumptions, be characterized as the knowledge held by wrappers that are used to extract the semantics of documents. The evolution of this knowledge can be divided into the phase of initial learning of the wrappers and the later phase of wrapper maintenance. In this paper we will focus only on the initial learning phase. Based on the LExIKON System, the principal structure of learning algorithms for island wrappers is explained.