An overview and classification of adaptive approaches to information extraction

  • Authors: Christian Siefkes; Peter Siniakov

  • Affiliations: Database and Information Systems Group, Freie Universität Berlin, Berlin, Germany (both authors)

  • Venue: Journal on Data Semantics IV
  • Year: 2005

Abstract

Most of the information stored in digital form is hidden in natural language texts. Extracting it and storing it in a formal representation (e.g. in the form of relations in databases) allows efficient querying, easy administration and further automatic processing of the extracted data. The area of information extraction (IE) comprises techniques, algorithms and methods that perform two important tasks: finding (identifying) the desired, relevant data and storing it in an appropriate form for future use. The rapidly increasing number and diversity of IE systems are evidence of continuous activity and growing attention to this field. At the same time, it is becoming more and more difficult to survey the scope of IE and to see the advantages of certain approaches and how they differ from others. In this paper we identify and describe promising approaches to IE. We focus on adaptive systems that can be customized for new domains through training or the use of external knowledge sources. Based on the observed origins and requirements of the examined IE techniques, we establish a classification of different types of adaptive IE systems.