Ontology-based information extraction: An introduction and a survey of current approaches

  • Authors:
  • Daya C. Wimalasuriya; Dejing Dou

  • Affiliations:
  • Department of Computer and Information Science, University of Oregon, Eugene, OR, USA; Department of Computer and Information Science, University of Oregon, Eugene, OR, USA

  • Venue:
  • Journal of Information Science
  • Year:
  • 2010

Abstract

Information extraction (IE) aims to retrieve certain types of information from natural language text by processing it automatically. For example, an IE system might retrieve information about geopolitical indicators of countries from a set of web pages while ignoring other types of information. Ontology-based information extraction (OBIE) has recently emerged as a subfield of information extraction. Here, ontologies, which provide formal and explicit specifications of conceptualizations, play a crucial role in the IE process. Because of the use of ontologies, this field is related to knowledge representation and has the potential to assist the development of the Semantic Web. In this paper, we provide an introduction to ontology-based information extraction and review the details of different OBIE systems developed so far. We attempt to identify a common architecture among these systems and classify them based on different factors, which leads to a better understanding of their operation. We also discuss the implementation details of these systems, including the tools they use and the metrics used to measure their performance. In addition, we attempt to identify possible future directions for this field.