Graph-Based Ontology Construction from Heterogenous Evidences

  • Authors:
  • Christoph Böhm;Philip Groth;Ulf Leser

  • Affiliations:
  • Hasso-Plattner-Institut, Potsdam, Germany 14482;Humboldt-Universität zu Berlin, Berlin, Germany 10099;Humboldt-Universität zu Berlin, Berlin, Germany 10099

  • Venue:
  • ISWC '09 Proceedings of the 8th International Semantic Web Conference
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Ontologies are tools for describing and structuring knowledge, with many applications in searching and analyzing complex knowledge bases. Since building them manually is a costly process, there are various approaches for bootstrapping ontologies automatically through the analysis of appropriate documents. Such an analysis needs to find the concepts and the relationships that should form the ontology. However, since relationship extraction methods are imprecise and cannot homogeneously cover all concepts, the initial set of relationships is usually inconsistent and rather imbalanced - a problem which, to the best of our knowledge, was mostly ignored so far. In this paper, we define the problem of extracting a consistent as well as properly structured ontology from a set of inconsistent and heterogeneous relationships. Moreover, we propose and compare three graph-based methods for solving the ontology extraction problem. We extract relationships from a large-scale data set of more than 325K documents and evaluate our methods against a gold standard ontology comprising more than 12K relationships. Our study shows that an algorithm based on a modified formulation of the dominating set problem outperforms greedy methods.