Learning semantic n-ary relations from Wikipedia

  • Authors:
  • Marko Banek;Damir Jurić;Zoran Skočir

  • Affiliations:
  • University of Zagreb, Faculty of Electrical Engineering and Computing, Zagreb, Croatia;University of Zagreb, Faculty of Electrical Engineering and Computing, Zagreb, Croatia;University of Zagreb, Faculty of Electrical Engineering and Computing, Zagreb, Croatia

  • Venue:
  • DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automated construction of ontologies from text corpora, which saves both time and human effort, is a principal condition for realizing the idea of the Semantic Web. However, the recently proposed automated techniques are still limited in the scope of context that can be captured. Moreover, the source corpora generally lack the consensus of ontology users regarding the understanding and interpretation of ontology concepts. In this paper we introduce an unsupervised method for learning domain n-ary relations from Wikipedia articles, thus harvesting the consensus reached by the largest world community engaged in collecting and classifying knowledge. Providing ontologies with n-ary relations instead of the standard binary relations built on the subject-verb-object paradigm results in preserving the initial context of time, space, cause, reason or quantity that otherwise would be lost irreversibly. Our preliminary experiments with a prototype software tool show highly satisfactory results when extracting ternary and quaternary relations, as well as the traditional binary ones.