Distinguishing between instances and classes in the wikipedia taxonomy

  • Authors:
  • Cäcilia Zirn;Vivi Nastase;Michael Strube

  • Affiliations:
  • EML Research gGmbH, Heidelberg, Germany and Department of Computational Linguistics, University of Heidelberg, Heidelberg, Germany;EML Research gGmbH, Heidelberg, Germany;EML Research gGmbH, Heidelberg, Germany

  • Venue:
  • ESWC'08 Proceedings of the 5th European semantic web conference on The semantic web: research and applications
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents an automatic method for differentiating between instances and classes in a large scale taxonomy induced from the Wikipedia category network. The method exploits characteristics of the category names and the structure of the network. The approach we present is the first attempt to make this distinction automatically in a large scale resource. In contrast, this distinction has been made in WordNet and Cyc based on manual annotations. The result of the process is evaluated against ResearchCyc. On the subnetwork shared by our taxonomy and ResearchCyc we report 84.52% accuracy.