Linguistically informed mining lexical semantic relations from Wikipedia structure

  • Authors:
  • Maciej Piasecki;Agnieszka Indyka-Piasecka;Roman Kurc

  • Affiliations:
  • Institute of Informatics, Wrocław University of Technology, Poland;Institute of Informatics, Wrocław University of Technology, Poland;Institute of Informatics, Wrocław University of Technology, Poland

  • Venue:
  • ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
  • Year:
  • 2011

Abstract

A method for extracting wordnet lexico-semantic relations from Polish Wikipedia articles was proposed. The method is based on a set of hand-written lexico-morphosyntactic extraction patterns that were developed in less than one man-week of work. Two kinds of patterns were proposed: those that process encyclopaedia articles as text documents, and those that utilise information about the structure of a Wikipedia article (including links). Two types of evaluation were applied: manual assessment of the extracted data, and assessment based on using the extracted data as an additional knowledge source in automatic plWordNet expansion.