Linguistically informed mining lexical semantic relations from wikipedia structure

Authors:
Maciej Piasecki;Agnieszka Indyka-Piasecka;Roman Kurc
Affiliations:
Institute of Informatics, Wrocław University of Technology, Poland;Institute of Informatics, Wrocław University of Technology, Poland;Institute of Informatics, Wrocław University of Technology, Poland
Venue:
ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
Year:
2011

Citing 8
Cited 0

Automatic acquisition of hyponyms from large text corpora

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Overcoming the brittleness bottleneck using wikipedia: enhancing text categorization with encyclopedic knowledge

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Comparing Wikipedia and German wordnet by evaluating semantic relatedness on multiple datasets

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Deriving a large scale taxonomy from Wikipedia

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Decoding wikipedia categories for knowledge acquisition

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Morphosyntactic Constraints in the Acquisition of Linguistic Knowledge for Polish

Aspects of Natural Language Processing
Heterogeneous knowledge sources in graph-based expansion of the polish wordnet

ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
Automatic assignment of wikipedia encyclopedic entries to wordnet synsets

AWIC'05 Proceedings of the Third international conference on Advances in Web Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

A method of the extraction of the wordnet lexico-semantic relations from the Polish Wikipedia articles was proposed. The method is based on a set of hand-written set of lexico-morphosyntactic extraction patterns that were developed in less than one man-week of workload. Two kinds of patterns were proposed: processing encyclopaedia articles as text documents, and utilising the information about the structure of the Wikipedia article (including links). Two types of evaluation were applied: manual assessment of the extracted data and on the basis of the application of the extracted data as an additional knowledge source in automatic plWordNet expansion.