The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Word Sense Disambiguation: Algorithms and Applications (Text, Speech and Language Technology)
Word Sense Disambiguation: Algorithms and Applications (Text, Speech and Language Technology)
Yago: a core of semantic knowledge
Proceedings of the 16th international conference on World Wide Web
Automatically refining the wikipedia infobox ontology
Proceedings of the 17th international conference on World Wide Web
Inter-coder agreement for computational linguistics
Computational Linguistics
Personalizing PageRank for word sense disambiguation
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Large-scale taxonomy mapping for restructuring and integrating wikipedia
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Knowledge-rich Word Sense Disambiguation rivaling supervised systems
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Automatic assignment of wikipedia encyclopedic entries to wordnet synsets
AWIC'05 Proceedings of the Third international conference on Advances in Web Intelligence
Automatically enriching a thesaurus with information from dictionaries
EPIA'11 Proceedings of the 15th Portugese conference on Progress in artificial intelligence
Subcat-LMF: fleshing out a standardized format for subcategorization frame interoperability
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Uby: a large-scale unified lexical-semantic resource based on LMF
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Collaboratively built semi-structured content and Artificial Intelligence: The story so far
Artificial Intelligence
Language Resources and Evaluation
Determining the conceptual space of metaphoric expressions
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Improved text annotation with Wikipedia entities
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Hi-index | 0.00 |
We propose a method to automatically align WordNet synsets and Wikipedia articles to obtain a sense inventory of higher coverage and quality. For each WordNet synset, we first extract a set of Wikipedia articles as alignment candidates; in a second step, we determine which article (if any) is a valid alignment, i.e. is about the same sense or concept. In this paper, we go significantly beyond state-of-the-art word overlap approaches, and apply a threshold-based Personalized PageRank method for the disambiguation step. We show that WordNet synsets can be aligned to Wikipedia articles with a performance of up to 0.78 F1-Measure based on a comprehensive, well-balanced reference dataset consisting of 1,815 manually annotated sense alignment candidates. The fully-aligned resource as well as the reference dataset is publicly available.