Using Wiktionary for computing semantic relatedness

Authors:
Torsten Zesch;Christof Müller;Iryna Gurevych
Affiliations:
Ubiquitous Knowledge Processing Lab, Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany;Ubiquitous Knowledge Processing Lab, Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany;Ubiquitous Knowledge Processing Lab, Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany
Venue:
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Year:
2008

Abstract

We introduce Wiktionary as an emerging lexical semantic resource that can be used as a substitute for expert-made resources in AI applications. We evaluate Wiktionary on the pervasive task of computing semantic relatedness for English and German, both by correlating its scores with human rankings and by solving word choice problems. For the first time, we apply a concept-vector-based measure to a set of different concept representations: Wiktionary pseudo glosses, the first paragraph of Wikipedia articles, English WordNet glosses, and GermaNet pseudo glosses. We show that: (i) Wiktionary is the best lexical semantic resource in the ranking task and performs comparably to other resources in the word choice task, and (ii) the concept-vector-based approach yields the best results on all datasets in both evaluations.