UBA: Using automatic translation and Wikipedia for cross-lingual lexical substitution

Authors:
Pierpaolo Basile;Giovanni Semeraro
Affiliations:
University of Bari "Aldo Moro", Bari (ITALY);University of Bari "Aldo Moro", Bari (ITALY)
Venue:
SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Year:
2010

Citing 4
Cited 3

SemEval-2007 task 10: English lexical substitution task

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
SemEval-2010 task 2: cross-lingual lexical substitution

DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions
DBpedia - A crystallization point for the Web of Data

Web Semantics: Science, Services and Agents on the World Wide Web
SemEval-2010 task 2: Cross-lingual lexical substitution

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation

Unsupervised cross-lingual lexical substitution

EMNLP '11 Proceedings of the First Workshop on Unsupervised Learning in NLP
Joining forces pays off: multilingual joint word sense disambiguation

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
The cross-lingual lexical substitution task

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents the participation of the University of Bari (UBA) at the SemEval-2010 Cross-Lingual Lexical Substitution Task. The goal of the task is to substitute a word in a language Ls, which occurs in a particular context, by providing the best synonyms in a different language Lt which fit in that context. This task has a strict relation with the task of automatic machine translation, but there are some differences: Cross-lingual lexical substitution targets one word at a time and the main goal is to find as many good translations as possible for the given target word. Moreover, there are some connections with Word Sense Disambiguation (WSD) algorithms. Indeed, understanding the meaning of the target word is necessary to find the best substitutions. An important aspect of this kind of task is the possibility of finding synonyms without using a particular sense inventory or a specific parallel corpus, thus allowing the participation of unsupervised approaches. UBA proposes two systems: the former is based on an automatic translation system which exploits Google Translator, the latter is based on a parallel corpus approach which relies on Wikipedia in order to find the best substitutions.