Combining statistical and semantic approaches to the translation of ontologies and taxonomies

Authors:
John McCrae;Mauricio Espinoza;Elena Montiel-Ponsoda;Guadalupe Aguado-de-Cea;Philipp Cimiano
Affiliations:
Universität Bielefeld, Bielefeld, Germany;Universidad de Cuenca, Cuenca, Ecuador;Universidad Politécnica de Madrid, Madrid, Spain;Universidad Politécnica de Madrid, Madrid, Spain;Universität Bielefeld, Bielefeld, Germany
Venue:
SSST-5 Proceedings of the Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation
Year:
2011

Citing 16
Cited 0

BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Yago: a core of semantic knowledge

Proceedings of the 16th international conference on World Wide Web
Automatic evaluation of machine translation quality using n-gram co-occurrence statistics

HLT '02 Proceedings of the second international conference on Human Language Technology Research
BioCaster

Bioinformatics
Moses: open source toolkit for statistical machine translation

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Discovering Missing Background Knowledge in Ontology Matching

Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Ontology localization

Proceedings of the fifth international conference on Knowledge capture
Domain adaptation for statistical machine translation with domain dictionary and monolingual corpora

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Data-driven semantic analysis for multilingual WSD and lexical selection in translation

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Statistical Machine Translation

Statistical Machine Translation
DBpedia: a nucleus for a web of open data

ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
Enriching an ontology with multilingual information

ESWC'08 Proceedings of the 5th European semantic web conference on The semantic web: research and applications
A note on ontology localization

Applied Ontology
BabelNet: building a very large multilingual semantic network

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Generating tailored textual summaries from ontologies

ESWC'05 Proceedings of the Second European conference on The Semantic Web: research and Applications
A survey of schema-based matching approaches

Journal on Data Semantics IV

Quantified Score

Hi-index	0.00

Visualization

Abstract

Ontologies and taxonomies are widely used to organize concepts providing the basis for activities such as indexing, and as background knowledge for NLP tasks. As such, translation of these resources would prove useful to adapt these systems to new languages. However, we show that the nature of these resources is significantly different from the "free-text" paradigm used to train most statistical machine translation systems. In particular, we see significant differences in the linguistic nature of these resources and such resources have rich additional semantics. We demonstrate that as a result of these linguistic differences, standard SMT methods, in particular evaluation metrics, can produce poor performance. We then look to the task of leveraging these semantics for translation, which we approach in three ways: by adapting the translation system to the domain of the resource; by examining if semantics can help to predict the syntactic structure used in translation; and by evaluating if we can use existing translated taxonomies to disambiguate translations. We present some early results from these experiments, which shed light on the degree of success we may have with each approach.