Automatic dictionary creation by sub-symbolic encoding of words

  • Authors:
  • Filippo Vella;Giovanni Pilato;Ignazio Motisi;Salvatore Gaglio

  • Affiliations:
  • DINFO – Dipartimento di ingegneria INFOrmatica, University of Palermo, Palermo, Italy;ICAR – Istituto di CAlcolo e Reti ad alte prestazioni, Italian National Research Council, Palermo, Italy;DINFO – Dipartimento di ingegneria INFOrmatica, University of Palermo, Palermo, Italy;DINFO – Dipartimento di ingegneria INFOrmatica, University of Palermo, Palermo, Italy

  • Venue:
  • WIRN'05 Proceedings of the 16th Italian conference on Neural Nets
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a technique for automatic creation of dictionaries using sub-symbolic representation of words in cross-language context. Semantic relationship among words of two languages is extracted from aligned bilingual text corpora. This feature is obtained applying the Latent Semantic Analysis technique to the matrices representing terms co-occurrences in aligned text fragments. The technique allows to find the “best translation” according to a properly defined geometric distance in an automatically created semantic space. Experiments show an interesting correctness of 95% obtained in the best case.