Word translation disambiguation using multinomial classifiers

  • Authors:
  • Jesús Andrés; José R. Navarro; Alfons Juan; Francisco Casacuberta

  • Affiliations:
  • Departamento de Sistemas Informáticos y Computación, Instituto Tecnológico de Informática, Universidad Politécnica de Valencia (all authors)

  • Venue:
  • IbPRIA'05: Proceedings of the Second Iberian Conference on Pattern Recognition and Image Analysis - Volume Part II
  • Year:
  • 2005

Abstract

This work concerns SisHiTra, a hybrid machine translation system from Spanish into Catalan. In particular, we focus on its word translation disambiguation module, which must choose the correct translation of each ambiguous input word according to its context. We propose statistical pattern recognition techniques for this task, specifically multinomial Naive Bayes text classifiers. We present extensive empirical results on the use of these classifiers, carefully studying the influence of the window (context) size and of parameter smoothing.
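
To make the approach in the abstract concrete, the following is a minimal sketch of a multinomial Naive Bayes classifier that picks a translation for an ambiguous word from the words in a window around it. It is not the SisHiTra implementation: the class and function names, the add-alpha (Laplace) smoothing scheme, the symmetric window, and the four toy training sentences for Spanish "hoja" (Catalan "fulla" for a leaf, "full" for a sheet of paper) are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def context_window(tokens, index, size):
    """Return up to `size` tokens on each side of position `index`."""
    left = tokens[max(0, index - size):index]
    right = tokens[index + 1:index + 1 + size]
    return left + right


class MultinomialNB:
    """Multinomial Naive Bayes with add-alpha smoothing (illustrative only)."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha                      # smoothing parameter (assumed scheme)
        self.class_counts = Counter()           # training examples per translation
        self.word_counts = defaultdict(Counter) # context-word counts per translation
        self.vocab = set()

    def fit(self, contexts, labels):
        for words, label in zip(contexts, labels):
            self.class_counts[label] += 1
            self.word_counts[label].update(words)
            self.vocab.update(words)
        return self

    def log_scores(self, words):
        total = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        scores = {}
        for label, n in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + self.alpha * vocab_size
            score = math.log(n / total)         # log prior
            for w in words:
                if w in self.vocab:             # ignore words never seen in training
                    score += math.log((counts[w] + self.alpha) / denom)
            scores[label] = score
        return scores

    def predict(self, words):
        scores = self.log_scores(words)
        return max(scores, key=scores.get)


# Hypothetical toy data: (Spanish sentence, Catalan translation of "hoja").
TRAIN = [
    ("la hoja del árbol cae en otoño", "fulla"),
    ("una hoja verde del árbol", "fulla"),
    ("escribe en la hoja de papel", "full"),
    ("una hoja de papel en blanco", "full"),
]
WINDOW = 2  # context words taken on each side of the ambiguous word


def extract(sentence, target="hoja", size=WINDOW):
    """Tokenize on whitespace and return the window around the first `target`."""
    tokens = sentence.split()
    return context_window(tokens, tokens.index(target), size)


model = MultinomialNB(alpha=1.0).fit(
    [extract(s) for s, _ in TRAIN], [label for _, label in TRAIN]
)

for sentence in ("la hoja de papel está rota", "la hoja verde del árbol"):
    print(sentence, "->", model.predict(extract(sentence)))
```

Changing `WINDOW` and `alpha` in this sketch corresponds to the two factors the abstract says were studied, window size and parameter smoothing, although the smoothing methods actually evaluated in the paper may differ from the add-alpha scheme assumed here.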