Termight: Coordinating Humans and Machines in Bilingual Terminology Acquisition

  • Authors:
  • Ido Dagan;Ken Church

  • Affiliations:
  • Department of Mathematics and Computer Science, Bar-Ilan University, 52900 Ramat Gan, Israel. dagan@.cs.biu.ac.il;AT & T Research, 600 Mountain Ave., Murray Hill, NJ 07974, U.S.A. kwc@research.att.com

  • Venue:
  • Machine Translation
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a semi-automatic tool, termight,that supports the constructionof bilingual glossaries. Termight consists of two components whichaddress the two subtasks in glossary construction: (a) preparing amonolingual list of all technical terms in a source-language document,and (b) finding the translations for these terms in parallelsource–target documents. As a first step (in each component) thetool extracts automatically candidate terms and candidatetranslations, based on term-extraction and word-alignmentalgorithms. It then performs several additional preprocessing stepswhich greatly facilitate human post-editing of the candidatelists. These steps include grouping and sorting of candidates andassociating example concordance lines with each candidate. Finally,the data prepared in preprocessing is presented to the user via aninteractive interface which supports quick post-editing operations.Termight was deployed by translators at AT & T Business TranslationServices (formerly AT & T Language Line Services) leading to veryhigh rates of semi-automatic glossary construction.