Technical-term translation represents one of the most difficult tasks for human translators since (1) most translators are not familiar with terms and domain-specific terminology and (2) such terms are not adequately covered by printed dictionaries. This paper describes an algorithm for translating technical words and terms from noisy parallel corpora across language groups. Given any word which is part of a technical term in the source language, the algorithm produces a ranked candidate match for it in the target language. Potential translations for the term are compiled from the matched words and are also ranked. We show how this ranked list helps translators in technical-term translation. Most algorithms for lexical and term translation focus on Indo-European language pairs, and most use a sentence-aligned clean parallel corpus without insertion, deletion or OCR noise. Our algorithm is language- and character-set-independent, and is robust to noise in the corpus. We show how our algorithm requires minimum preprocessing and is able to obtain technical-word translations without sentence-boundary identification or sentence alignment, from the English–Japanese awk manual corpus with noise arising from text insertions or deletions and on the English–Chinese HKUST bilingual corpus. We obtain a precision of 55.35% from the awk corpus for word translation including rare words, counting only the best candidate and direct translations. Translation precision of the best-candidate translation is 89.93% from the HKUST corpus. Potential term translations produced by the program help bilingual speakers to get a 47% improvement in translating technical terms.
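The precision figures above count only the best candidate in each ranked list. As a rough illustration of that scoring convention (not the paper's evaluation code), the following minimal sketch computes best-candidate precision. The function name `top1_precision`, the dictionary layout, and the toy English–Japanese entries are all assumptions made for this example and do not come from the awk or HKUST corpora.

```python
def top1_precision(ranked, gold):
    """Fraction of source words whose best-ranked candidate is an accepted translation.

    ranked: dict mapping a source word to its candidate list, best first (hypothetical layout).
    gold:   dict mapping a source word to the set of translations judged correct.
    Only words present in both dictionaries are scored.
    """
    evaluated = [word for word in ranked if word in gold]
    if not evaluated:
        return 0.0
    correct = sum(
        1 for word in evaluated
        if ranked[word] and ranked[word][0] in gold[word]
    )
    return correct / len(evaluated)


# Toy data, invented for illustration only.
ranked = {
    "pattern": ["パターン", "文字列"],
    "field": ["レコード", "フィールド"],  # correct answer is ranked second, so it does not count
    "record": ["レコード"],
    "string": ["文字列", "パターン"],
}
gold = {
    "pattern": {"パターン"},
    "field": {"フィールド"},
    "record": {"レコード"},
    "string": {"文字列"},
}

precision = top1_precision(ranked, gold)
print(f"{precision:.2%}")
```

Under this convention a correct translation that appears lower in the list, as for "field" here, contributes nothing to the score, which is why a best-candidate figure is a conservative measure of how useful the full ranked list is to a translator.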