A statistical approach to machine translation
Computational Linguistics
A stochastic parts program and noun phrase parser for unrestricted text
ANLC '88 Proceedings of the second conference on Applied natural language processing
COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3
Identifying word correspondence in parallel texts
HLT '91 Proceedings of the workshop on Speech and Natural Language
Translating collocations for bilingual lexicons: a statistical approach
Computational Linguistics
The decomposition of human-written summary sentences
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Termight: Coordinating Humans and Machines in Bilingual Terminology Acquisition
Machine Translation
Bilingual Sentence Alignment: Balancing Robustness and Accuracy
Machine Translation
The Origins of the Translator‘s Workstation
Machine Translation
Alignment and Matching of Bilingual English–Chinese News Texts
Machine Translation
Using hidden Markov modeling to decompose human-written summaries
Computational Linguistics - Summarization
Bilingual Dictionary Based Sentence Alignment for Chinese English Bitext
ICMI '00 Proceedings of the Third International Conference on Advances in Multimodal Interfaces
Knowledge Extraction from Bilingual Corpora
Information Extraction: Towards Scalable, Adaptable Systems
Parallel Strands: A Preliminary Investigation into Mining the Web for Bilingual Text
AMTA '98 Proceedings of the Third Conference of the Association for Machine Translation in the Americas on Machine Translation and the Information Soup
Ordering Translation Templates by Assigning Confidence Factors
AMTA '98 Proceedings of the Third Conference of the Association for Machine Translation in the Americas on Machine Translation and the Information Soup
A Self-Learning Method of Parallel Texts Alignment
AMTA '00 Proceedings of the 4th Conference of the Association for Machine Translation in the Americas on Envisioning Machine Translation in the Information Future
Adaptive Bilingual Sentence Alignment
AMTA '02 Proceedings of the 5th Conference of the Association for Machine Translation in the Americas on Machine Translation: From Research to Real Users
Bootstrapping the Lexicon Building Process for Machine Translation between `New' Languages
AMTA '02 Proceedings of the 5th Conference of the Association for Machine Translation in the Americas on Machine Translation: From Research to Real Users
Fast and Accurate Sentence Alignment of Bilingual Corpora
AMTA '02 Proceedings of the 5th Conference of the Association for Machine Translation in the Americas on Machine Translation: From Research to Real Users
Dynamic histogram warping of image pairs for constant image brightness
ICIP '95 Proceedings of the 1995 International Conference on Image Processing (Vol.2)-Volume 2 - Volume 2
Using cognates to align sentences in bilingual corpora
CASCON '93 Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: distributed computing - Volume 2
Computational Linguistics - Special issue on using large corpora: I
The mathematics of statistical machine translation: parameter estimation
Computational Linguistics - Special issue on using large corpora: II
A class-based approach to word alignment
Computational Linguistics
Stochastic inversion transduction grammars and bilingual parsing of parallel corpora
Computational Linguistics
Bitext maps and alignment via pattern recognition
Computational Linguistics
Natural Language Engineering
Automatic construction of parallel English-Chinese corpus for cross-language information retrieval
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
An automatic reviser: the TransCheck system
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Word-for-word glossing with contextually similar words
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Termight: identifying and translating technical terminology
ANLC '94 Proceedings of the fourth conference on Applied natural language processing
Example retrieval from a translation memory
Natural Language Engineering
Tagging and alignment of parallel texts: current status of BCP
ANLC '92 Proceedings of the third conference on Applied natural language processing
Automating the acquisition of bilingual terminology
EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
Text alignment in a tool for translating revised documents
EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
An alignment method for noisy parallel corpora based on image processing techniques
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
A portable algorithm for mapping bitext correspondence
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
A word-to-word model of translational equivalence
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Bitext correspondences through rich mark-up
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
An experiment in hybrid dictionary and statistical sentence alignment
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Methods and practical issues in evaluating alignment techniques
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Aligning sentences in parallel corpora
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Char_align: a program for aligning parallel texts at the character level
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Aligning sentences in bilingual corpora using lexical information
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
An algorithm for finding noun phrase correspondences in bilingual corpora
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Structural matching of parallel texts
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
An algorithm for simultaneously bracketing parallel texts by aligning words
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Aligning a parallel English-Chinese corpus statistically with lexical criteria
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Automatic alignment in parallel corpora
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
K-vec: a new approach for aligning parallel texts
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
A matching technique in Example-Based Machine Translation
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
A part-of-speech-based alignment algorithm
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
MULTEXT: Multilingual Text Tools and Corpora
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Aligning sentences in bilingual texts: French-English and French-Arabic
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Learning translation templates from bilingual text
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
CTM: an example-based translation aid system
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 4
Extracting word correspondences from bilingual corpora based on word co-occurrences information
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Extracting semantic clusters from the alignment of definitions
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Automatic detection of omissions in translations
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Meta-evaluation of summaries in a cross-lingual environment using content-based metrics
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Extracting paraphrases from a parallel corpus
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
TotalRecall: a bilingual concordance for computer assisted translation and language learning
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2
Chinese-Korean word alignment based on linguistic comparison
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Using confidence bands for parallel texts alignment
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Translating collocations for use in bilingual lexicons
HLT '94 Proceedings of the workshop on Human Language Technology
Constrained EM for parallel text alignment
Natural Language Engineering
Improving Machine Translation Performance by Exploiting Non-Parallel Corpora
Computational Linguistics
Word alignment of English-Chinese bilingual corpus based on chunks
EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Constructing of a large-scale Chinese-English parallel corpus
COLING '02 Proceedings of the 3rd workshop on Asian language resources and international standardization - Volume 12
Alignment and extraction of bilingual legal terminology from context profiles
COMPUTERM '02 COLING-02 on COMPUTERM 2002: second international workshop on computational terminology - Volume 14
COLING-MTIA '02 Proceedings of the 2002 COLING workshop on Machine translation in Asia - Volume 16
Efficient optimization for bilingual sentence alignment based on linear regression
HLT-NAACL-PARALLEL '03 Proceedings of the HLT-NAACL 2003 Workshop on Building and using parallel texts: data driven machine translation and beyond - Volume 3
HLT-NAACL-PARALLEL '03 Proceedings of the HLT-NAACL 2003 Workshop on Building and using parallel texts: data driven machine translation and beyond - Volume 3
Extracting structural paraphrases from aligned monolingual corpora
PARAPHRASE '03 Proceedings of the second international workshop on Paraphrasing - Volume 16
Statistical machine translation using coercive two-level syntactic transduction
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Sentence alignment for monolingual comparable corpora
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Development in parsing technology: from theory to application
New developments in parsing technology
A framework for aligning and indexing movies with their script
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
Creating and exploiting a comparable corpus in cross-language information retrieval
ACM Transactions on Information Systems (TOIS)
Subsentential translation memory for computer assisted writing and translation
ACLdemo '04 Proceedings of the ACL 2004 on Interactive poster and demonstration sessions
A DOM tree alignment model for mining parallel data from the web
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
MTTK: an alignment toolkit for statistical machine translation
NAACL-Demonstrations '06 Proceedings of the 2006 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume: demonstrations
Journal of the American Society for Information Science and Technology
SlideSeer: a digital library of aligned document and presentation pairs
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
ATLAS: a new text alignment architecture
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Focused web crawling in the acquisition of comparable corpora
Information Retrieval
Chinese Ancient-Modern Sentence Alignment
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part II
Cross Sentence Alignment for Structurally Dissimilar Corpus Based on Singular Value Decomposition
ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
Automatic extraction of translations from web-based bilingual materials
Machine Translation
WSEAS Transactions on Computers
A re-examination on features in regression based approach to automatic MT evaluation
HLT-SRWS '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Student Research Workshop
Translation corpus source and size in bilingual retrieval
NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
SemEval-2010 task 3: cross-lingual word sense disambiguation
DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions
Mining a comparable text corpus for a Vietnamese - French statistical machine translation system
StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Artificial Intelligence in Medicine
A hybrid approach to align sentences and words in English-Hindi parallel corpora
ParaText '05 Proceedings of the ACL Workshop on Building and Using Parallel Texts
Comparison, selection and use of sentence alignment algorithms for new language pairs
ParaText '05 Proceedings of the ACL Workshop on Building and Using Parallel Texts
Arabic to French sentence alignment: exploration of a cross-language information retrieval approach
Semitic '07 Proceedings of the 2007 Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources
Compiling a massive, multilingual dictionary via probabilistic inference
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Chinese-Uyghur sentence alignment: an approach based on anchor sentences
BUCC '09 Proceedings of the 2nd Workshop on Building and Using Comparable Corpora: from Parallel to Non-parallel Corpora
Exploiting aligned parallel corpora in multilingual studies and applications
IWIC'07 Proceedings of the 1st international conference on Intercultural collaboration
Incremental speech translation
Incremental speech translation
Panlingual lexical translation via probabilistic inference
Artificial Intelligence
Unsupervised translation disambiguation based on maximum web bilingual relatedness: web as lexicon
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7
SemEval-2010 task 3: Cross-lingual word sense disambiguation
SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
IEEE Transactions on Audio, Speech, and Language Processing
Generating phrasal and sentential paraphrases: A survey of data-driven methods
Computational Linguistics
Development of Hindi-Punjabi parallel corpus using existing Hindi-Punjabi machine translation system
Proceedings of the First International Conference on Intelligent Interactive Technologies and Multimedia
TEP: Tehran English-Persian parallel corpus
CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Multilingual sentence alignment from Wikipedia as multilingual comparable corpora
Proceedings of the 13th International Conference on Humans and Computers
From bilingual dictionaries to interlingual document representations
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Explicit length modelling for statistical machine translation
IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
Simultaneous similarity learning and feature-weight learning for document clustering
TextGraphs-6 Proceedings of TextGraphs-6: Graph-based Methods for Natural Language Processing
Alignment of paragraphs in bilingual texts using bilingual dictionaries and dynamic programming
CIARP'06 Proceedings of the 11th Iberoamerican conference on Progress in Pattern Recognition, Image Analysis and Applications
A bilingual corpus of novels aligned at paragraph level
FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Evaluation of alignment methods for HTML parallel text
FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
New approach for collecting high quality parallel corpora from multilingual websites
Proceedings of the 13th International Conference on Information Integration and Web-based Applications and Services
Mining bilingual lexical equivalences out of parallel corpora
SETN'06 Proceedings of the 4th Helenic conference on Advances in Artificial Intelligence
Bilingual sentence alignment based on punctuation statistics and lexicon
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Chinese-Japanese clause alignment
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Improving bilingual projections via sparse covariance matrices
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
Extracting parallel paragraphs and sentences from english-persian translated documents
AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Explicit length modelling for statistical machine translation
Pattern Recognition
Rediscovering ACL discoveries through the lens of ACL anthology network citing sentences
ACL '12 Proceedings of the ACL-2012 Special Workshop on Rediscovering 50 Years of Discoveries
Lexical-based alignment for reconstruction of structure in parallel texts
NLDB'07 Proceedings of the 12th international conference on Applications of Natural Language to Information Systems
Computer Speech and Language
Distributional phrasal paraphrase generation for statistical machine translation
ACM Transactions on Intelligent Systems and Technology (TIST) - Special Sections on Paraphrasing; Intelligent Systems for Socially Aware Computing; Social Computing, Behavioral-Cultural Modeling, and Prediction
Hi-index | 0.00 |
Researchers in both machine translation (e.g., Brown et al., 1990) and bilingual lexicography (e.g., Klavans and Tzoukermann, 1990) have recently become interested in studying parallel texts, texts such as the Canadian Hansards (parliamentary proceedings) which are available in multiple languages (French and English). This paper describes a method for aligning sentences in these parallel texts, based on a simple statistical model of character lengths. The method was developed and tested on a small trilingual sample of Swiss economic reports. A much larger sample of 90 million words of Canadian Hansards has been aligned and donated to the ACL/DCI.