Full text indexing based on lexical relations an application: software libraries
SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
Word association norms, mutual information, and lexicography
Computational Linguistics
A statistical approach to machine translation
Computational Linguistics
Identifying word correspondence in parallel texts
HLT '91 Proceedings of the workshop on Speech and Natural Language
Information retrieval: data structures and algorithms
Information retrieval: data structures and algorithms
Elements of information theory
Elements of information theory
Extracting collocations from text. An application: language generation
Extracting collocations from text. An application: language generation
Word sense disambiguation using a second language monolingual corpus
Computational Linguistics
Generating summaries of multiple news articles
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Simulation and the Monte Carlo Method
Simulation and the Monte Carlo Method
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Using cognates to align sentences in bilingual corpora
CASCON '93 Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: distributed computing - Volume 2
A program for aligning sentences in bilingual corpora
Computational Linguistics - Special issue on using large corpora: I
Retrieving collocations from text: Xtract
Computational Linguistics - Special issue on using large corpora: I
The mathematics of statistical machine translation: parameter estimation
Computational Linguistics - Special issue on using large corpora: II
A stochastic parts program and noun phrase parser for unrestricted text
ANLC '88 Proceedings of the second conference on Applied natural language processing
Termight: identifying and translating technical terminology
ANLC '94 Proceedings of the fourth conference on Applied natural language processing
Automating the acquisition of bilingual terminology
EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
Text alignment in a tool for translating revised documents
EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
Two languages are more informative than one
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Aligning sentences in parallel corpora
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
A program for aligning sentences in bilingual corpora
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Word-sense disambiguation using statistical methods
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
From N-grams to collocations: an evaluation of Xtract
ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Char_align: a program for aligning parallel texts at the character level
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Aligning sentences in bilingual corpora using lexical information
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
An algorithm for finding noun phrase correspondences in bilingual corpora
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Contextual word similarity and estimation from sparse data
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
A pattern matching method for finding noun and proper noun translations from noisy parallel corpora
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
A corpus-based approach to automatic compound extraction
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Automatically extracting and representing collocations for language generation
ACL '90 Proceedings of the 28th annual meeting on Association for Computational Linguistics
COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3
The GE NLToolset: a software foundation for intelligent text processing
COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3
HLT '93 Proceedings of the workshop on Human Language Technology
The Candide system for machine translation
HLT '94 Proceedings of the workshop on Human Language Technology
Phrasal translation and query expansion techniques for cross-language information retrieval
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Resolving ambiguity for cross-language retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Automatic identification and organization of index terms for interactive browsing
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Anchor text mining for translation extraction of query terms
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Using text processing techniques to automatically enrich a domain ontology
Proceedings of the international conference on Formal Ontology in Information Systems - Volume 2001
Translation of web queries using anchor text mining
ACM Transactions on Asian Language Information Processing (TALIP)
Learning Translation Templates from Bilingual Translation Examples
Applied Intelligence
Termight: Coordinating Humans and Machines in Bilingual Terminology Acquisition
Machine Translation
EPIA '99 Proceedings of the 9th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Extracting Equivalents from Aligned Parallel Texts: Comparison of Measures of Similarity
IBERAMIA-SBIA '00 Proceedings of the International Joint Conference, 7th Ibero-American Conference on AI: Advances in Artificial Intelligence
Knowledge Extraction from Bilingual Corpora
Information Extraction: Towards Scalable, Adaptable Systems
Lexical Semantic Ambiguity Resolution with Bigram-Based Decision Trees
CICLing '01 Proceedings of the Second International Conference on Computational Linguistics and Intelligent Text Processing
Aligning Multiword Terms Using a Hybrid Approach
CICLing '02 Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing
A Statistical View on Bilingual Lexicon Extraction: From Parallel Corpora to Non-parallel Corpora
AMTA '98 Proceedings of the Third Conference of the Association for Machine Translation in the Americas on Machine Translation and the Information Soup
Korean-Chinese Machine Translation Based on Verb Patterns
AMTA '02 Proceedings of the 5th Conference of the Association for Machine Translation in the Americas on Machine Translation: From Research to Real Users
A prototype multilingual document browser for ancient Greek texts
The New Review of Hypermedia and Multimedia
A systematic comparison of various statistical alignment models
Computational Linguistics
Bottom-up relational learning of pattern matching rules for information extraction
The Journal of Machine Learning Research
A class-based approach to word alignment
Computational Linguistics
Multilingual coreference resolution
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Semi-automatic acquisition of domain-specific translation lexicons
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Implicit ambiguity resolution using incremental clustering in cross-language information retrieval
Information Processing and Management: an International Journal
Retrieving collocations by co-occurrences and word order constraints
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Learning parse and translation decisions from examples with rich context
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Bitext correspondences through rich mark-up
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
A simple hybrid aligner for generating lexical correspondences in parallel texts
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
An IR approach for translating new words from nonparallel, comparable texts
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
High-performance bilingual text alignment using statistical and dictionary information
ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Anchor text mining for translation of Web queries: A transitive translation approach
ACM Transactions on Information Systems (TOIS)
Event detection from online news documents for supporting environmental scanning
Decision Support Systems - Special issue: Knowledge management technique
Automatic thesaurus generation through multiple filtering
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Structural feature selection for English-Korean statistical machine translation
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Learning bilingual collocations by word-level sorting
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Translating unknown cross-lingual queries in digital libraries using a web-based approach
Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries
Translating unknown queries with web corpora for cross-language information retrieval
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Measures of distributional similarity
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Automatic identification of non-compositional phrases
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Mixed language query disambiguation
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Learning translations of named-entity phrases from parallel corpora
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Combining clues for word alignment
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Translation Disambiguation in Mixed Language Queries
Machine Translation
A cheap and fast way to build useful translation lexicons
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
A transitive model for extracting translation equivalents of web queries through anchor text mining
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Unknown word extraction for Chinese documents
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
A decision tree of bigrams is an accurate predictor of word sense
NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Chinese-Korean word alignment based on linguistic comparison
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Optimization of word alignment clues
Natural Language Engineering
A comparative study on translation units for bilingual lexicon extraction
DMMT '01 Proceedings of the workshop on Data-driven methods in machine translation - Volume 14
Identification of relevant terms to support the construction of domain ontologies
HLTKM '01 Proceedings of the workshop on Human Language Technology and Knowledge Management - Volume 2001
NLP found helpful (at least for one text categorization task)
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Alignment and extraction of bilingual legal terminology from context profiles
COMPUTERM '02 COLING-02 on COMPUTERM 2002: second international workshop on computational terminology - Volume 14
Learning sequence-to-sequence correspondences from parallel corpora via sequential pattern mining
HLT-NAACL-PARALLEL '03 Proceedings of the HLT-NAACL 2003 Workshop on Building and using parallel texts: data driven machine translation and beyond - Volume 3
HLT-NAACL-PARALLEL '03 Proceedings of the HLT-NAACL 2003 Workshop on Building and using parallel texts: data driven machine translation and beyond - Volume 3
A bottom-up merging algorithm for Chinese unknown word extraction
SIGHAN '03 Proceedings of the second SIGHAN workshop on Chinese language processing - Volume 17
Noun-noun compound machine translation: a feasibility study on shallow processing
MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18
Exploiting the Web as the multilingual corpus for unknown query translation
Journal of the American Society for Information Science and Technology
Automatic generation of Japanese–English bilingual thesauri based on bilingual corpora
Journal of the American Society for Information Science and Technology - Research Articles
Automatic extraction of bilingual word pairs using inductive chain learning in various languages
Information Processing and Management: an International Journal
Aligning word senses using bilingual corpora
ACM Transactions on Asian Language Information Processing (TALIP)
Collocation translation acquisition using monolingual corpora
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Creating multilingual translation lexicons with regional variations using web corpora
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Log-linear models for word alignment
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Alignment model adaptation for domain-specific word alignment
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Extracting loanwords from Mongolian corpora and producing a Japanese-Mongolian bilingual dictionary
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Word to word alignment strategies
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
ATLAS: a new text alignment architecture
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Modeling anchor text and classifying queries to enhance web document retrieval
Proceedings of the 17th international conference on World Wide Web
Query Classification and Expansion for Translation Mining Via Search Engines
PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Bilingually Motivated Word Segmentation for Statistical Machine Translation
ACM Transactions on Asian Language Information Processing (TALIP)
Towards the Selection of Induced Syntactic Relations
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Journal of Information Science
Context-dependent term relations for information retrieval
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Multilingual collocation extraction: issues and solutions
MLRI '06 Proceedings of the Workshop on Multilingual Language Resources and Interoperability
Acquiring word-meaning mappings for natural language interfaces
Journal of Artificial Intelligence Research
DeepLA '05 Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition
Proceedings of the 18th ACM conference on Information and knowledge management
EPIA '09 Proceedings of the 14th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Acquiring translation equivalences of multiword expressions by normalized correlation frequencies
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Extraction of transliteration pairs from parallel corpora using a statistical transliteration model
Information Sciences: an International Journal
Selecting target word using contexonym comparison method
Proceedings of the 2007 conference on Human interface: Part I
AcroDef: a quality measure for discriminating expansions of ambiguous acronyms
CONTEXT'07 Proceedings of the 6th international and interdisciplinary conference on Modeling and using context
A comparison of co-occurrence and similarity measures as simulations of context
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Unsupervised translation disambiguation based on maximum web bilingual relatedness: web as lexicon
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7
Learning to rank with (a lot of) word features
Information Retrieval
To translate or not to translate?
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Using collocation segmentation to augment the phrase table
WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
PATATRAS: retrieval model combination and regression models for prior art search
CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
Measuring historical word sense variation
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Two ways to use a noisy parallel news corpus for improving statistical machine translation
BUCC '11 Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web
An improved method for finding bilingual collocation correspondences from monolingual corpora
ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
Accommodating multiword expressions in an arabic LFG grammar
FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Improving statistical word alignments with morpho-syntactic transformations
FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Mining bilingual lexical equivalences out of parallel corpora
SETN'06 Proceedings of the 4th Helenic conference on Advances in Artificial Intelligence
PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Learning method for automatic acquisition of translation knowledge
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Improving statistical word alignment with ensemble methods
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
A measure based on optimal matching in graph theory for document similarity
AIRS'04 Proceedings of the 2004 international conference on Asian Information Retrieval Technology
Using natural alignment to extract translation equivalents
PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
The influence of collocation segmentation and top 10 items to keyword assignment performance
CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
ArabOnto: experimenting a new distributional approach for building Arabic ontological resources
International Journal of Metadata, Semantics and Ontologies
Just title it! (by an online application)
EACL '12 Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics
Statistical thesaurus construction for a morphologically rich language
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
DOMCAT: a bilingual concordancer for domain-specific computer assisted translation
ACL '12 Proceedings of the ACL 2012 System Demonstrations
Applying collocation segmentation to the ACL anthology reference corpus
ACL '12 Proceedings of the ACL-2012 Special Workshop on Rediscovering 50 Years of Discoveries
A computer science electronic dictionary for NOOJ
NLDB'07 Proceedings of the 12th international conference on Applications of Natural Language to Information Systems
A Fast and Accurate Method for Bilingual Opinion Lexicon Extraction
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Generalizing sampling-based multilingual alignment
Machine Translation
A possibilistic query translation approach for cross-language information retrieval
ICIC'13 Proceedings of the 9th international conference on Intelligent Computing Theories and Technology
Hi-index | 0.00 |
Collocations are notoriously difficult for non-native speakers to translate, primarily because they are opaque and cannot be translated on a word-by-word basis. We describe a program named Champollion which, given a pair of parallel corpora in two different languages and a list of collocations in one of them, automatically produces their translations. Our goal is to provide a tool for compiling bilingual lexical information above the word level in multiple languages, for different domains. The algorithm we use is based on statistical methods and produces p-word translations of n-word collocations in which n and p need not be the same. For example, Champollion translates make...decision, employment equity, and stock market into prendre...décision, équité en matière d'emploi, and bourse respectively. Testing Champollion on three years' worth of the Hansards corpus yielded the French translations of 300 collocations for each year, evaluated at 73% accuracy on average. In this paper, we describe the statistical measures used, the algorithm, and the implementation of Champollion, presenting our results and evaluation.