Compounded words are a challenge for NLP applications such as machine translation (MT). We introduce methods to learn splitting rules from monolingual and parallel corpora. We evaluate them against a gold standard and measure their impact on the performance of statistical MT systems. Results show an accuracy of 99.1% and performance gains for MT of 0.039 BLEU on a German-English noun phrase translation task.
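
The abstract does not spell out how splits are chosen, so the following is only an illustrative sketch of one common frequency-based heuristic, not necessarily the method evaluated here: enumerate segmentations of a word into parts that occur in a monolingual corpus, then keep the segmentation whose parts have the highest geometric mean of corpus counts. The toy counts in `COUNTS`, the linking elements in `FILLERS`, the minimum part length, and all function names are assumptions made for this example.

```python
from math import prod

# Hypothetical toy counts standing in for a monolingual corpus.
COUNTS = {
    "aktionsplan": 2,
    "aktion": 50,
    "plan": 100,
    "akt": 10,
    "ion": 1,
}

FILLERS = ("", "s", "es")  # assumed linking elements between parts
MIN_PART_LEN = 3           # assumed minimum length of a part


def candidate_splits(word, counts):
    """Yield every segmentation of `word` into words seen in the corpus."""
    word = word.lower()
    if counts.get(word, 0) > 0:
        yield [word]  # the unsplit word is itself a candidate
    for i in range(MIN_PART_LEN, len(word) - MIN_PART_LEN + 1):
        head = word[:i]
        if counts.get(head, 0) == 0:
            continue
        rest = word[i:]
        for filler in FILLERS:
            if not rest.startswith(filler):
                continue
            tail = rest[len(filler):]
            if len(tail) < MIN_PART_LEN:
                continue
            # Recurse so that compounds with more than two parts are covered.
            for tail_split in candidate_splits(tail, counts):
                yield [head] + tail_split


def score(parts, counts):
    """Geometric mean of the corpus counts of the parts."""
    return prod(counts[p] for p in parts) ** (1.0 / len(parts))


def best_split(word, counts=COUNTS):
    """Return the highest-scoring segmentation, or the word itself if none exists."""
    options = list(candidate_splits(word, counts))
    if not options:
        return [word.lower()]
    return max(options, key=lambda parts: score(parts, counts))


if __name__ == "__main__":
    print(best_split("Aktionsplan"))
```

With these toy counts, the split into "aktion" and "plan" outscores both the unsplit word and the three-way split, because the geometric mean penalizes segmentations that contain a rare part. A heuristic of this kind uses monolingual data only; the parallel-corpus methods mentioned in the abstract are not covered by this sketch.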