This paper investigates the potential for projecting linguistic annotations including part-of-speech tags and base noun phrase bracketings from one language to another via automatically word-aligned parallel corpora. First, experiments assess the accuracy of unmodified direct transfer of tags and brackets from the source language English to the target languages French and Chinese, both for noisy machine-aligned sentences and for clean hand-aligned sentences. Performance is then substantially boosted over both of these baselines by using training techniques optimized for very noisy data, yielding 94-96% core French part-of-speech tag accuracy and 90% French bracketing F-measure for stand-alone monolingual tools trained without the need for any human-annotated data in the given language.
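The direct-transfer baseline described above can be illustrated with a minimal sketch. It is not the paper's implementation: the function name `project_tags`, the `UNK` placeholder for unaligned target words, and the majority vote used when several source words align to one target word are all assumptions made for illustration. The word alignment is assumed to be given as (source index, target index) pairs.

```python
from collections import Counter


def project_tags(source_tags, alignment, target_len, unaligned_tag="UNK"):
    """Copy part-of-speech tags from source tokens to aligned target tokens.

    source_tags   -- one tag per source-language token
    alignment     -- iterable of (source_index, target_index) pairs
    target_len    -- number of tokens in the target sentence
    unaligned_tag -- hypothetical placeholder for target tokens with no link
    """
    # Collect the tags of every source token linked to each target position.
    votes = [Counter() for _ in range(target_len)]
    for src_i, tgt_j in alignment:
        votes[tgt_j][source_tags[src_i]] += 1

    projected = []
    for counter in votes:
        if counter:
            # Assumed tie-breaking policy: take the most frequent source tag.
            projected.append(counter.most_common(1)[0][0])
        else:
            projected.append(unaligned_tag)
    return projected


# English "the red house" aligned to French "la maison rouge".
english_tags = ["DT", "JJ", "NN"]
links = [(0, 0), (1, 2), (2, 1)]
french_tags = project_tags(english_tags, links, target_len=3)
print(french_tags)
```

Unaligned target words and incorrect links are where such a direct projection becomes noisy, which is the kind of noise the abstract's training techniques are said to address.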