Combining clues for word alignment

Authors: Jörg Tiedemann
Affiliations: Uppsala University, Uppsala, Sweden
Venue: EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Year: 2003

Abstract

This paper presents a word alignment approach based on a combination of clues. Word alignment clues indicate associations between words and phrases. They can be based on features such as frequency, part of speech, phrase type, and the actual word-form strings. Clues can be computed with similarity measures or learned from word-aligned data. The proposed clue alignment approach makes it possible to combine association clues that take different kinds of linguistic information into account. It also allows dynamic tokenization into token units of varying size. The approach has been applied to an English/Swedish parallel text with promising results.
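
The abstract does not say how clues are combined, so the following is only an illustrative sketch of the general idea. It assumes that each clue yields a score in [0, 1] for a source/target word pair and that the clues can be treated as independent, in which case one plausible combination is a probabilistic disjunction (noisy-OR). The two clues shown (surface string similarity and a Dice co-occurrence score) and all function names are hypothetical choices made for this example, not taken from the paper.

```python
from difflib import SequenceMatcher
from math import prod


def string_similarity_clue(src: str, tgt: str) -> float:
    """Hypothetical clue: surface similarity of the two word forms, in [0, 1]."""
    return SequenceMatcher(None, src.lower(), tgt.lower()).ratio()


def dice_clue(src: str, tgt: str, sentence_pairs) -> float:
    """Hypothetical clue: Dice coefficient over sentence-level co-occurrence."""
    src_count = sum(src in s for s, _ in sentence_pairs)
    tgt_count = sum(tgt in t for _, t in sentence_pairs)
    both = sum(src in s and tgt in t for s, t in sentence_pairs)
    total = src_count + tgt_count
    return 2 * both / total if total else 0.0


def combine_clues(scores) -> float:
    """Assumed combination rule (noisy-OR): 1 - prod(1 - c_i).

    This treats the clues as independent; an empty list gives 0.0.
    """
    return 1.0 - prod(1.0 - c for c in scores)


def clue_matrix(src_tokens, tgt_tokens, sentence_pairs):
    """Combined clue score for every source/target token pair of one sentence."""
    return [
        [
            combine_clues([
                string_similarity_clue(s, t),
                dice_clue(s, t, sentence_pairs),
            ])
            for t in tgt_tokens
        ]
        for s in src_tokens
    ]


# Toy English/Swedish data, invented purely for illustration.
pairs = [
    (["the", "house"], ["huset"]),
    (["the", "car"], ["bilen"]),
    (["a", "house"], ["ett", "hus"]),
]
matrix = clue_matrix(["the", "house"], ["huset"], pairs)
```

Each cell of `matrix` holds the combined evidence that a source token and a target token correspond. An aligner could then pick links from the highest-scoring cells, and the same scoring could be applied to multi-word units to reflect the varying token sizes mentioned in the abstract. A noisy-OR never lowers a score when a clue is added, which suits weak but complementary evidence; a weighted or learned combination would be an equally reasonable alternative.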