This paper presents a word alignment approach based on a combination of clues. Word alignment clues indicate associations between words and phrases. They can be based on features such as frequency, part of speech, phrase type, and the actual word-form strings. Clues can be derived by calculating similarity measures or learned from word-aligned data. The proposed clue alignment approach makes it possible to combine association clues that take different kinds of linguistic information into account. It also allows dynamic tokenization into token units of varying size. The approach has been applied to an English/Swedish parallel text with promising results.
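
The idea of combining clues can be illustrated with a minimal Python sketch. The abstract does not state how clues are combined, so the weighted noisy-OR rule used here is an assumption, as are the function names (`string_similarity_clue`, `dice_clue`, `combine_clues`), the weights, and the three-sentence toy bitext, which is invented for illustration only. The sketch covers single-word tokens and does not attempt the dynamic tokenization mentioned above.

```python
from difflib import SequenceMatcher


def string_similarity_clue(src: str, tgt: str) -> float:
    """Surface-form clue: character-level similarity ratio in [0, 1]."""
    return SequenceMatcher(None, src.lower(), tgt.lower()).ratio()


def dice_clue(src: str, tgt: str, bitext) -> float:
    """Co-occurrence clue: Dice coefficient over aligned sentence pairs."""
    both = sum(1 for s, t in bitext if src in s and tgt in t)
    n_src = sum(1 for s, _ in bitext if src in s)
    n_tgt = sum(1 for _, t in bitext if tgt in t)
    total = n_src + n_tgt
    return 2 * both / total if total else 0.0


def combine_clues(scores, weights) -> float:
    """Assumed combination rule (weighted noisy-OR): 1 - prod(1 - w_k * c_k).

    Each clue can only raise the combined score, and a single strong clue
    is enough to produce a high overall association.
    """
    remaining = 1.0
    for score, weight in zip(scores, weights):
        remaining *= 1.0 - weight * score
    return 1.0 - remaining


# Hypothetical toy bitext (English, Swedish), invented for illustration.
bitext = [
    (["the", "house", "is", "red"], ["huset", "är", "rött"]),
    (["a", "red", "car"], ["en", "röd", "bil"]),
    (["the", "car", "is", "new"], ["bilen", "är", "ny"]),
]

# Score every candidate link for one source word and keep the best one.
source_word = "is"
candidates = {t for _, tgt_sent in bitext for t in tgt_sent}
scored = {
    t: combine_clues(
        [string_similarity_clue(source_word, t), dice_clue(source_word, t, bitext)],
        weights=[0.5, 0.9],  # illustrative weights, not taken from the paper
    )
    for t in candidates
}
best = max(scored, key=scored.get)
print(source_word, "->", best)
```

In this sketch the string similarity clue contributes nothing for the pair "is"/"är", yet the co-occurrence clue alone is enough to select it, which is the behaviour a disjunctive combination is meant to provide. Clues based on part of speech or phrase type could be added as further entries in the `scores` list.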