Word sense disambiguation using a second language monolingual corpus

Authors:
Ido Dagan;Alon Itai
Affiliations:
AT&T Bell Laboratories;Technion---Israel Institute of Technology
Venue:
Computational Linguistics
Year:
1994

Citing 23
Cited 85

Discovery procedures for sublanguage selectional patterns: initial experiments

Computational Linguistics
Transition network grammars for natural language analysis

Readings in natural language processing
Cyc: toward programs with common sense

Communications of the ACM
Word association norms, mutual information, and lexicography

Computational Linguistics
A statistical approach to machine translation

Computational Linguistics
Self-organized language modeling for speech recognition

Readings in speech recognition
Using multiple knowledge sources for word sense discrimination

Computational Linguistics
Dimensions of meaning

Proceedings of the 1992 ACM/IEEE conference on Supercomputing
BABELWARE for the desktop

BYTE
Slot Grammar: A System for Simpler Construction of Practical Natural Language Grammars

Proceedings of the International Symposium on Natural Language and Logic
Word Space

Advances in Neural Information Processing Systems 5, [NIPS Conference]
Retrieving collocations from text: Xtract

Computational Linguistics - Special issue on using large corpora: I
Extracting semantic hierarchies from a large on-line dictionary

ACL '85 Proceedings of the 23rd annual meeting on Association for Computational Linguistics
Two languages are more informative than one

ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Structural ambiguity and lexical relations

ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Word-sense disambiguation using statistical methods

ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Contextual word similarity and estimation from sparse data

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Noun classification from predicate-argument structures

ACL '90 Proceedings of the 28th annual meeting on Association for Computational Linguistics
Automatic processing of large corpora for the resolution of anaphora references

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3
An active bilingual lexicon for Machine Translation

COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 1
Word-sense disambiguation using statistical models of Roget's categories trained on large corpora

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Tagging for learning: collecting thematic relations from corpus

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 1
But dictionaries are data too

HLT '93 Proceedings of the workshop on Human Language Technology

Learning morpho-lexical probabilities from an untagged corpus with an application to Hebrew

Computational Linguistics
Translating collocations for bilingual lexicons: a statistical approach

Computational Linguistics
A brief introduction to natural language processing for non-linguists

Learning language in logic
Toward Language-dependent Applications

Machine Translation
Collocation Dictionary Optimization Using WordNetand k-Nearest Neighbor Learning

Machine Translation
A Statistical View on Bilingual Lexicon Extraction: From Parallel Corpora to Non-parallel Corpora

AMTA '98 Proceedings of the Third Conference of the Association for Machine Translation in the Americas on Machine Translation and the Information Soup
Understanding Politics by Studying Weather: A Cognitive Approach to Representation of Polish Verbs of Motion, Appearance, and Existence

AMTA '00 Proceedings of the 4th Conference of the Association for Machine Translation in the Americas on Envisioning Machine Translation in the Information Future
Introduction to the special issue on word sense disambiguation: the state of the art

Computational Linguistics - Special issue on word sense disambiguation
Topical clustering of MRD senses based on information retrieval techniques

Computational Linguistics - Special issue on word sense disambiguation
Using corpus statistics and WordNet relations for sense identification

Computational Linguistics - Special issue on word sense disambiguation
Selective sampling for example-based word sense disambiguation

Computational Linguistics
The grammar of sense: Using part-of-speech tags as a first step in semantic disambiguation

Natural Language Engineering
Hebrew Computational Linguistics: Past and Future

Artificial Intelligence Review
Homonymy and polysemy in information retrieval

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Target word selection as proximity in semantic space

ACL '98 Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 2
A concept-based adaptive approach to word sense disambiguation

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
An IR approach for translating new words from nonparallel, comparable texts

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Term-list translation using mono-lingual word co-occurrence vectors

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Unsupervised word sense disambiguation rivaling supervised methods

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Two-level, many-paths generation

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
To what extent does case contribute to verb sense disambiguation?

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Using a probabilistic class-based lexicon for lexical ambiguity resolution

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Extraction of lexical translations from non-aligned corpora

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Word translation disambiguation using bilingual bootstrapping

Computational Linguistics
Automatic identification of non-compositional phrases

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Mixed language query disambiguation

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Translation Disambiguation in Mixed Language Queries

Machine Translation
Translation selection through source word sense disambiguation and target word selection

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Unsupervised word sense disambiguation using bilingual comparable corpora

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Measuring the similarity between compound nouns in different languages using non-parallel corpora

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Syntactic features for high precision word sense disambiguation

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Location normalization for information extraction

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Crosslinguistic transfer in automatic verb classification

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
A multilingual paradigm for automatic verb classification

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Word translation disambiguation using Bilingual Bootstrapping

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Word sense acquisition from bilingual comparable corpora

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Phrase-pattern-based Korean to English machine translation using two level translation pattern selection

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Web-based models for natural language processing

ACM Transactions on Speech and Language Processing (TSLP)
Exploiting parallel texts in the creation of multilingual semantically annotated resources: the MultiSemCor Corpus

Natural Language Engineering
An unsupervised method for multilingual word sense tagging using parallel corpora: a preliminary investigation

WWSM '00 Proceedings of the ACL-2000 workshop on Word senses and multi-linguality - Volume 8
Sense discrimination with parallel corpora

WSD '02 Proceedings of the ACL-02 workshop on Word sense disambiguation: recent successes and future directions - Volume 8
Learning bilingual translations from comparable corpora to cross-language information retrieval: hybrid statistics-based and linguistics-based approach

AsianIR '03 Proceedings of the sixth international workshop on Information retrieval with Asian languages - Volume 11
InfoXtract location normalization: a hybrid approach to geographic references in information extraction

HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
Practical Word-Sense Disambiguation Using Co-occurring Concept Codes

Machine Translation
Aligning word senses using bilingual corpora

ACM Transactions on Asian Language Information Processing (TALIP)
Collocation translation acquisition using monolingual corpora

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Unsupervised sense disambiguation using bilingual probabilistic models

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Word sense disambiguation using label propagation based semi-supervised learning

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Evaluating cross-language annotation transfer in the MultiSemCor corpus

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Deeper sentiment analysis using machine translation technology

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Word sense disambiguation using sense examples automatically acquired from a second language

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
A semi-supervised feature clustering algorithm with application to word sense disambiguation

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Learning model order from labeled and unlabeled data for partially supervised classification, with application to word sense disambiguation

Computer Speech and Language
Opening the legal literature portal to multilingual access

DCMI '04 Proceedings of the 2004 international conference on Dublin Core and metadata applications: metadata across languages and cultures
Word sense disambiguation: A survey

ACM Computing Surveys (CSUR)
The bootstrapping of the Yarowsky algorithm in real corpora

Information Processing and Management: an International Journal
Word Clustering for Collocation-Based Word Sense Disambiguation

CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
Word sense disambiguation using automatically translated sense examples

CrossLangInduction '06 Proceedings of the International Workshop on Cross-Language Knowledge Induction
Partially supervised sense disambiguation by learning sense number from tagged and untagged corpora

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Mention detection crossing the language barrier

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Unsupervised multilingual word sense disambiguation via an interlingua

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
An unsupervised method for multilingual word sense tagging using parallel corpora: a preliminary investigation

WorkSense '00 Proceedings of the ACL-2000 Workshop on Word Senses and Multi-Linguality
Automatic processing of multilingual medical terminology: applications to thesaurus enrichment and cross-language information retrieval

Artificial Intelligence in Medicine
Cross-Language Information Propagation for Arabic Mention Detection

ACM Transactions on Asian Language Information Processing (TALIP)
A Reexamination of MRD-Based Word Sense Disambiguation

ACM Transactions on Asian Language Information Processing (TALIP)
Selecting target word using contexonym comparison method

Proceedings of the 2007 conference on Human interface: Part I
HR-WSD: System description for all-words word sense disambiguation on a specific domain at SemEval-2010

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Using comparable corpora to improve the effectiveness of cross-language information retrieval

IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
Flexible-attribute problems

Computational Optimization and Applications
Joint bilingual sentiment classification with unlabeled parallel corpora

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Word sense disambiguation with multilingual features

IWCS '11 Proceedings of the Ninth International Conference on Computational Semantics
ParaSense or how to use parallel corpora for word sense disambiguation

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Supporting Arabic cross-lingual retrieval using contextual information

IRFC'11 Proceedings of the Second international conference on Multidisciplinary information retrieval facility
An improved method for finding bilingual collocation correspondences from monolingual corpora

ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
Translation selection through machine learning with language resources

ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
Unsupervised bilingual word sense disambiguation using web statistics

AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Target word selection for korean verbs using a bilingual dictionary and wordnet

AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
An unsupervised method for ranking translation words using a bilingual dictionary and wordnet

IEA/AIE'06 Proceedings of the 19th international conference on Advances in Applied Artificial Intelligence: industrial, Engineering and Other Applications of Applied Intelligent Systems
Word sense disambiguation by semi-supervised learning

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Instance pruning by filtering uninformative words: an information extraction case study

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Exploiting the translation context for multilingual WSD

TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
The need for application-dependent WSD strategies: a case study in MT

PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
Target word selection in English to Persian translation using unsupervised approach

International Journal of Artificial Intelligence and Soft Computing
BiCWS: mining cognitive differences from bilingual web search results

WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
A new fuzzy rule-based classification system for word sense disambiguation

Intelligent Data Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a new approach for resolving lexical ambiguities in one language using statistical data from a monolingual corpus of another language. This approach exploits the differences between mappings of words to senses in different languages. The paper concentrates on the problem of target word selection in machine translation, for which the approach is directly applicable. The presented algorithm identifies syntactic relations between words, using a source language parser, and maps the alternative interpretations of these relations to the target language, using a bilingual lexicon. The preferred senses are then selected according to statistics on lexical relations in the target language. The selection is based on a statistical model and on a constraint propagation algorithm, which simultaneously handles all ambiguities in the sentence. The method was evaluated using three sets of Hebrew and German examples and was found to be very useful for disambiguation. The paper includes a detailed comparative analysis of statistical sense disambiguation methods.