One sense per discourse

Authors:
William A. Gale;Kenneth W. Church;David Yarowsky
Affiliations:
AT&T Bell Laboratories, Murray Hill NJ;AT&T Bell Laboratories, Murray Hill NJ;AT&T Bell Laboratories, Murray Hill NJ
Venue:
HLT '91 Proceedings of the workshop on Speech and Natural Language
Year:
1992

Citing 6
Cited 128

Semantic interpretation and the resolution of ambiguity

Semantic interpretation and the resolution of ambiguity
An experiment in computational discrimination of English word senses

IBM Journal of Research and Development
Automatic text processing

Automatic text processing
Two languages are more informative than one

ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Word-sense disambiguation using statistical methods

ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Word-sense disambiguation using statistical models of Roget's categories trained on large corpora

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2

Information retrieval based on context distance and morphology

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
An automatic method for generating sense tagged corpora

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
The impact on retrieval effectiveness of skewed frequency distributions

ACM Transactions on Information Systems (TOIS)
Automatic adaptation of proper noun dictionaries through cooperation of machine learning and probabilistic methods

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Document centered approach to text normalization

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Periods, capitalized words, etc.

Computational Linguistics
The interaction of knowledge sources in word sense disambiguation

Computational Linguistics
TextTiling: segmenting text into multi-paragraph subtopic passages

Computational Linguistics
Introduction to the special issue on word sense disambiguation: the state of the art

Computational Linguistics - Special issue on word sense disambiguation
Using corpus statistics and WordNet relations for sense identification

Computational Linguistics - Special issue on word sense disambiguation
Unsupervised named entity recognition using syntactic and semantic contextual evidence

Computational Linguistics
Dedication to William A. Gale

Natural Language Engineering
Introduction to the special issue on evaluating word sense disambiguation systems

Natural Language Engineering
The role of domain information in Word Sense Disambiguation

Natural Language Engineering
Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation

Natural Language Engineering
Semantic tagging of unknown proper nouns

Natural Language Engineering
Finding a domain-appropriate sense inventory for semantically tagging a corpus

Natural Language Engineering
Using a semantic network for information extraction

Natural Language Engineering
The grammar of sense: Using part-of-speech tags as a first step in semantic disambiguation

Natural Language Engineering
How verb subcategorization frequencies are affected by corpus choice

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Word sense disambiguation using optimised combinations of knowledge sources

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
A concept-based adaptive approach to word sense disambiguation

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Automatic semantic tagging of unknown proper names

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
One tokenization per source

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Robust parsing based on discourse information: completing partial parses of ill-formed sentences on the basis of discourse information

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Estimating upper and lower bounds on the performance of word-sense disambiguation programs

ACL '92 Proceedings of the 30th annual meeting on Association for Computational Linguistics
Robust method of pronoun resolution using full-text information

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Portable knowledge sources for machine translation

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Word translation disambiguation using bilingual bootstrapping

Computational Linguistics
Man vs. machine: a case study in base noun phrase learning

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
A method for word sense disambiguation of unrestricted text

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
SIGIR 2003 workshop on text analysis and search for bioinformatics

ACM SIGIR Forum
Structural Semantic Interconnections: A Knowledge-Based Approach to Word Sense Disambiguation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Augmenting noun taxonomies by combining lexical similarity metrics

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
A graph model for unsupervised lexical acquisition

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Location normalization for information extraction

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Building semantic perceptron net for topic spotting

ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Word translation disambiguation using Bilingual Bootstrapping

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Bootstrapping for named entity tagging using concept-based seeds

NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
A bootstrapping approach to named entity classification using successive learners

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
One sense per collocation

HLT '93 Proceedings of the workshop on Human Language Technology
Enhancing a biomedical information extraction system with dictionary mining and context disambiguation

IBM Journal of Research and Development
Word sense disambiguation with pictures

Artificial Intelligence - Special volume on connecting language to the world
Language independent NER using a unified model of internal and contextual evidence

COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Word sense disambiguation with pictures

HLT-NAACL-LWM '04 Proceedings of the HLT-NAACL 2003 workshop on Learning word meaning from non-linguistic data - Volume 6
Grounding spatial named entities for information extraction and question answering

HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
InfoXtract location normalization: a hybrid approach to geographic references in information extraction

HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
On document relevance and lexical cohesion between query terms

Information Processing and Management: an International Journal
Atomic topical segments detection for instructional videos

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Weakly supervised learning for cross-document person name disambiguation supported by information extraction

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
PageRank on semantic networks, with application to word sense disambiguation

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Bootstrapping without the boot

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Domain-specific sense distributions and predominant sense acquisition

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Exploring phrasal context and error correction heuristics in bootstrapping for geographic named entity annotation

Information Systems
Reinforcing English countability prediction with one countability per discourse property

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Geographic co-occurrence as a tool for gir.

Proceedings of the 4th ACM workshop on Geographical information retrieval
Evaluation of Localized Semantics: Data, Methodology, and Experiments

International Journal of Computer Vision
Ambiguous queries: test collections need more sense

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Abbreviation Disambiguation: Experiments with Various Variants of the One Sense per Discourse Hypothesis

NLDB '08 Proceedings of the 13th international conference on Natural Language and Information Systems: Applications of Natural Language to Information Systems
Intra-document structural frequency features for semi-supervised domain adaptation

Proceedings of the 17th ACM conference on Information and knowledge management
Knowledge-based gene symbol disambiguation

Proceedings of the 2nd international workshop on Data and text mining in bioinformatics
Word sense disambiguation: A survey

ACM Computing Surveys (CSUR)
A Method for Reinforcing Noun Countability Prediction

IEICE - Transactions on Information and Systems
Critical analysis of WSD algorithms

Proceedings of the International Conference on Advances in Computing, Communication and Control
SOFIE: a self-organizing framework for information extraction

Proceedings of the 18th international conference on World wide web
Improved Unsupervised Name Discrimination with Very Wide Bigrams and Automatic Cluster Stopping

CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
Extracting Geographic Context from the Web: GeoReferencing in MyMoSe

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Khurshid Ahmad, Christopher Brewster, Mark Stevenson (Eds), Words and Intelligence I: Selected Papers by Yorick Wilks

Machine Translation
Combined one sense disambiguation of abbreviations

HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Implementation of Croatian NERC system

ACL '07 Proceedings of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies
Adapting an NER-system for German to the biomedical domain

JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
An Approach to Web-Scale Named-Entity Disambiguation

MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Implementing a sense tagger in a general architecture for text engineering

NeMLaP3/CoNLL '98 Proceedings of the Joint Conferences on New Methods in Language Processing and Computational Natural Language Learning
Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
What is at stake: a case study of Russian expressions starting with a preposition

MWE '04 Proceedings of the Workshop on Multiword Expressions: Integrating Processing
UNN-WePS: web person search using co-present names and lexical Chains

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
One translation per discourse

DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions
One class per named entity: exploiting unlabeled text for named entity recognition

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Improving word sense disambiguation in lexical chaining

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Automatic knowledge representation using a graph-based algorithm for language-independent lexical chaining

IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
Word sense disambiguation with pictures

Artificial Intelligence - Special volume on connecting language to the world
Relieving Polysemy Problem for Synonymy Detection

EPIA '09 Proceedings of the 14th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
An information-theoretic based model for large-scale contextual text processing

Information Sciences: an International Journal
Geographic signatures for semantic retrieval

Proceedings of the 6th Workshop on Geographic Information Retrieval
WSD as a distributed constraint optimization problem

ACLstudent '10 Proceedings of the ACL 2010 Student Research Workshop
Context adaptation in statistical machine translation using models with exponentially decaying cache

DANLP 2010 Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing
Inducing fine-grained semantic classes via hierarchical and collective classification

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Sense-based biomedical indexing and retrieval

NLDB'10 Proceedings of the Natural language processing and information systems, and 15th international conference on Applications of natural language to information systems
Automatic discovery of word semantic relations using paraphrase alignment and distributional lexical semantics analysis

Natural Language Engineering
Word sense disambiguation methods

Programming and Computing Software
Ontology-based distinction between polysemy and homonymy

IWCS '11 Proceedings of the Ninth International Conference on Computational Semantics
Automatic acquisition of huge training data for bio-medical named entity recognition

BioNLP '11 Proceedings of BioNLP 2011 Workshop
The web is not a person, Berners-Lee is not an organization, and African-Americans are not locations: an analysis of the performance of named-entity recognition

MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
How many multiword expressions do people know?

MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
Towards automatic acquisition of a fully sense tagged corpus for persian

ISMIS'11 Proceedings of the 19th international conference on Foundations of intelligent systems
Online news event extraction for global crisis surveillance

Transactions on computational collective intelligence V
A hybrid approach to chinese abbreviation expansion

ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
Spanish all-words semantic class disambiguation using Cast3LB corpus

MICAI'06 Proceedings of the 5th Mexican international conference on Artificial Intelligence
Identification, expansion, and disambiguation of acronyms in biomedical texts

ISPA'05 Proceedings of the 2005 international conference on Parallel and Distributed Processing and Applications
Dictionary-based amharic-french information retrieval

CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
The XLDB group at GeoCLEF 2005

CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
On some optimization heuristics for lesk-like WSD algorithms

NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Classification-based contextual preferences

TIWTE '11 Proceedings of the TextInfer 2011 Workshop on Textual Entailment
Cache-based document-level statistical machine translation

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Heuristic methods for reducing errors of geographic named entities learned by bootstrapping

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
A bootstrapping approach for geographic named entity annotation

AIRS'04 Proceedings of the 2004 international conference on Asian Information Retrieval Technology
A case study of using web search statistics: case restoration

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
Targeted disambiguation of ad-hoc, homogeneous sets of named entities

Proceedings of the 21st international conference on World Wide Web
Resolving ambiguity in biomedical text to improve summarization

Information Processing and Management: an International Journal
Toponym disambiguation using ontology-based semantic similarity

PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
Web query disambiguation using PageRank

Journal of the American Society for Information Science and Technology
Mining query subtopics from search log data

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Bootstrapping events and relations from text

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Word sense induction for novel sense detection

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Unsupervised detection of downward-entailing operators by maximizing classification certainty

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Multi event extraction guided by global constraints

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Encouraging consistent translation choices

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
The SENSEVAL-2 panel on domains, topics and senses

SENSEVAL '01 The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems
The Japanese translation task: lexical and structural perspectives

SENSEVAL '01 The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems
Regular polysemy: a distributional model

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Open-domain anatomical entity mention detection

ACL '12 Proceedings of the Workshop on Detecting Structure in Scholarly Discourse
The trouble with SMT consistency

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
A new clustering method for detecting rare senses of abbreviations in clinical notes

Journal of Biomedical Informatics
Analysis and refinement of cross-lingual entity linking

CLEF'12 Proceedings of the Third international conference on Information Access Evaluation: multilinguality, multimodality, and visual analytics
How many multiword expressions do people know?

ACM Transactions on Speech and Language Processing (TSLP) - Special issue on multiword expressions: From theory to practice and use, part 1
Worst-case complexity and empirical evaluation of artificial intelligence methods for unsupervised word sense disambiguation

International Journal of Web Engineering and Technology
Comparing resources for spanish lexical simplification

SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
Latent word context model for information retrieval

Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

It is well-known that there are polysemous words like sentence whose "meaning" or "sense" depends on the context of use. We have recently reported on two new word-sense disambiguation systems, one trained on bilingual material (the Canadian Hansards) and the other trained on monolingual material (Roget's Thesaurus and Grolier's Encyclopedia). As this work was nearing completion, we observed a very strong discourse effect. That is, if a polysemous word such as sentence appears two or more times in a well-written discourse, it is extremely likely that they will all share the same sense. This paper describes an experiment which confirmed this hypothesis and found that the tendency to share sense in the same discourse is extremely strong (98%). This result can be used as an additional source of constraint for improving the performance of the word-sense disambiguation algorithm. In addition, it could also be used to help evaluate disambiguation algorithms that did not make use of the discourse constraint.