Near-synonymy and lexical choice

Authors:
Philip Edmonds;Graeme Hirst
Affiliations:
Sharp Laboratories of Europe Limited, Oxford Science Park, Edmund Halley Road, Oxford OX4 4GB, England;Department of Computer Science, University of Toronto, Ontario, Canada M5S 3G4
Venue:
Computational Linguistics
Year:
2002

Citing 32
Cited 24

Text generation: using discourse strategies and focus constraints to generate natural language text

Text generation: using discourse strategies and focus constraints to generate natural language text
Synonymy and semantic classification

Synonymy and semantic classification
Description directed control: its implications for natural language generation

Readings in natural language processing
A statistical approach to machine translation

Computational Linguistics
Relational models of the lexicon

Relational models of the lexicon
Using a lexicon of canonical graphs in a semantic interpreter

Relational models of the lexicon
Machine translation: a view from the Lexicon

Machine translation: a view from the Lexicon
Improving statistical language model performance with automatically generated word hierarchies

Computational Linguistics
EuroWordNet: a multilingual database with lexical semantic networks

EuroWordNet: a multilingual database with lexical semantic networks
Lexical semantics and knowledge representation in multilingual text generation

Lexical semantics and knowledge representation in multilingual text generation
Explorations in Automatic Thesaurus Discovery

Explorations in Automatic Thesaurus Discovery
Generating Natural Language under Pragmatic Constraints

Generating Natural Language under Pragmatic Constraints
Machine Translation: A Knowledge-Based Approach

Machine Translation: A Knowledge-Based Approach
Target-Text Mediated Interactive Machine Translation

Machine Translation
Logical Structures in the Lexicon

Proceedings of the First SIGLEX Workshop on Lexical Semantics and Knowledge Representation
Syntax-Driven and Ontology-Driven Lexical Semantics

Proceedings of the First SIGLEX Workshop on Lexical Semantics and Knowledge Representation
Representation of Semantic Knowledge with Term Subsumption Languages

Proceedings of the First SIGLEX Workshop on Lexical Semantics and Knowledge Representation
Semantic representations of near-synonyms for automatic lexical choice

Semantic representations of near-synonyms for automatic lexical choice
A computational theory of goal-directed style in syntax

Computational Linguistics
Floating constraints in lexical choice

Computational Linguistics
Automatic word sense discrimination

Computational Linguistics - Special issue on word sense disambiguation
Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation

Natural Language Engineering
Building applied natural language generation systems

Natural Language Engineering
Similarity between words computed by spreading activation on an English dictionary

EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
Lexical choice criteria in language generation

EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
Choosing the word most typical in context using a lexical co-occurrence network

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Automatic retrieval and clustering of similar words

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Multilingual computational semantic lexicons in action: the WYSINNWYG approach to NLP

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Resolving translation mismatches with information flow

ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
Contextual word similarity and estimation from sparse data

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Distributional clustering of English words

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Using information content to evaluate semantic similarity in a taxonomy

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1

A WordNet interface to APL2

APL '02 Proceedings of the 2002 conference on APL: array processing languages: lore, problems, and applications
A model for matching semantic maps between languages (French/English, English/French)

Computational Linguistics
Choosing words in computer-generated weather forecasts

Artificial Intelligence - Special volume on connecting language to the world
Acquiring collocations for lexical choice between near-synonyms

ULA '02 Proceedings of the ACL-02 workshop on Unsupervised lexical acquisition - Volume 9
Learning the meaning and usage of time phrases from a parallel text-data corpus

HLT-NAACL-LWM '04 Proceedings of the HLT-NAACL 2003 workshop on Learning word meaning from non-linguistic data - Volume 6
Population testing: extracting semantic information on near-synonymy from native speakers

HLT-NAACL-LWM '04 Proceedings of the HLT-NAACL 2003 workshop on Learning word meaning from non-linguistic data - Volume 6
Building and Using a Lexical Knowledge Base of Near-Synonym Differences

Computational Linguistics
Paraphrasing for automatic evaluation

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
A survey on sentiment detection of reviews

Expert Systems with Applications: An International Journal
Empirical acquisition of differentiating relations from definitions

ElectricDict '04 Proceedings of the Workshop on Enhancing and Using Electronic Dictionaries
Choosing words in computer-generated weather forecasts

Artificial Intelligence - Special volume on connecting language to the world
Synonym extraction using a semantic distance on a dictionary

TextGraphs-1 Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing
Inference of lexical ontologies. The LeOnI methodology

Artificial Intelligence
Automatic sense disambiguation of the near-synonyms in a dictionary entry

CICLing'03 Proceedings of the 4th international conference on Computational linguistics and intelligent text processing
Near-synonym lexical choice in latent semantic space

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Generating phrasal and sentential paraphrases: A survey of data-driven methods

Computational Linguistics
Comparing distributional and mirror translation similarities for extracting synonyms

Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
Invariants and variability of synonymy networks: self mediated agreement by confluence

TextGraphs-6 Proceedings of TextGraphs-6: Graph-based Methods for Natural Language Processing
Methodologies for the reliable construction of ontological knowledge

ICCS'05 Proceedings of the 13th international conference on Conceptual Structures: common Semantics for Sharing Knowledge
Lexical choice via topic adaptation for paraphrasing written language to spoken language

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Exploring extensive linguistic feature sets in near-synonym lexical choice

CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part II
Extractive email thread summarization: can we do better than he said she said?

INLG '12 Proceedings of the Seventh International Natural Language Generation Conference
Semi-automatic enrichment of crowdsourced synonymy networks: the WISIGOTH system applied to Wiktionary

Language Resources and Evaluation
Assessing the influence of personal preferences on the choice of vocabulary for natural language generation

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

We develop a new computational model for representing the fine-grained meanings of near-synonyms and the differences between them. We also develop a lexical-choice process that can decide which of several near-synonyms is most appropriate in a particular situation. This research has direct applications in machine translation and text generation.We first identify the problems of representing near-synonyms in a computational lexicon and show that no previous model adequately accounts for near-synonymy. We then propose a preliminary theory to account for near-synonymy, relying crucially on the notion of granularity of representation, in which the meaning of a word arises out of a context-dependent combination of a context-independent core meaning and a set of explicit differences to its near-synonyms. That is, near-synonyms cluster together.We then develop a clustered model of lexical knowledge, derived from the conventional ontological model. The model cuts off the ontology at a coarse grain, thus avoiding an awkward proliferation of language-dependent concepts in the ontology, yet maintaining the advantages of efficient computation and reasoning. The model groups near-synonyms into subconceptual clusters that are linked to the ontology. A cluster differentiates near-synonyms in terms of fine-grained aspects of denotation, implication, expressed attitude, and style. The model is general enough to account for other types of variation, for instance, in collocational behavior.An efficient, robust, and flexible fine-grained lexical-choice process is a consequence of a clustered model of lexical knowledge. To make it work, we formalize criteria for lexical choice as preferences to express certain concepts with varying indirectness, to express attitudes, and to establish certain styles. The lexical-choice process itself works on two tiers: between clusters and between near-synonyns of clusters. We describe our prototype implementation of the system, called I-Saurus.