Bootstrapping distributional feature vector quality

  • Authors:
  • Maayan Zhitomirsky-Geffet and Ido Dagan

  • Venue:
  • Computational Linguistics
  • Year:
  • 2009

Abstract

This article presents a novel bootstrapping approach for improving the quality of feature vector weighting in distributional word similarity. The method was motivated by attempts to utilize distributional similarity for identifying the concrete semantic relationship of lexical entailment. Our analysis revealed that a major reason for the rather loose semantic similarity obtained by distributional similarity methods is insufficient quality of the word feature vectors, caused by deficient feature weighting. This observation led to the definition of a bootstrapping scheme which yields improved feature weights, and hence higher quality feature vectors. The underlying idea of our approach is that features which are common to similar words are also most characteristic for their meanings, and thus should be promoted. This idea is realized via a bootstrapping step applied to an initial standard approximation of the similarity space. The superior performance of the bootstrapping method was assessed in two different experiments, one based on direct human gold-standard annotation and the other based on an automatically created disambiguation dataset. These results are further supported by applying a novel quantitative measurement of the quality of feature weighting functions. Improved feature weighting also allows massive feature reduction, which indicates that the most characteristic features for a word are indeed concentrated at the top ranks of its vector. Finally, experiments with three prominent similarity measures and two feature weighting functions showed that the bootstrapping scheme is robust and is independent of the original functions over which it is applied.
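The abstract describes the bootstrapping idea only at a high level: features shared with a word's distributionally similar neighbors are promoted in its vector. The sketch below is a minimal illustrative reading of that idea, not the authors' exact method. The toy co-occurrence data, the use of relative frequency and cosine for the initial pass, the top-k neighborhood size, and the promotion rule (scaling a feature's weight by one plus the number of neighbors that share it) are all assumptions introduced for illustration.

```python
from math import sqrt

# Hypothetical toy data: word -> {feature: raw co-occurrence count}.
# All names and numbers below are illustrative assumptions.
cooccurrences = {
    "dog":   {"bark": 4, "leash": 3, "pet": 5, "run": 2},
    "cat":   {"meow": 4, "leash": 1, "pet": 5, "run": 2},
    "car":   {"drive": 6, "engine": 4, "run": 1},
    "truck": {"drive": 5, "engine": 5, "cargo": 3},
}

def initial_weights(coocs):
    """First-pass weighting; plain relative frequency stands in here for
    standard functions such as PMI used in the distributional literature."""
    weighted = {}
    for word, feats in coocs.items():
        total = sum(feats.values())
        weighted[word] = {f: c / total for f, c in feats.items()}
    return weighted

def cosine(u, v):
    """One of several possible vector similarity measures."""
    shared = set(u) & set(v)
    num = sum(u[f] * v[f] for f in shared)
    den = sqrt(sum(x * x for x in u.values())) * sqrt(sum(x * x for x in v.values()))
    return num / den if den else 0.0

def top_neighbors(vectors, word, k=2):
    """Top-k most similar words under the current weighting."""
    sims = [(other, cosine(vectors[word], vectors[other]))
            for other in vectors if other != word]
    sims.sort(key=lambda t: t[1], reverse=True)
    return [w for w, _ in sims[:k]]

def bootstrap_weights(vectors, k=2):
    """Promote features shared with a word's nearest neighbors in the initial
    similarity space; features unsupported by any neighbor stay low-ranked
    and could be pruned, allowing aggressive feature reduction."""
    new_vectors = {}
    for word, feats in vectors.items():
        neighbors = top_neighbors(vectors, word, k)
        reweighted = {}
        for f, w in feats.items():
            support = sum(1 for n in neighbors if f in vectors[n])
            reweighted[f] = w * (1 + support)  # simple promotion rule (assumption)
        new_vectors[word] = reweighted
    return new_vectors

vectors = initial_weights(cooccurrences)
boosted = bootstrap_weights(vectors)
print(top_neighbors(boosted, "dog"))
```

In this sketch the bootstrapping step is run once over the initial similarity space; in principle the reweighted vectors could be fed back for further iterations or truncated to their top-ranked features, in line with the feature reduction reported in the abstract.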