The computation of word associations: comparing syntagmatic and paradigmatic approaches

Authors:
Reinhard Rapp
Affiliations:
University of Mainz, FASK, Germersheim, Germany
Venue:
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Year:
2002

Citing 9
Cited 16

Experiment on linguistically-based term associations

Information Processing and Management: an International Journal
Semantic feature extraction from technical texts with limited human intervention

Semantic feature extraction from technical texts with limited human intervention
Explorations in Automatic Thesaurus Discovery

Explorations in Automatic Thesaurus Discovery
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Accurate methods for the statistics of surprise and coincidence

Computational Linguistics - Special issue on using large corpora: I
Retrieving collocations from text: Xtract

Computational Linguistics - Special issue on using large corpora: I
Automatic retrieval and clustering of similar words

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Finding parts in very large corpora

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Automatic identification of word translations from unrelated English and German corpora

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics

A practical solution to the problem of automatic word sense induction

ACLdemo '04 Proceedings of the ACL 2004 on Interactive poster and demonstration sessions
Fast computation of lexical affinity models

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Mining term association patterns from search logs for effective query reformulation

Proceedings of the 17th ACM conference on Information and knowledge management
Co-dispersion: a windowless approach to lexical association

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Language resources for a network-based dictionary

ElectricDict '04 Proceedings of the Workshop on Enhancing and Using Electronic Dictionaries
A graph-theoretic model of lexical syntactic acquisition

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
CRCTOL: A semantic-based domain ontology learning system

Journal of the American Society for Information Science and Technology
Extracting lexical reference rules from Wikipedia

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
A comparison of windowless and window-based computational association measures as predictors of syntagmatic human associations

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Expectation vectors: a semiotics inspired approach to geometric lexical-semantic representation

GEMS '10 Proceedings of the 2010 Workshop on GEometrical Models of Natural Language Semantics
Enhancing clinical concept extraction with distributional semantics

Journal of Biomedical Informatics
Detecting similar software applications

Proceedings of the 34th International Conference on Software Engineering
Granules of words to represent text: an approach based on fuzzy relations and spectral clustering

ICCSA'12 Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part IV
A tensor encoding model for semantic processing

Proceedings of the 21st ACM international conference on Information and knowledge management
Mapping the intellectual structure by co-word: a case of international management science

WISM'12 Proceedings of the 2012 international conference on Web Information Systems and Mining
Semantic Expansion of Tweet Contents for Enhanced Event Detection in Twitter

ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)

Quantified Score

Hi-index	0.00

Visualization

Abstract

It is shown that basic language processes such as the production of free word associations and the generation of synonyms can be simulated using statistical models that analyze the distribution of words in large text corpora. According to the law of association by contiguity, the acquisition of word associations can be explained by Hebbian learning. The free word associations as produced by subjects on presentation of single stimulus words can thus be predicted by applying first-order statistics to the frequencies of word co-occurrences as observed in texts. The generation of synonyms can also be conducted on co-occurrence data but requires second-order statistics. The reason is that synonyms rarely occur together but appear in similar lexical neighborhoods. Both approaches are systematically compared and are validated on empirical data. It turns out that for both tasks the performance of the statistical system is comparable to the performance of human subjects.