It is shown that basic language processes such as the production of free word associations and the generation of synonyms can be simulated using statistical models that analyze the distribution of words in large text corpora. According to the law of association by contiguity, the acquisition of word associations can be explained by Hebbian learning. The free word associations that subjects produce when presented with single stimulus words can thus be predicted by applying first-order statistics to the frequencies of word co-occurrences observed in texts. The generation of synonyms can also be based on co-occurrence data but requires second-order statistics, because synonyms rarely occur together but appear in similar lexical neighborhoods. Both approaches are systematically compared and validated on empirical data. For both tasks, the performance of the statistical system turns out to be comparable to that of human subjects.
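The distinction between first-order and second-order statistics can be illustrated with a minimal Python sketch. It is not the system described in the abstract. The toy sentences, the window size, the use of raw co-occurrence counts as the first-order score, and the use of cosine similarity as the second-order measure are all assumptions made for illustration, and the function names are hypothetical.

```python
from collections import Counter, defaultdict
from math import sqrt

# Toy corpus (an assumption for illustration): "doctor" and "physician"
# never share a sentence but occur with similar neighbouring words.
SENTENCES = [
    "doctor treats patient",
    "doctor treats injury",
    "physician treats patient",
    "doctor visits hospital",
    "physician visits hospital",
]


def cooccurrence_counts(sentences, window=2):
    """Count, per word, how often every other word appears within
    `window` positions of it inside the same sentence."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[word][tokens[j]] += 1
    return counts


def first_order_associations(counts, stimulus, k=3):
    """First-order: rank words by how often they co-occur with the stimulus.
    Raw frequency is used here; a real system would likely use a
    significance or association measure instead."""
    return [w for w, _ in counts.get(stimulus, Counter()).most_common(k)]


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[w] * v[w] for w in u if w in v)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def second_order_neighbours(counts, target, k=3):
    """Second-order: rank words by the similarity of their co-occurrence
    vectors to the target's vector, not by direct co-occurrence."""
    target_vec = counts.get(target, Counter())
    scores = {
        word: cosine(target_vec, vec)
        for word, vec in counts.items()
        if word != target
    }
    return sorted(scores, key=scores.get, reverse=True)[:k]


counts = cooccurrence_counts(SENTENCES)

if __name__ == "__main__":
    print(first_order_associations(counts, "doctor"))
    print(second_order_neighbours(counts, "doctor"))
```

In this toy data "doctor" and "physician" never co-occur, so a first-order ranking cannot relate them, while the second-order ranking places "physician" first because the two words share the same neighbours.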