Elements of information theory
Elements of information theory
Fluid concepts and creative analogies: computer models of the fundamental mechanisms of thought
Fluid concepts and creative analogies: computer models of the fundamental mechanisms of thought
Concept decompositions for large sparse text data using clustering
Machine Learning
Coupled clustering: a method for detecting structural correspondence
The Journal of Machine Learning Research
Distributional clustering of English words
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Corpus-based Learning of Analogies and Semantic Relations
Machine Learning
A generalized framework for revealing analogous themes across related topics
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Hi-index | 0.00 |
We present a method for identifying corresponding themes across several corpora that are focused on related, but distinct, domains. This task is approached through simultaneous clustering of keyword sets extracted from the analyzed corpora. Our algorithm extends the information-bottleneck soft clustering method for a suitable setting consisting of several datasets. Experimentation with topical corpora reveals similar aspects of three distinct religions. The evaluation is by way of comparison to clusters constructed manually by an expert.