The structure-mapping engine: algorithm and examples
Artificial Intelligence
Word clustering and disambiguation based on co-occurrence data
Natural Language Engineering
Distributional clustering of English words
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Cross-dataset clustering: revealing corresponding themes across multiple corpora
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Hi-index | 0.00 |
This work addresses the task of identifying thematic correspondences across sub-corpora focused on different topics. We introduce an unsupervised algorithmic framework based on distributional data clustering, which generalizes previous initial works on this task. The empirical results reveal interesting commonalities of different religions. We evaluate the results through measuring the overlap of our clusters with clusters compiled manually by experts. The tested variants of our framework are shown to outperform alternative methods applicable to the task.