The String-to-String Correction Problem
Journal of the ACM (JACM)
Modern Information Retrieval
LRD: latent relation discovery for vector space expansion and information retrieval
WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
Hi-index | 0.00 |
Sierra & McNaught (2000) proposed a clustering algorithm based on analogy. It takes as input pairs of definitions sharing the same term headword, in the same domain but from different sources, compares these definitions and identifies couples of words in the same extended semantic context. These couples are then grouped to yield semantically-related clusters. Here, we further evaluate the clustering algorithm, and develop an improved version whose results are also evaluated.