A coarse grained parallel algorithm for closest larger ancestors in trees with applications to single link clustering

  • Authors:
  • Albert Chan;Chunmei Gao;Andrew Rau-Chaplin

  • Affiliations:
  • Research partially supported by the Natural Sciences and Engineering Research Council of Canada;Department of Mathematics and Computer Science, Fayetteville State University, Fayetteville, NC;Faculty of Computer Science, Dalhousie University, Halifax, NS, Canada

  • Venue:
  • HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Hierarchical clustering methods are important in many data mining and pattern recognition tasks. In this paper we present an efficient coarse grained parallel algorithm for Single Link Clustering; a standard inter-cluster linkage metric. Our approach is to first describe algorithms for the Prefix Larger Integer Set and the Closest Larger Ancestor problems and then to show how these can be applied to solve the Single Link Clustering problem. In an extensive performance analysis an implementation of these algorithms on a Linux-based cluster has shown to scale well, exhibiting near linear relative speedup.