DGCL: an efficient density and grid based clustering algorithm for large spatial database

  • Authors:
  • Ho Seok Kim;Song Gao;Ying Xia;Gyoung Bae Kim;Hae Young Bae

  • Affiliations:
  • Department of Computer Science and Information Engineering, Inha University, Incheon, Korea;College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Nan'an Distinct, ChongQing City, P.R. China;College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Nan'an Distinct, ChongQing City, P.R. China;Department of Computer Education, Seowon University, Chungbuk, Korea;Department of Computer Science and Information Engineering, Inha University, Incheon, Korea

  • Venue:
  • WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

Spatial clustering, which groups similar objects based on their distance, connectivity, or their relative density in space, is an important component of spatial data mining. Clustering large data sets has always been a serious challenge for clustering algorithms, because huge data set makes the clustering process extremely costly. In this paper, we propose DGCL, an enhanced Density-Grid based Clustering algorithm for Large spatial database. The characteristics of dense area can be enhanced by considering the affection of the surrounding area. Dense areas are analytically identified as clusters by removing sparse area or outliers with the help of a density threshold. Synthetic datasets are used for testing and the result shows the superiority of our approach.