Discovery of feature-based hot spots using supervised clustering

Authors:
Wei Ding;Tomasz F. Stepinski;Rachana Parmar;Dan Jiang;Christoph F. Eick
Affiliations:
Department of Computer Science, University of Massachusetts Boston, Boston, MA 02125-3393, USA;Lunar and Planetary Institute, 3600 Bay Area Blvd., Houston, TX 77058, USA;Department of Computer Science, University of Houston, Houston, TX 77204-3010, USA;Department of Computer Science, University of Houston, Houston, TX 77204-3010, USA;Department of Computer Science, University of Houston, Houston, TX 77204-3010, USA
Venue:
Computers & Geosciences
Year:
2009

Citing 11
Cited 3

Parallel Architectures and Algorithms for Image Component Labeling

IEEE Transactions on Pattern Analysis and Machine Intelligence
Performance Evaluation and Analysis of Monocular Building Extraction From Aerial Imagery

IEEE Transactions on Pattern Analysis and Machine Intelligence
Discovery of Spatial Association Rules in Geographic Information Databases

SSD '95 Proceedings of the 4th International Symposium on Advances in Spatial Databases
Complex Spatial Relationships

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Fast mining of spatial collocations

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Discovering Colocation Patterns from Spatial Data Sets: A General Approach

IEEE Transactions on Knowledge and Data Engineering
Supervised Clustering " Algorithms and Benefits

ICTAI '04 Proceedings of the 16th IEEE International Conference on Tools with Artificial Intelligence
Mining Co-Location Patterns with Rare Events from Spatial Data Sets

Geoinformatica
On supervised density estimation techniques and their application to spatial data mining

Proceedings of the 15th annual ACM international symposium on Advances in geographic information systems
A simple unsupervised MRF model based image segmentation approach

IEEE Transactions on Image Processing
MOSAIC: a proximity graph approach for agglomerative clustering

DaWaK'07 Proceedings of the 9th international conference on Data Warehousing and Knowledge Discovery

Controlling patterns of geospatial phenomena

Geoinformatica
Subject-oriented top-k hot region queries in spatial dataset

Proceedings of the 20th ACM international conference on Information and knowledge management
An information fusion approach to integrate image annotation and text mining methods for geographic knowledge discovery

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.02

Visualization

Abstract

Feature-based hot spots are localized regions where the attributes of objects attain high values. There is considerable interest in automatic identification of feature-based hot spots. This paper approaches the problem of finding feature-based hot spots from a data mining perspective, and describes a method that relies on supervised clustering to produce a list of hot spot regions. Supervised clustering uses a fitness function rewarding isolation of the hot spots to optimally subdivide the dataset. The clusters in the optimal division are ranked using the interestingness of clusters that encapsulate their utility for being hot spots. Hot spots are associated with the top ranked clusters. The effectiveness of supervised clustering as a hot spot identification method is evaluated for four conceptually different clustering algorithms using a dataset describing the spatial distribution of ground ice on Mars. Clustering solutions are visualized by specially developed raster approximations. Further assessment of the ability of different algorithms to yield hot spots is performed using raster approximations. Density-based clustering algorithm is found to be the most effective for hot spot identification. The results of the hot spot discovery by supervised clustering are comparable to those obtained using the G^* statistic, but the new method offers a high degree of automation, making it an ideal tool for mining large datasets for the existence of potential hot spots.