Incremental clustering for trajectories

Authors:
Zhenhui Li;Jae-Gil Lee;Xiaolei Li;Jiawei Han
Affiliations:
Univ. of Illinois at Urbana-Champaign;IBM Almaden Research Center;Microsoft;Univ. of Illinois at Urbana-Champaign
Venue:
DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
Year:
2010

Citing 9

BIRCH: an efficient data clustering method for very large databases
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data

OPTICS: ordering points to identify the clustering structure
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data

Trajectory clustering with mixtures of regression models
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining

A general probabilistic framework for clustering individuals and objects
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining

Data bubbles: quality preserving performance boosting for hierarchical clustering
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data

Incremental Clustering for Mining in a Data Warehousing Environment
VLDB '98 Proceedings of the 24th International Conference on Very Large Data Bases

Trajectory clustering: a partition-and-group framework
Proceedings of the 2007 ACM SIGMOD international conference on Management of data

A framework for clustering evolving data streams
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29

On-line discovery of hot motion paths
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology

Cited 8

Fast and accurate trajectory streams clustering
SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management

Dynamic clustering with soft computing
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery

Constructing street networks from GPS trajectories
ESA'12 Proceedings of the 20th Annual European conference on Algorithms

Finding homogeneous groups in trajectory streams
Proceedings of the Third ACM SIGSPATIAL International Workshop on GeoStreaming

Probabilistic street-intersection reconstruction from GPS trajectories: approaches and challenges
Proceedings of the Third ACM SIGSPATIAL International Workshop on Querying and Mining Uncertain Spatio-Temporal Data

Effectively grouping trajectory streams
NFMCP'12 Proceedings of the First international conference on New Frontiers in Mining Complex Patterns

Semantic trajectories modeling and analysis
ACM Computing Surveys (CSUR)

Dealing with trajectory streams by clustering and mathematical transforms
Journal of Intelligent Information Systems



Abstract

Trajectory clustering plays a crucial role in data analysis because it reveals underlying trends of moving objects. Due to their sequential nature, trajectory data are often received incrementally, e.g., as new points continuously reported by a GPS system. However, existing trajectory clustering algorithms are developed for static datasets, so they are not suitable for incremental clustering, which has the following two requirements. First, clustering must be processed efficiently, since it can be requested frequently. Second, huge amounts of trajectory data must be accommodated, as they accumulate constantly. This paper proposes an incremental clustering framework for trajectories. It contains two parts: online micro-cluster maintenance and offline macro-cluster creation. In the online part, when a new batch of trajectories arrives, each trajectory is simplified into a set of directed line segments in order to find clusters of trajectory subparts. Micro-clusters store compact summaries of similar trajectory line segments, which take much less space than the raw trajectories. When new data are added, the micro-clusters are updated incrementally to reflect the changes. In the offline part, when a user requests the current clustering result, macro-clustering is performed on the set of micro-clusters rather than on all trajectories over the whole time span. Since the number of micro-clusters is smaller than that of the original trajectories, macro-clusters are generated efficiently to show the clustering result of the trajectories. Experimental results on both synthetic and real data sets show that the framework achieves high efficiency as well as high clustering quality.
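
The two-part structure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say which features a micro-cluster stores, how segment distance is measured, or which algorithm performs macro-clustering. The sketch assumes an additive summary (a count plus component-wise sums of segment endpoints), an endpoint-based distance, and simple single-link grouping; the names `MicroCluster`, `seg_distance`, `simplify`, `update_online`, `macro_cluster`, `d_max`, and `eps` are all hypothetical.

```python
import math
from dataclasses import dataclass

# A directed line segment (x1, y1, x2, y2), pointing from start to end.
Segment = tuple[float, float, float, float]


@dataclass
class MicroCluster:
    """Compact additive summary of similar segments (assumed feature set)."""
    n: int               # number of segments absorbed so far
    sums: list[float]    # component-wise sums of (x1, y1, x2, y2)

    @classmethod
    def from_segment(cls, seg: Segment) -> "MicroCluster":
        return cls(1, list(seg))

    def representative(self) -> Segment:
        # Average segment of everything absorbed so far.
        return tuple(s / self.n for s in self.sums)

    def absorb(self, seg: Segment) -> None:
        # Incremental update: no raw segments need to be kept.
        self.n += 1
        self.sums = [s + v for s, v in zip(self.sums, seg)]


def seg_distance(a: Segment, b: Segment) -> float:
    # Assumption: start-to-start plus end-to-end Euclidean distance,
    # which also penalizes opposite directions.
    return math.hypot(a[0] - b[0], a[1] - b[1]) + math.hypot(a[2] - b[2], a[3] - b[3])


def simplify(traj: list[tuple[float, float]]) -> list[Segment]:
    # Placeholder: one segment per pair of consecutive points.
    # A real simplification step would keep fewer segments.
    return [(x1, y1, x2, y2) for (x1, y1), (x2, y2) in zip(traj, traj[1:])]


def update_online(micro: list[MicroCluster], batch, d_max: float) -> None:
    """Online part: fold a new batch of trajectories into the micro-clusters."""
    for traj in batch:
        for seg in simplify(traj):
            nearest = min(
                micro,
                key=lambda m: seg_distance(m.representative(), seg),
                default=None,
            )
            if nearest is not None and seg_distance(nearest.representative(), seg) <= d_max:
                nearest.absorb(seg)
            else:
                micro.append(MicroCluster.from_segment(seg))


def macro_cluster(micro: list[MicroCluster], eps: float) -> list[list[MicroCluster]]:
    """Offline part: group micro-clusters only (single-link, via union-find)."""
    parent = list(range(len(micro)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    reps = [m.representative() for m in micro]
    for i in range(len(reps)):
        for j in range(i + 1, len(reps)):
            if seg_distance(reps[i], reps[j]) <= eps:
                parent[find(i)] = find(j)

    groups: dict[int, list[MicroCluster]] = {}
    for i, m in enumerate(micro):
        groups.setdefault(find(i), []).append(m)
    return list(groups.values())
```

In this sketch, `update_online` would be called each time a batch arrives, while `macro_cluster` runs only when a result is requested. Its cost depends on the number of micro-clusters rather than on the number of raw trajectory points, which is the efficiency argument the abstract makes.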