Mining association rules between sets of items in large databases
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Partition based spatial-merge join
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Finding Localized Associations in Market Basket Data
IEEE Transactions on Knowledge and Data Engineering
Discovering Colocation Patterns from Spatial Data Sets: A General Approach
IEEE Transactions on Knowledge and Data Engineering
ACM Transactions on Database Systems (TODS)
Zonal Co-location Pattern Discovery with Dynamic Parameters
ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Finding regional co-location patterns for sets of continuous variables in spatial datasets
Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems
Hi-index | 0.00 |
Regional co-location patterns (RCPs) represent collections of feature types frequently located together in certain localities. For example, RCP Bar, Alcohol -- Crimes), Downtown suggests that a co-location pattern involving alcohol-related crimes and bars is often localized to downtown regions. Given a set of Boolean feature types, their geo-located instances, a spatial neighbor relation, and a prevalence threshold, the RCP discovery problem finds all prevalent RCPs (pairs of co-locations and their prevalence localities). RCP discovery is important in many societal applications, including public safety, public health, climate science and ecology. The RCP discovery problem involves three major challenges: (a) an exponential number of subsets of feature types, (b) an exponential number of candidate localities and (c) a tradeoff between accurately modeling pattern locality and achieving computational efficiency. Related work does not provide computationally efficient methods to discover all interesting RCPs with their natural prevalence localities. To address these limitations, this paper proposes a neighborhood graph based approach that discovers all interesting RCPs and is aware of a pattern's prevalence localities. We identify partitions based on the pattern instances and neighbor graph. We introduce two new interest measures, a regional participation ratio and a regional participation index to quantify the strength of RCPs. We present two new algorithms, Pattern Space (PS) enumeration and Maximal Locality (ML) enumeration and show that they are correct and complete. Experiments using real crime datasets show that ML pruning outperforms PS enumeration.