Mining for empty spaces in large data sets

Authors:
Jeff Edmonds;Jarek Gryz;Dongming Liang;Renée J. Miller
Affiliations:
Department of Computer Science, York University, 4700 Kelle Street, Toronto Ont., M3J 1P3, Canada;Department of Computer Science, York University, 4700 Kelle Street, Toronto Ont., M3J 1P3, Canada;Department of Computer Science, York University, 4700 Kelle Street, Toronto Ont., M3J 1P3, Canada;Department of Computer Science, University of Toronto, Toronto, Ont., Canada
Venue:
Theoretical Computer Science - Database theory
Year:
2003

Citing 10
Cited 15

Computing the largest empty rectangle

SIAM Journal on Computing
A note on finding a maximum empty rectangle

Discrete Applied Mathematics
Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
BIRCH: an efficient data clustering method for very large databases

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Association rules over interval data

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Computers and Intractability: A Guide to the Theory of NP-Completeness

Computers and Intractability: A Guide to the Theory of NP-Completeness
Implementation of Two Semantic Query Optimization Techniques in DB2 Universal Database

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Semantic Compression and Pattern Extraction with Fascicles

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Using Decision Tree Induction for Discovering Holes in Data

PRICAI '98 Proceedings of the 5th Pacific Rim International Conference on Artificial Intelligence: Topics in Artificial Intelligence
Discovering interesting holes in data

IJCAI'97 Proceedings of the Fifteenth international joint conference on Artifical intelligence - Volume 2

A Fast Algorithm for Finding Maximal Empty Rectangles for Dynamic FPGA Placement

Proceedings of the conference on Design, automation and test in Europe - Volume 1
An efficient algorithm for finding empty space for online FPGA placement

Proceedings of the 41st annual Design Automation Conference
Design of Fault-Tolerant and Dynamically-Reconfigurable Microfluidic Biochips

Proceedings of the conference on Design, Automation and Test in Europe - Volume 2
A rank-by-feature framework for interactive exploration of multidimensional data

Information Visualization
Module placement for fault-tolerant microfluidics-based biochips

Proceedings of the 41st annual Design Automation Conference
Fast mining of high dimensional expressive contrast patterns using zero-suppressed binary decision diagrams

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Fast algorithms for finding disjoint subsequences with extremal densities

Pattern Recognition
Maximal strips data structure to represent free space on partially reconfigurable FPGAs

International Journal of Parallel, Emergent and Distributed Systems - Advances in Parallel and Distributed Computational Models
Drawing borders efficiently

FUN'07 Proceedings of the 4th international conference on Fun with algorithms
Unbiased, adaptive stochastic sampling for rendering inhomogeneous participating media

ACM SIGGRAPH Asia 2010 papers
Architecture and operating system support for two-dimensional runtime partial reconfiguration

The Journal of Supercomputing
The mono- and bichromatic empty rectangle and square problems in all dimensions

LATIN'10 Proceedings of the 9th Latin American conference on Theoretical Informatics
Fast algorithms for finding disjoint subsequences with extremal densities

ISAAC'05 Proceedings of the 16th international conference on Algorithms and Computation
Finding the largest empty rectangle containing only a query point in large multidimensional databases

SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
Localized geometric query problems

Computational Geometry: Theory and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many data mining approaches focus on the discovery of similar (and frequent) data values in large data sets. We present an alternative, but complementary approach in which we search for empty regions in the data. We consider the problem of finding all maximal empty rectangles in large, two-dimensional data sets. We introduce a novel, scalable algorithm for finding all such rectangles. The algorithm achieves this with a single scan over a sorted data set and requires only a small bounded amount of memory. We extend the algorithm to find all maximal empty hyper-rectangles in a multi-dimensional space. We consider the complexity of this search problem and present new bounds on the number of maximal empty hyper-rectangles. We briefly overview experimental results obtained by applying our algorithm to real and synthetic data sets and describe one application of empty-space knowledge to query optimization.