Main Memory Evaluation of Monitoring Queries Over Moving Objects
Distributed and Parallel Databases
An approximate algorithm for top-k closest pairs join query in large high dimensional data
Data & Knowledge Engineering
Domain-independent data cleaning via analysis of entity-relationship graph
ACM Transactions on Database Systems (TODS)
Fast similarity join for multi-dimensional data
Information Systems
An empirical study on selective partitioning dimensions for partition-based similarity joins
Data & Knowledge Engineering
ACM Transactions on Database Systems (TODS)
Solving similarity joins and range queries in metric spaces with the list of twin clusters
Journal of Discrete Algorithms
Super-EGO: fast multi-dimensional similarity join
The VLDB Journal — The International Journal on Very Large Data Bases
Hi-index | 0.03 |
The efficient processing of similarity joins is importantfor a large class of applications. The dimensionality of thedata for these applications ranges from low to high. Mostexisting methods have focussed on the execution of high-dimensional joins over large amount of disk-based data.The increasing sizes of main memory available on currentcomputers, and the need for efficient processing of patialjoins suggest that spatial joins for a large class of problemscan be processed in main memory. In this paper we developtwo new spatial join algorithms, the Grid-join and EGO*-join, and study their performance in comparison to the stateof the art algorithm EGO-join and the RSJ algorithm.Through evaluation we explore the domain of applicability of each algorithm and provide recommendations for thechoice of join algorithm depending upon the dimensionality of the data as well as the critical \varepsilon parameter. We alsopoint out the significance of the choice of this parameter forensuring that the electivity achieved is reasonable.