The Hybrid Tree: An Index Structure for High Dimensional Feature Spaces

Authors:
Affiliations:
Venue:
ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Year:
1999

Citing 0
Cited 56

Vector approximation based indexing for non-uniform high dimensional data sets

Proceedings of the ninth international conference on Information and knowledge management
A cost model for query processing in high dimensional data spaces

ACM Transactions on Database Systems (TODS)
Modeling high-dimensional index structures using sampling

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Adaptive nearest neighbor search for relevance feedback in large image databases

MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Efficient processing of conical queries

Proceedings of the tenth international conference on Information and knowledge management
Locally adaptive dimensionality reduction for indexing large time series databases

ACM Transactions on Database Systems (TODS)
Searching in metric spaces with user-defined and approximate distances

ACM Transactions on Database Systems (TODS)
How to improve the pruning ability of dynamic metric access methods

Proceedings of the eleventh international conference on Information and knowledge management
SP-GiST: An Extensible Database Index for Supporting Space Partitioning Trees

Journal of Intelligent Information Systems
VQ-index: an index structure for similarity searching in multimedia databases

Proceedings of the tenth ACM international conference on Multimedia
Local Dimensionality Reduction: A New Approach to Indexing High Dimensional Spaces

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Efficient Similarity Search in Feature Spaces with the Q-Tree

ADBIS '02 Proceedings of the 6th East European Conference on Advances in Databases and Information Systems
A Simple Dimensionality Reduction Technique for Fast Similarity Search in Large Time Series Databases

PADKK '00 Proceedings of the 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Current Issues and New Applications
The SH-tree: A Super Hybrid Index Structure for Multidimensional Data

DEXA '01 Proceedings of the 12th International Conference on Database and Expert Systems Applications
A hierarchical access control model for video database systems

ACM Transactions on Information Systems (TOIS)
CSVD: Clustering and Singular Value Decomposition for Approximate Similarity Search in High-Dimensional Spaces

IEEE Transactions on Knowledge and Data Engineering
Effective Management of Hierarchical Storage Using Two Levels of Data Clustering

MSS '03 Proceedings of the 20 th IEEE/11 th NASA Goddard Conference on Mass Storage Systems and Technologies (MSS'03)
QCluster: relevance feedback using adaptive clustering for content-based image retrieval

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
ClusterTree: Integration of Cluster Representation and Nearest-Neighbor Search for Large Data Sets with High Dimensions

IEEE Transactions on Knowledge and Data Engineering
Kernel VA-files for relevance feedback retrieva

MMDB '03 Proceedings of the 1st ACM international workshop on Multimedia databases
An Enhanced Concurrency Control Scheme for Multidimensional Index Structures

IEEE Transactions on Knowledge and Data Engineering
High dimensional reverse nearest neighbor queries

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Dimensionality reduction using magnitude and shape approximations

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Index-driven similarity search in metric spaces (Survey Article)

ACM Transactions on Database Systems (TODS)
Evaluating Refined Queries in Top-k Retrieval Systems

IEEE Transactions on Knowledge and Data Engineering
On accessing data in high-dimensional spaces: a comparative study of three space partitioning strategies

Journal of Systems and Software - Special issue: Performance modeling and analysis of computer systems and networks
Object-based and image-based object representations

ACM Computing Surveys (CSUR)
Compressing Bitmap Indices by Data Reorganization

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
iDistance: An adaptive B+-tree based indexing method for nearest neighbor search

ACM Transactions on Database Systems (TODS)
DDR: an index method for large time-series datasets

Information Systems
A space-partitioning-based indexing method for multidimensional non-ordered discrete data spaces

ACM Transactions on Information Systems (TOIS)
Dynamic indexing for multidimensional non-ordered discrete data spaces using a data-partitioning approach

ACM Transactions on Database Systems (TODS)
InMAF: indexing music databases via multiple acoustic features

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
A geometrical solution to time series searching invariant to shifting and scaling

Knowledge and Information Systems
High dimensional nearest neighbor searching

Information Systems
Efficient benchmarking of content-based image retrieval via resampling

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
The Concentration of Fractional Distances

IEEE Transactions on Knowledge and Data Engineering
The ND-tree: a dynamic indexing technique for multidimensional non-ordered discrete data spaces

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Optimal subspace dimensionality for k-nearest-neighbor queries on clustered and dimensionality reduced datasets with SVD

Multimedia Tools and Applications
Automatic image annotation using visual content and folksonomies

Multimedia Tools and Applications
The C-ND tree: a multidimensional index for hybrid continuous and non-ordered discrete data spaces

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
The Optimization of In-Memory Space Partitioning Trees for Cache Utilization

IEICE - Transactions on Information and Systems
A revised r*-tree in comparison with related index structures

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
An efficient high-dimensional indexing method for content-based retrieval in large image databases

Image Communication
The MM-tree: a memory-based metric tree without overlap between nodes

ADBIS'07 Proceedings of the 11th East European conference on Advances in databases and information systems
High-dimensional indexing: transformational approaches to high-dimensional range and similarity searches

High-dimensional indexing: transformational approaches to high-dimensional range and similarity searches
Variable granularity space filling curve for indexing multidimensional data

ADBIS'11 Proceedings of the 15th international conference on Advances in databases and information systems
An incremental updating method for clustering-based high-dimensional data indexing

CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
An index structure for parallel processing of multidimensional data

WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
Indexing structures for content-based retrieval of large image databases: a review

AIRS'05 Proceedings of the Second Asia conference on Asia Information Retrieval Technology
An efficient phantom protection method for multi-dimensional index structures

DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
Privacy-Preserving search and updates for outsourced tree-structured data on untrusted servers

iTrust'05 Proceedings of the Third international conference on Trust Management
Semantic visualization of 3D urban environments

Multimedia Tools and Applications
Time-series data mining

ACM Computing Surveys (CSUR)
Indexing RFID data using the VG-curve

ADC '12 Proceedings of the Twenty-Third Australasian Database Conference - Volume 124
Parallel multi-dimensional range query processing with R-trees on GPU

Journal of Parallel and Distributed Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Feature based similarity search is emerging as an important search paradigm in database systems. The technique used is to map the data items as points into a high dimensional feature space which is indexed using a multidimensional data structure. Similarity search then corresponds to a range search over the data structure. Although several data structures have been proposed for feature indexing, none of them is known to scale beyond 10-15 dimensional spaces. This paper introduces the hybrid tree -- a multidimensional data structure for indexing high dimensional feature spaces. Unlike other multidimensional data structures, the hybrid tree cannot be classified as either a pure data partitioning (DP) index structure (e.g., R-tree, SS-tree, SR-tree) or a pure space partitioning (SP) one (e.g., KDB-tree, hB-tree); rather, it ``combines'' positive aspects of the two types of index structures a single data structure to achieve search performance more scalable to high dimensionalities than either of the above techniques (hence, the name ``hybrid''). Furthermore, unlike many data structures (e.g., distance based index structures like SS-tree, SR-tree), the hybrid tree can support queries based on arbitrary dist ance functions. Our experiments on ``real'' high dimensional large size feature databases demonstrate that the hybrid tree scal es well to high dimensionality and large database sizes. It significantly outperforms both purely DP-based and SP-based index mechanisms as well as linear scan at all dimensionalities for large sized databases.