Indexing High-Dimensional Data for Content-Based Retrieval in Large Databases

Authors:
Manuel J. Fonseca;Joaquim A. Jorge
Affiliations:
-;-
Venue:
DASFAA '03 Proceedings of the Eighth International Conference on Database Systems for Advanced Applications
Year:
2003

Citing 0
Cited 19

Sketch-based retrieval of ClipArt drawings

Proceedings of the working conference on Advanced visual interfaces
Multi-step density-based clustering

Knowledge and Information Systems
Interactive high-dimensional index for large Chinese calligraphic character databases

ACM Transactions on Asian Language Information Processing (TALIP)
Composite distance transformation for indexing and k-nearest-neighbor searching in high-dimensional spaces

Journal of Computer Science and Technology
Indexing high-dimensional data in dual distance spaces: a symmetrical encoding approach

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Geometric matching for clip-art drawing retrieval

Journal of Visual Communication and Image Representation
A Web-Based Search Engine for Chinese Calligraphic Manuscript Images

ICWL '09 Proceedings of the 8th International Conference on Advances in Web Based Learning
Sketch-based retrieval of complex drawings using hierarchical topology and geometry

Computer-Aided Design
Sketch-based retrieval of drawings using spatial proximity

Journal of Visual Languages and Computing
Thesaurus-based 3D Object Retrieval with Part-in-Whole Matching

International Journal of Computer Vision
Efficient nearest neighbor query based on extended B+-tree in high-dimensional space

Pattern Recognition Letters
MuVis: an application for interactive exploration of large music collections

Proceedings of the international conference on Multimedia
Subspace tree: high dimensional multimedia indexing with logarithmic temporal complexity

Journal of Intelligent Information Systems
Fast answering k-nearest-neighbor queries over large image databases using dual distance transformation

MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
Parallel density-based clustering of complex objects

PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Time-HOBI: Index for optimizing star queries

Information Systems
Mixing images and sketches for retrieving vector drawings

EGMM'04 Proceedings of the Seventh Eurographics conference on Multimedia
Towards 3D modeling using sketches and retrieval

SBM'04 Proceedings of the First Eurographics conference on Sketch-Based Interfaces and Modeling
PL-Tree: an efficient indexing method for high-dimensional data

SSTD'13 Proceedings of the 13th international conference on Advances in Spatial and Temporal Databases

Abstract

Many indexing approaches for high-dimensional data points have evolved into very complex and hard-to-code algorithms. Sometimes this complexity is not matched by an increase in performance. Motivated by these ideas, we take a step back and look at simpler approaches to indexing multimedia data. In this paper we propose a simple (not simplistic) yet efficient indexing structure for high-dimensional data points of variable dimension, using dimension reduction. Our approach maps multidimensional points to a 1D line by computing their Euclidean norm and uses a B+-Tree to store the data points. We exploit the B+-Tree's efficient sequential search to develop simple yet performant methods to implement point, range and nearest-neighbor queries. To evaluate our technique we conducted a set of experiments, using both synthetic and real data. We analyze creation, insertion and query times as a function of data set size and dimension. Results so far show that our simple scheme outperforms current approaches, such as the Pyramid Technique, the A-Tree and the SR-Tree, for many data distributions. Moreover, our approach seems to scale better both with growing dimensionality and data set size, while exhibiting low insertion and search times.
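
The abstract gives no implementation details, so the following is only a rough sketch of how a norm-keyed 1D index could answer a range query, not the authors' method. It assumes that a sorted Python list can stand in for the B+-Tree, that all points share one dimension, and that candidates are pruned with the reverse triangle inequality (a point within distance r of a query q must have a norm within r of the norm of q). The names NormIndex, insert and range_query are hypothetical.

```python
import bisect
import math


def norm(p):
    """Euclidean norm of a point, used as its 1D key."""
    return math.sqrt(sum(x * x for x in p))


def dist(p, q):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


class NormIndex:
    """Hypothetical norm-keyed index.

    Assumption: a sorted list replaces the B+-Tree; both support
    ordered key lookup followed by a sequential scan.
    """

    def __init__(self):
        self._keys = []    # norms, kept in ascending order
        self._points = []  # points, stored in the same order as _keys

    def insert(self, point):
        key = norm(point)
        i = bisect.bisect_right(self._keys, key)
        self._keys.insert(i, key)
        self._points.insert(i, tuple(point))

    def range_query(self, q, r):
        """Return all stored points within distance r of q."""
        qn = norm(q)
        # Only keys in [|q| - r, |q| + r] can belong to the answer.
        lo = bisect.bisect_left(self._keys, qn - r)
        hi = bisect.bisect_right(self._keys, qn + r)
        # Scan the candidates sequentially and check the true distance.
        return [p for p in self._points[lo:hi] if dist(p, q) <= r]
```

The norm filter can only discard points, never wrongly accept them, so the final distance check is still required: many points far from the query may share a similar norm. A nearest-neighbor query could plausibly reuse range_query with a growing radius, but that is an assumption rather than something the abstract states.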