Combining fuzzy information: an overview
ACM SIGMOD Record
Introduction to MPEG-7: Multimedia Content Description Interface
Introduction to MPEG-7: Multimedia Content Description Interface
Similarity Search: The Metric Space Approach (Advances in Database Systems)
Similarity Search: The Metric Space Approach (Advances in Database Systems)
Image retrieval: Ideas, influences, and trends of the new age
ACM Computing Surveys (CSUR)
Counting distance permutations
Journal of Discrete Algorithms
Generic similarity search engine demonstrated by an image retrieval application
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Measuring the difficulty of distance-based indexing
SPIRE'05 Proceedings of the 12th international conference on String Processing and Information Retrieval
Distinct nearest neighbors queries for similarity search in very large multimedia databases
Proceedings of the eleventh international workshop on Web information and data management
On locality-sensitive indexing in generic metric spaces
Proceedings of the Third International Conference on SImilarity Search and APplications
An approach to content-based image retrieval based on the Lucene search engine library
ECDL'10 Proceedings of the 14th European conference on Research and advanced technology for digital libraries
Stabilizing the recall in similarity search
Proceedings of the Fourth International Conference on SImilarity Search and APplications
Automatic weight selection for multi-metric distances
Proceedings of the Fourth International Conference on SImilarity Search and APplications
Similarity query postprocessing by ranking
AMR'10 Proceedings of the 8th international conference on Adaptive Multimedia Retrieval: context, exploration, and fusion
Large-scale similarity data management with distributed Metric Index
Information Processing and Management: an International Journal
Visual image search: feature signatures or/and global descriptors
SISAP'12 Proceedings of the 5th international conference on Similarity Search and Applications
Hi-index | 0.01 |
The Content-based Photo Image Retrieval (CoPhIR) dataset is the largest available database of digital images with corresponding visual descriptors. It contains five MPEG-7 global descriptors extracted from more than 106 million images from Flickr photo-sharing system. In this paper, we analyze this dataset focusing on 1) efficiency of similarity-based indexing and searching and on 2) expressiveness of combination of the descriptors with respect to subjective perception of visual similarity. We treat the descriptors as metric spaces and then combine them into a multi-metric space. We analyze distance distributions of individual descriptors, measure intrinsic dimensionality of these datasets and statistically evaluate correlation between these descriptors. Further, we use two methods to assess subjective accuracy and satisfaction of similarity retrieval based on a combination of descriptors that is recommended for CoPhIR, and we compare these results on databases of 10 and 100 million CoPhIR images. Finally, we suggest, explore and evaluate two approaches to improve the accuracy: 1) applying logarithms in order to weaken influence of a single descriptor contribution if it deviates from the rest, and 2) the possibility of categorization of the dataset and identifying visual characteristics important for individual categories.