Distance approximation techniques to reduce the dimensionality for multimedia databases

  • Authors:
  • Yongkwon Kim; Chin-Wan Chung; Seok-Lyong Lee; Deok-Hwan Kim

  • Affiliations:
  • KAIST, Division of Computer Science, 305-701, Daejeon, Korea; KAIST, Division of Computer Science, 305-701, Daejeon, Korea; Hankuk University of Foreign Studies, School of Industrial and Information Engineering, 449-701, Yongin-si, Gyeonggi-do, Korea; Inha University, School of Electronics and Electrical Engineering, 402-751, Incheon, Korea

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2011

Abstract

Databases are increasingly used to store multimedia data such as images, maps, video clips, and music clips. To make such data searchable, each object is represented by features in the form of high-dimensional vectors. The resulting growth in dimensionality leads to ‘the curse of dimensionality’, which degrades the performance of index structures. Dimensionality reduction has been studied as a remedy. However, some reduction methods do not guarantee the absence of false dismissals, while others incur high computational cost. This paper proposes dimensionality reduction techniques that guarantee no false dismissal while providing considerable efficiency by approximating distances with a few values. To provide the no-false-dismissal property, an approximated distance must never exceed the original distance. The Cauchy–Schwarz inequality and two trigonometric equations are used, and a dimension partitioning technique is applied, to approximate distances so that the difference between the approximated distance and the original distance is reduced. As a result, the proposed techniques reduce the candidate set of a query result for efficient query processing.
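
The abstract does not give the paper's actual formulas, so the following is only a minimal sketch of the general idea, assuming Euclidean distance and contiguous, roughly equal-width partitions. It uses the standard consequence of the Cauchy–Schwarz inequality that, for each partition p, ‖x_p − y_p‖² = ‖x_p‖² + ‖y_p‖² − 2·x_p·y_p ≥ (‖x_p‖ − ‖y_p‖)², so summing over partitions gives a lower bound on the full distance. The two trigonometric equations mentioned in the abstract are not modelled here, and all function names (`partition_norms`, `lower_bound_distance`, `range_query`) are hypothetical.

```python
import math


def partition_norms(vec, num_parts):
    """Reduce a vector to one value per partition: the block's Euclidean norm.

    Assumption: partitions are contiguous blocks of (nearly) equal width.
    """
    n = len(vec)
    bounds = [(i * n) // num_parts for i in range(num_parts + 1)]
    return [
        math.sqrt(sum(v * v for v in vec[bounds[i]:bounds[i + 1]]))
        for i in range(num_parts)
    ]


def lower_bound_distance(norms_a, norms_b):
    """Approximate distance computed from the reduced representations only.

    By Cauchy-Schwarz, this never exceeds the true Euclidean distance.
    """
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(norms_a, norms_b)))


def euclidean(a, b):
    """Exact Euclidean distance in the original space."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def range_query(query, data, sketches, radius, num_parts):
    """Filter-and-refine range query.

    Filter: keep objects whose lower bound is within the radius (no true
    answer can be discarded, since the bound never exceeds the distance).
    Refine: verify the surviving candidates with the exact distance.
    Returns (result indices, candidate indices).
    """
    q_norms = partition_norms(query, num_parts)
    candidates = [
        i for i, s in enumerate(sketches)
        if lower_bound_distance(q_norms, s) <= radius
    ]
    results = [i for i in candidates if euclidean(query, data[i]) <= radius]
    return results, candidates


if __name__ == "__main__":
    data = [[3.0, 4.0, 0.0, 0.0], [0.0, 0.0, 3.0, 4.0], [4.0, 3.0, 0.0, 0.0]]
    sketches = [partition_norms(v, 2) for v in data]
    print(range_query([3.0, 4.0, 0.0, 0.0], data, sketches, 1.0, 2))
```

In this toy data set the third vector has the same partition norms as the query, so it survives the filter step and is rejected only by the exact distance check. That is the price of a loose bound: no true answer is lost, but the candidate set grows. Using more partitions generally tightens the bound and shrinks the candidate set, at the cost of storing more values per object.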