On vocabulary size in bag-of-visual-words representation

  • Authors:
  • Jian Hou;Jianxin Kang;Naiming Qi

  • Affiliations:
  • School of Astronautics, Harbin Institute of Technology, Harbin, China;School of Astronautics, Harbin Institute of Technology, Harbin, China and School of Engineering, Northeast Agriculture University, Harbin, China;School of Astronautics, Harbin Institute of Technology, Harbin, China

  • Venue:
  • PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Bag-of-visual-words is a popular image representation that produces high matching accuracy and efficiency. While vocabulary size impacts on matching accuracy, existing research usually selects the vocabulary size empirically. Research on representative local descriptors shows that with similarity based clustering, the intra-cluster similarity extent of descriptors plays the same role in straightforward matching as vocabulary size in visual words matching. Based on this observation, we propose to use similarity based clustering to determine the optimal vocabulary size for a given dataset in visual words matching. Preliminary experiments with three datasets produce encouraging results and demonstrate the potential of the proposed approach.