Region Contextual Visual Words for scene categorization

  • Authors:
  • Shuoyan Liu;De Xu;Songhe Feng

  • Affiliations:
  • Institute of Computer & Information Technology, Beijing Jiaotong University, Beijing 100044, China;Institute of Computer & Information Technology, Beijing Jiaotong University, Beijing 100044, China;Institute of Computer & Information Technology, Beijing Jiaotong University, Beijing 100044, China and Beijing Key Lab of Intelligent Telecommunications Software and Multimedia, Beijing University ...

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2011

Quantified Score

Hi-index 12.05

Visualization

Abstract

This paper proposes a method for scene categorization by integrating region contextual information into the popular Bag-of-Visual-Words approach. The Bag-of-Visual-Words approach describes an image as a bag of discrete visual words, where the frequency distributions of these words are used for image categorization. However, the traditional visual words suffer from the problem when faced these patches with similar appearances but distinct semantic concepts. The drawback stems from the independently construction each visual word. This paper introduces Region-Conditional Random Fields model to learn each visual word depending on the rest of the visual words in the same region. Comparison with the traditional Conditional Random Fields model, there are two areas of novelty. First, the initial label of each patch is automatically defined based on its visual feature rather than manually labeling with semantic labels. Furthermore, the novel potential function is built under the region contextual constraint. The experimental results on the three well-known datasets show that Region Contextual Visual Words indeed improves categorization performance compared to traditional visual words.