Object categorization with sketch representation and generalized samples

  • Authors:
  • Liang Lin;Xiaobai Liu;Shaowu Peng;Hongyang Chao;Yongtian Wang;Bo Jiang

  • Affiliations:
  • School of Software, Sun Yat-Sen University, Guangzhou, China and SCTS & CGCL, Huazhong University of Science and Technology, Wuhan, China;SCTS & CGCL, Huazhong University of Science and Technology, Wuhan, China and Lotus Hill Research Institute, China;Lotus Hill Research Institute, China;School of Software, Sun Yat-Sen University, Guangzhou, China;Beijing Institute of Technology, Beijing, China;School of Software, Sun Yat-Sen University, Guangzhou, China

  • Venue:
  • Pattern Recognition
  • Year:
  • 2012

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, we present a framework for object categorization via sketch graphs that incorporate shape and structure information. In this framework, we integrate the learnable And-Or graph model, a hierarchical structure that combines the reconfigurability of a stochastic context free grammar (SCFG) with the constraints of a Markov random field (MRF). Considering the computation efficiency, we generalize instances from the And-Or graph models and perform a set of sequential tests for cascaded object categorization, rather than directly inferring with the And-Or graph models. We study 33 categories, each consisting of a small data set of 30 instances, and 30 additional templates with varied appearance are generalized from the learned And-Or graph model. These samples better span the appearance space and form an augmented training set @W"T of 1980 (60x33) training templates. To perform recognition on a testing image, we use a set of sequential tests to project @W"T into different representation spaces to narrow the number of candidate matches in @W"T. We use ''graphlets'' (structural elements), as our local features and model @W"T at each stage using histograms of graphlets over categories, histograms of graphlets over object instances, histograms of pairs of graphlets over objects, and shape context. Each test is increasingly computationally expensive, and by the end of the cascade we have a small candidate set remaining to use with our most powerful test, a top-down graph matching algorithm. We apply the proposed approach on the challenging public dataset including 33 object categories, and achieve state-of-the-art performance.