Image Parsing: Unifying Segmentation, Detection, and Recognition

  • Authors:
  • Zhuowen Tu;Xiangrong Chen;Alan L. Yuille;Song-Chun Zhu

  • Affiliations:
  • -;-;-;-

  • Venue:
  • ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a general framework for parsing images into regions andobjects. In this framework, the detection and recognition ofobjects proceed simultaneously with image segmentation in acompetitive and cooperative manner. We illustrate our approach onnatural images of complex city scenes where the objects of primaryinterest are faces and text. This method makes use of bottom-upproposals combined with top-down generative models using the DataDriven Markov Chain Monte Carlo (DDMCMC) algorithm which isguaranteed to converge to the optimal estimate asymptotically. Moreprecisely, we define generative models for faces, text, and genericregions- e.g. shading, texture, and clutter. These models areactivated by bottom-up proposals. The proposals for faces and textare learnt using a probabilistic version of AdaBoost. The DDMCMCcombines reversible jump and diffusion dynamics to enable thegenerative models to explain the input images in a competitive andcooperative manner. Our experiments illustrate the advantages andimportance of combining bottom-up and top-down models and ofperforming segmentation and object detection/recognitionsimultaneously.