Learning Recognition and Segmentation Using the Cresceptron

  • Authors:
  • John (Juyang) Weng;Narendra Ahuja;Thomas S. Huang

  • Affiliations:
  • Department of Computer Science, Michigan State University, East Lansing, MI 48824 USA;Beckman Institute, 405 N. Mathews Avenue, University of Illinois, Urbana, IL 61801 USA;Beckman Institute, 405 N. Mathews Avenue, University of Illinois, Urbana, IL 61801 USA

  • Venue:
  • International Journal of Computer Vision
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a framework called Cresceptron for view-basedlearning, recognition and segmentation. Specifically, it recognizesand segments image patterns that are similar to those learned, usinga stochastic distortion model and view-based interpolation, allowingother view points that are moderately different from those used inlearning. The learning phase is interactive. The user trains thesystem using a collection of training images. For each trainingimage, the user manually draws a polygon outlining the region ofinterest and types in the label of its class. Then, from thedirectional edges of each of the segmented regions, the Cresceptronuses a hierarchical self-organization scheme to grow a sparselyconnected network automatically, adaptively and incrementally duringthe learning phase. At each level, the system detects new imagestructures that need to be learned and assigns a new neural plane foreach new feature. The network grows by creating new nodes andconnections which memorize the new image structures and their contextas they are detected. Thus, the structure of the network is afunction of the training exemplars. The Cresceptron incorporates bothindividual learning and class learning; with the former, eachtraining example is treated as a different individual while with thelatter, each example is a sample of a class. In the performancephase, segmentation and recognition are tightly coupled. Noforeground extraction is necessary, which is achieved by backtrackingthe response of the network down the hierarchy to the image partscontributing to recognition. Several stochastic shape distortionmodels are analyzed to show why multilevel matching such as that inthe Cresceptron can deal with more general stochastic distortionsthat a single-level matching scheme cannot. The system isdemonstrated using images from broadcast television and other videosegments to learn faces and other objects, and then later to locateand to recognize similar, but possibly distorted, views of the sameobjects.