A Principal Component Clustering Approach to Object-Oriented Motion Segmentation and Estimation

  • Authors:
  • Yun-Ting Lin;Yen-Kuang Chen;S. Y. Kung

  • Affiliations:
  • Department of Electrical Engineering, Princeton University, Princeton, NJ 08544;Department of Electrical Engineering, Princeton University, Princeton, NJ 08544;Department of Electrical Engineering, Princeton University, Princeton, NJ 08544

  • Venue:
  • Journal of VLSI Signal Processing Systems - Special issue on recent development in video: algorithms, implementation and applications
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a framework for object-oriented scenesegmentation in video, which uses motion as the major characteristic todistinguish different moving objects and then to segment the scene intoobject regions. From the feature block (FB) correspondences through at leasttwo frames obtained via a tracking algorithm, the reference featuremeasurement matrix and feature displacement matrix are formed. We propose atechnique for initial motion clustering of the FBs, in which the principalcomponents (PC) of the two matrices are adopted as the motion features. Themotion features have several advantages: (1) They are low-dimensional(2-dim). (2) They preserve well both the spatial closeness and the motionsimilarity of their corresponding FBs. (3) They tend to form distinctiveclusters in the feature space, thus allowing simple clustering schemes to beapplied. The Expectation-Maximization (EM) algorithm is applied forclustering the motion features. For those scenes involving mainly the cameramotion, the PC-based motion features will exhibit nearly parallel lines inthe feature space. This facilitates a simple and yet effective layerextraction scheme. The final motion-based segmentation involves labeling ofall the blocks in the frame. The EM algorithm is again applied to minimizean energy function which takes motion consistency andneighborhood-sensitivity into account. The proposed algorithm has beenapplied to several test sequences and the simulation results suggest apromising potential for video applications.