The synergy of 3D SIFT and sparse codes for classification of viewpoints from echocardiogram videos

  • Authors:
  • Yu Qian;Lianyi Wang;Chunyan Wang;Xiaohong Gao

  • Affiliations:
  • School of Engineering and Information Sciences, Middlesex University, U.K.;Heart Center, First Hospital of Tsinghua University, China;Heart Center, First Hospital of Tsinghua University, China;School of Engineering and Information Sciences, Middlesex University, U.K.

  • Venue:
  • MCBR-CDS'12 Proceedings of the Third MICCAI international conference on Medical Content-Based Retrieval for Clinical Decision Support
  • Year:
  • 2012

Abstract

Echocardiography plays an important role as a diagnostic aid in cardiology. During an echocardiogram exam, images or image sequences are usually taken from different locations and in various directions in order to obtain a comprehensive view of the anatomical structure of the 3D moving heart. The automatic classification of echocardiograms by viewpoint is an essential step in computer-aided diagnosis. The challenge remains the high noise-to-signal ratio of echocardiography, which leads to low-resolution echocardiograms. In this paper, a new synergy of well-established algorithms is proposed to classify the view positions of echocardiograms. Bags of Words (BoW) are coupled with linear SVMs. Sparse coding is employed to train an echocardiogram video dictionary from a set of 3D SIFT descriptors of space-time interest points detected by a Cuboid detector. Max-pooled features at multiple scales are used to represent each echocardiogram video. A linear multiclass SVM is employed to classify echocardiogram videos into eight views. The evaluation is carried out on a collection of 219 echocardiogram videos. The preliminary results show an Average Accuracy Rate (AAR) of 72% for classification into eight view angles and 90% for three primary view locations.
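
To make the pooling step in the abstract more concrete, the following is a minimal sketch of multi-scale max pooling over sparse codes. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say how the scales are defined, so this sketch assumes the video is split into equal temporal segments at each scale, and it pools the absolute values of the codes. The function names `max_pool` and `multiscale_max_pool`, the `times` input, and the default scales are all hypothetical.

```python
from typing import List, Sequence


def max_pool(codes: Sequence[Sequence[float]], dim: int) -> List[float]:
    """Element-wise maximum of absolute code values.

    Returns a zero vector of length `dim` when `codes` is empty.
    """
    pooled = [0.0] * dim
    for code in codes:
        for k in range(dim):
            pooled[k] = max(pooled[k], abs(code[k]))
    return pooled


def multiscale_max_pool(
    codes: Sequence[Sequence[float]],
    times: Sequence[float],
    dim: int,
    scales: Sequence[int] = (1, 2, 4),
) -> List[float]:
    """Concatenate max-pooled codes over temporal segments at several scales.

    codes: one sparse code of length `dim` per space-time interest point.
    times: normalized temporal position of each point, in [0, 1].
    scales: number of equal temporal segments per scale (an assumption here).
    """
    feature: List[float] = []
    for s in scales:
        for cell in range(s):
            # Assign each point to a segment; clamp t == 1.0 into the last one.
            in_cell = [
                c for c, t in zip(codes, times)
                if min(int(t * s), s - 1) == cell
            ]
            feature.extend(max_pool(in_cell, dim))
    return feature


# Toy example: three interest points coded over a 3-atom dictionary.
codes = [[0.0, 0.5, 0.0], [-0.9, 0.0, 0.0], [0.0, 0.0, 0.2]]
times = [0.1, 0.4, 0.8]
video_feature = multiscale_max_pool(codes, times, dim=3, scales=(1, 2))
```

The resulting vector has length `dim * sum(scales)` and would serve as the fixed-length video representation passed to a linear multiclass SVM. In a real pipeline the codes would come from sparse coding of 3D SIFT descriptors against the learned dictionary, which this sketch does not cover.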