Randomness and sparsity induced codebook learning with application to cancer image classification

  • Authors:
  • Quannan Li;Cong Yao;Liwei Wang;Zhuowen Tu

  • Affiliations:
  • Lab of Neuro Imaging, University of California, Los Angeles and Microsoft Research Asia, China;Microsoft Research Asia, China, Huazhong University of Science and Technology, China;Microsoft Research Asia, China, The Chinese University of Hong Kong, China;Lab of Neuro Imaging, University of California, Los Angeles and Microsoft Research Asia, China

  • Venue:
  • MCV'12 Proceedings of the Second international conference on Medical Computer Vision: recognition techniques and applications in medical imaging
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Codebook learning is one of the central research topics in computer vision and machine learning. In this paper, we propose a new codebook learning algorithm, Randomized Forest Sparse Coding (RFSC), by harvesting the following three concepts: (1) ensemble learning, (2) divide-and-conquer, and (3) sparse coding. Given a set of training data, a randomized tree can be used to perform data partition (divide-and-conquer); after a tree is built, a number of bases are learned from the data within each leaf node for a sparse representation (subspace learning via sparse coding); multiple trees with diversities are trained (ensemble), and the collection of bases of these trees constitute the codebook. These three concepts in our codebook learning algorithm have the same target but with different emphasis: subspace learning via sparse coding makes a compact representation, and reduces the information loss; the divide-and-conquer process efficiently obtains the local data clusters; an ensemble of diverse trees provides additional robustness. We have conducted classification experiments on cancer images as well as a variety of natural image datasets and the experiment results demonstrate the efficiency and effectiveness of the proposed method.