Semi-supervised kernel canonical correlation analysis with application to human fMRI

  • Authors:
  • Matthew B. Blaschko;Jacquelyn A. Shelton;Andreas Bartels;Christoph H. Lampert;Arthur Gretton

  • Affiliations:
  • Department of Engineering Science, University of Oxford, Parks Road, Oxford OX1 3PJ, United Kingdom;Frankfurt Institute for Advanced Studies, Goethe Universität Frankfurt, Ruth-Moufang-Str. 1, 60438 Frankfurt am Main, Germany;Centre for Integrative Neuroscience, Universität Tübingen, Paul-Ehrlich-Str. 17, 72076 Tübingen, Germany and Department of Neurophysiology, Max Planck Institute for Biological Cyber ...;Institute of Science and Technology (IST), Am Campus 1, 3400 Klosterneuburg, Austria;Gatsby Computational Neuroscience Unit, University College London, Alexandra House, 17 Queen Square, London WC1N 3AR, United Kingdom and Department of Empirical Inference, Max Planck Institute for ...

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2011

Quantified Score

Hi-index 0.10

Visualization

Abstract

Kernel canonical correlation analysis (KCCA) is a general technique for subspace learning that incorporates principal components analysis (PCA) and Fisher linear discriminant analysis (LDA) as special cases. By finding directions that maximize correlation, KCCA learns representations that are more closely tied to the underlying process that generates the data and can ignore high-variance noise directions. However, for data where acquisition in one or more modalities is expensive or otherwise limited, KCCA may suffer from small sample effects. We propose to use semi-supervised Laplacian regularization to utilize data that are present in only one modality. This approach is able to find highly correlated directions that also lie along the data manifold, resulting in a more robust estimate of correlated subspaces. Functional magnetic resonance imaging (fMRI) acquired data are naturally amenable to subspace techniques as data are well aligned. fMRI data of the human brain are a particularly interesting candidate. In this study we implemented various supervised and semi-supervised versions of KCCA on human fMRI data, with regression to single and multi-variate labels (corresponding to video content subjects viewed during the image acquisition). In each variate condition, the semi-supervised variants of KCCA performed better than the supervised variants, including a supervised variant with Laplacian regularization. We additionally analyze the weights learned by the regression in order to infer brain regions that are important to different types of visual processing.