Detecting outlying subjects in high-dimensional neuroimaging datasets with regularized minimum covariance determinant

  • Authors:
  • Virgile Fritsch;Gael Varoquaux;Benjamin Thyreau;Jean-Baptiste Poline;Bertrand Thirion

  • Affiliations:
  • Parietal Team, INRIA Saclay-Île-de-France, Saclay and CEA, DSV, I2BM, Gif-Sur-Yvette, France;Inserm, Gif-Sur-Yvette and Parietal Team, INRIA Saclay-Île-de-France, Saclay and CEA, DSV, I2BM, Gif-Sur-Yvette, France;CEA, DSV, I2BM, Gif-Sur-Yvette, France;CEA, DSV, I2BM, Gif-Sur-Yvette and Parietal Team, INRIA Saclay-Île-de-France, Saclay;Parietal Team, INRIA Saclay-Île-de-France, Saclay and CEA, DSV, I2BM, Gif-Sur-Yvette, France

  • Venue:
  • MICCAI'11 Proceedings of the 14th international conference on Medical image computing and computer-assisted intervention - Volume Part III
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Medical imaging datasets used in clinical studies or basic research often comprise highly variable multi-subject data. Statisticallycontrolled inclusion of a subject in a group study, i.e. deciding whether its images should be considered as samples from a given population or whether they should be rejected as outlier data, is a challenging issue. While the informal approaches often used do not provide any statistical assessment that a given dataset is indeed an outlier, traditional statistical procedures are not well-suited to the noisy, high-dimensional, settings encountered in medical imaging, e.g. with functional brain images. In this work, we modify the classical Minimum Covariance Determinant approach by adding a regularization term, that ensures that the estimation is well-posed in high-dimensional settings and in the presence of many outliers. We show on simulated and real data that outliers can be detected satisfactorily, even in situations where the number of dimensions of the data exceeds the number of observations.