Active learning for an efficient training strategy of computer-aided diagnosis systems: application to diabetic retinopathy screening

  • Authors:
  • C. I. Sánchez;M. Niemeijer;M. D. Abràmoff;B. van Ginneken

  • Affiliations:
  • Department of Radiology, Radboud University Nijmegen Medical Centre, The Netherlands;Image Sciences Institute, University Medical Center Utrecht, The Netherlands;Department of Ophthalmology and Visual Sciences, University of Iowa;Department of Radiology, Radboud University Nijmegen Medical Centre, The Netherlands and Image Sciences Institute, University Medical Center Utrecht, The Netherlands

  • Venue:
  • MICCAI'10 Proceedings of the 13th international conference on Medical image computing and computer-assisted intervention: Part III
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The performance of computer-aided diagnosis (CAD) systems can be highly influenced by the training strategy. CAD systems are traditionally trained using available labeled data, extracted from a specific data distribution or from public databases. Due to the wide variability of medical data, these databases might not be representative enough when the CAD system is applied to data extracted from a different clinical setting, diminishing the performance or requiring more labeled samples in order to get better data generalization. In this work, we propose the incorporation of an active learning approach in the training phase of CAD systems for reducing the number of required training samples while maximizing the system performance. The benefit of this approach has been evaluated using a specific CAD system for Diabetic Retinopathy screening. The results show that 1) using a training set obtained from a different data source results in a considerable reduction of the CAD performance; and 2) using active learning the selected training set can be reduced from 1000 to 200 samples while maintaining an area under the Receiver Operating Characteristic curve of 0.856.