Dissimilarity-based classification in the absence of local ground truth: Application to the diagnostic interpretation of chest radiographs

  • Authors:
  • Y. Arzhaeva;D. M. J. Tax;B. van Ginneken

  • Affiliations:
  • Image Sciences Institute, University Medical Center Utrecht, Heidelberglaan 100 Q.S.4.300, 3584 CX Utrecht, the Netherlands;Information and Communication Theory Group, Delft University of Technology, Mekelweg 4, 2628 CD Delft, the Netherlands;Image Sciences Institute, University Medical Center Utrecht, Heidelberglaan 100 Q.S.4.300, 3584 CX Utrecht, the Netherlands

  • Venue:
  • Pattern Recognition
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper classification on dissimilarity representations is applied to medical imaging data with the task of discrimination between normal images and images with signs of disease. We show that dissimilarity-based classification is a beneficial approach in dealing with weakly labeled data, i.e. when the location of disease in an image is unknown and therefore local feature-based classifiers cannot be trained. A modification to the standard dissimilarity-based approach is proposed that makes a dissimilarity measure multi-valued, hence, able to retain more information. A multi-valued dissimilarity between an image and a prototype becomes an image representation vector in classification. Several classification outputs with respect to different prototypes are further integrated into a final image decision. Both standard and proposed methods are evaluated on data sets of chest radiographs with textural abnormalities and compared to several feature-based region classification approaches applied to the same data. On a tuberculosis data set the multi-valued dissimilarity-based classification performs as well as the best region classification method applied to the fully labeled data, with an area under the receiver operating characteristic (ROC) curve (A"z) of 0.82. The standard dissimilarity-based classification yields A"z=0.80. On a data set with interstitial abnormalities both dissimilarity-based approaches achieve A"z=0.98 which is closely behind the best region classification method.