Mix-ratio sampling: Classifying multiclass imbalanced mouse brain images using support vector machine

  • Authors:
  • Min Hyeok Bae;Teresa Wu;Rong Pan

  • Affiliations:
  • Department of Industrial, Systems and Operations Engineering, Arizona State University, Tempe, Arizona 85287-5906, USA;Department of Industrial, Systems and Operations Engineering, Arizona State University, Tempe, Arizona 85287-5906, USA;Department of Industrial, Systems and Operations Engineering, Arizona State University, Tempe, Arizona 85287-5906, USA

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2010

Abstract

Support Vector Machine (SVM) is a classifier designed to optimize classification accuracy, and it has been used in numerous image-related applications. Yet challenges remain when applying SVM to the segmentation of mouse brain images: each high-resolution mouse brain image is a very large data set, and segmenting it is a multiclass classification problem in which class sizes are extremely imbalanced. To address these issues, a mix-ratio sampling approach for SVM is proposed, which determines a separate over-sampling ratio for each minority class. In addition, spatial information is incorporated into the classification problem to improve classification accuracy. Five mouse Magnetic Resonance Microscopy (MRM) images are collected to test the accuracy of classifying 21 brain structures. The first comparison experiment demonstrates that SVM with mix-ratio sampling relieves the multiclass imbalance problem more effectively and efficiently than SVM with simple over-sampling. In the second comparison experiment, another classifier, an Artificial Neural Network (ANN), is trained on the same mix-ratio sampled data, and the results indicate that SVM shows better classification performance than ANN. Third, cross-validation is conducted to demonstrate that SVM with mix-ratio sampling can classify multiclass imbalanced data with high accuracy.
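
The abstract does not say how the per-class over-sampling ratios are chosen, so the following is only a minimal sketch of the general idea of giving each minority class its own ratio. The helper names `mix_ratio_oversample` and `ratios_toward_majority`, the `strength` parameter, the ratio rule itself, and the toy class labels are all assumptions made for illustration; they are not taken from the paper. Over-sampling here is plain random duplication.

```python
import random
from collections import Counter


def mix_ratio_oversample(samples, labels, ratios, seed=0):
    """Over-sample each class by its own ratio via random duplication.

    `ratios` maps a class label to a multiplier >= 1.0; classes missing
    from the mapping are left unchanged. Hypothetical helper, not the
    paper's implementation.
    """
    rng = random.Random(seed)
    by_class = {}
    for x, y in zip(samples, labels):
        by_class.setdefault(y, []).append(x)

    out_samples, out_labels = [], []
    for y, xs in by_class.items():
        # Never shrink a class; only add duplicates up to the target size.
        target = max(len(xs), round(len(xs) * ratios.get(y, 1.0)))
        extra = [rng.choice(xs) for _ in range(target - len(xs))]
        out_samples.extend(xs + extra)
        out_labels.extend([y] * target)
    return out_samples, out_labels


def ratios_toward_majority(labels, strength=0.5):
    """Illustrative ratio rule (an assumption): (n_max / n_class) ** strength.

    strength=0 leaves the data unchanged; strength=1 fully balances
    every class against the largest one.
    """
    counts = Counter(labels)
    n_max = max(counts.values())
    return {y: (n_max / n) ** strength for y, n in counts.items()}


# Toy data: three made-up "structures" with very different sample counts.
labels = ["cortex"] * 900 + ["hippocampus"] * 90 + ["amygdala"] * 10
samples = [[float(i)] for i in range(len(labels))]

ratios = ratios_toward_majority(labels, strength=0.5)
new_samples, new_labels = mix_ratio_oversample(samples, labels, ratios)
print(Counter(new_labels))
```

With a single shared ratio, either the rarest class stays under-represented or the moderately rare classes are inflated far more than needed; separate ratios avoid that trade-off and keep the resampled set smaller than full balancing would. The resampled `new_samples` and `new_labels` could then be passed to any multiclass SVM implementation, which is not shown here.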