Intelligible support vector machines for diagnosis of diabetes mellitus

  • Authors:
  • Nahla H. Barakat;Andrew P. Bradley;Mohamed Nabil H. Barakat

  • Affiliations:
  • Department of Applied Information Technology, German University of Technology in Oman, Muscat, Oman;School of Information Technology and Electrical Engineering, University of Queensland, Brisbane, QLD, Australia;Department of the Noncommunicable Diseases Surveillance and Control, Ministry of Health, Muscat, Oman

  • Venue:
  • IEEE Transactions on Information Technology in Biomedicine
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Diabetes mellitus is a chronic disease and a major public health challenge worldwide. According to the International Diabetes Federation, there are currently 246 million diabetic people worldwide, and this number is expected to rise to 380 million by 2025. Furthermore, 3.8 million deaths are attributable to diabetes complications each year. It has been shown that 80% of type 2 diabetes complications can be prevented or delayed by early identification of people at risk. In this context, several data mining and machine learning methods have been used for the diagnosis, prognosis, and management of diabetes. In this paper, we propose utilizing support vector machines (SVMs) for the diagnosis of diabetes. In particular, we use an additional explanation module, which turns the "black box" model of an SVM into an intelligible representation of the SVM's diagnostic (classification) decision. Results on a real-life diabetes dataset show that intelligible SVMs provide a promising tool for the prediction of diabetes, where a comprehensible ruleset have been generated, with prediction accuracy of 94%, sensitivity of 93%, and specificity of 94%. Furthermore, the extracted rules are medically sound and agree with the outcome of relevant medical studies.