Evaluation and classification of otoneurological data with new data analysis methods based on machine learning

  • Authors:
  • Markku Siermala;Martti Juhola;Jorma Laurikkala;Kati Iltanen;Erna Kentala;Ilmari Pyykkö

  • Affiliations:
  • Institute of Medical Technology, 33014 University of Tampere, Tampere, Finland;Department of Computer Sciences, 33014 University of Tampere, Tampere, Finland;Department of Computer Sciences, 33014 University of Tampere, Tampere, Finland;Department of Computer Sciences, 33014 University of Tampere, Tampere, Finland;Department of Otorhinolaryngology, Helsinki University Central Hospital, Helsinki, Finland;Department of Otorhinolaryngology, Tampere University Hospital, Tampere, Finland

  • Venue:
  • Information Sciences: an International Journal
  • Year:
  • 2007

Quantified Score

Hi-index 0.07

Visualization

Abstract

We improved the classification ability of multilayer perceptron networks by constructing a set of networks of as many as output classes and investigated the influence of different input variables on the classification. We have developed methods named scattering, spectrum and response analysis to express the classification complexity, especially the overlap of output classes, to disentangle the relation between the input variables and output classes of perceptron neural networks, and to establish the importance of input variables. The methods were tested by exploring complicated otoneurological data. In contrast to the variable selection problem, our methods characterize the importance of variables for classification and also describe the importance of the different values of each variable for output (disease) classes. When complex data is distributed in a biased manner between disease classes, we improved classification accuracy by developing a network set called NetSet, which increased average sensitivity and positive predictive value for at least 10% up to 85% and 83% respectively, compared to our earlier neural network classifications with the same data, which clarified class distribution effects and supported our comprehension of the significance of input.