Ensemble generation and feature selection for the identification of students with learning disabilities

  • Authors:
  • Loris Nanni;Alessandra Lumini

  • Affiliations:
  • DEIS, IEIIT - CNR, Universití di Bologna, Viale Risorgimento 2, 40136 Bologna, Italy;DEIS, IEIIT - CNR, Universití di Bologna, Viale Risorgimento 2, 40136 Bologna, Italy

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2009

Quantified Score

Hi-index 12.05

Visualization

Abstract

In this paper, we have made an extensive study of artificial intelligence (AI) techniques like ensemble of classifiers and feature selection for the identification of students with learning disabilities. The experimental results show that our best method, which combines both ensemble of classifiers and feature selection, can correctly identify up to 50% of the learning disabilities (LD) students with 100% confidence. Also when predicting samples in ''junior high school'' using model built on the ''elementary school'' students and when the ''junior high school'' samples are used to build the model we predict the samples in the ''elementary school'' dataset. In particular, we propose variants of two recent Feature Transform-based ensemble methods (Rotation Forest and Input Decimated Ensemble). In the Rotation Forest, the feature set is randomly split into subsets and Principal Component Analysis (PCA) is used to transform the features that belong to a subset. The Input Decimated Ensemble first singles out a given class i and runs PCA on this data only. This transformation is applied to the whole dataset and a classifier D"i is trained using these transformed patterns. This choice limits the size of the ensemble to the number of classes. In this paper, we perform an empirical comparison varying the Feature Transform method used in the Rotation Forest technique and we propose a clustering method to overcome the drawback of the Input Decimated Ensemble.