Approximate k-NN delta test minimization method using genetic algorithms: Application to time series

  • Authors:
  • Fernando Mateo;Dušan Sovilj;Rafael Gadea

  • Affiliations:
  • Institute of Applications of Information Technologies and Advanced Communications, Universidad Politécnica de Valencia, Valencia, Spain;Laboratory of Information and Computer Science, Helsinki University of Technology, Espoo, Finland;Institute of Applications of Information Technologies and Advanced Communications, Universidad Politécnica de Valencia, Valencia, Spain

  • Venue:
  • Neurocomputing
  • Year:
  • 2010

Abstract

In many real-world problems, irrelevant input variables (features) degrade the predictive quality of the models used to estimate the output variables. Time series prediction in particular often involves building large regressors of artificial variables, which can contain irrelevant or misleading information. Many techniques have been proposed for accurate variable selection, including both local and global search strategies. This paper presents a method based on genetic algorithms that aims to find a globally optimal set of input variables, that is, the set that minimizes the Delta Test criterion. Execution speed is improved by replacing the exact nearest neighbor computation with an approximate one. The problems of scaling and projection of variables are also addressed. The developed method works in conjunction with MATLAB's Genetic Algorithm and Direct Search Toolbox. The proposed methodology has been evaluated on several popular time series examples and has also been extended to non-time-series datasets.
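To make the selection criterion concrete, the following minimal Python sketch computes the Delta Test in its standard form: half the mean squared difference between each output and the output of its nearest neighbor in input space. It is an illustration only, not the authors' MATLAB implementation. The function name `delta_test`, the binary `mask` argument, and the synthetic data are assumptions introduced here, and the nearest neighbor search is exact and brute-force rather than the approximate search the paper uses.

```python
import numpy as np


def delta_test(X, y, mask=None):
    """Delta Test estimate for inputs X (N x d) and outputs y (N,).

    `mask` is a hypothetical 0/1 vector selecting which input variables
    to keep, as a variable-selection search might propose.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    if mask is not None:
        X = X[:, np.asarray(mask, dtype=bool)]

    # Exact pairwise squared Euclidean distances (O(N^2 d) memory;
    # fine for a small illustration, not for large datasets).
    diff = X[:, None, :] - X[None, :, :]
    d2 = np.sum(diff ** 2, axis=2)

    # A point must not be selected as its own nearest neighbor.
    np.fill_diagonal(d2, np.inf)
    nn = np.argmin(d2, axis=1)

    # Half the mean squared difference between each output and the
    # output of its nearest neighbor in the selected input space.
    return 0.5 * np.mean((y - y[nn]) ** 2)


# Synthetic example (assumed data): only the first of three variables
# carries information about the output.
rng = np.random.default_rng(0)
X_demo = rng.uniform(size=(300, 3))
y_demo = np.sin(2 * np.pi * X_demo[:, 0]) + 0.1 * rng.normal(size=300)

delta_relevant = delta_test(X_demo, y_demo, mask=[1, 0, 0])
delta_irrelevant = delta_test(X_demo, y_demo, mask=[0, 1, 1])
print(delta_relevant, delta_irrelevant)
```

In this sketch, a mask that keeps the informative variable gives a much lower Delta Test value than a mask that keeps only the irrelevant ones. A search procedure such as a genetic algorithm could use this value as the fitness of each candidate mask.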