Support vector regression from simulation data and few experimental samples

  • Authors:
  • Gérard Bloch;Fabien Lauer;Guillaume Colin;Yann Chamaillard

  • Affiliations:
  • Centre de Recherche en Automatique de Nancy (CRAN), Nancy-University, CNRS, Parc Robert Bentz F-54500 Vanduvre-les-Nancy, France;Centre de Recherche en Automatique de Nancy (CRAN), Nancy-University, CNRS, Parc Robert Bentz F-54500 Vanduvre-les-Nancy, France;Institut Pluridisciplinaire de Recherche en Ingénierie des Systèmes, Mécanique, Energétique (PRISME), University of Orléans, 8, rue Léonard de Vinci, 45072 Orléa ...;Institut Pluridisciplinaire de Recherche en Ingénierie des Systèmes, Mécanique, Energétique (PRISME), University of Orléans, 8, rue Léonard de Vinci, 45072 Orléa ...

  • Venue:
  • Information Sciences: an International Journal
  • Year:
  • 2008

Quantified Score

Hi-index 0.07

Visualization

Abstract

This paper considers nonlinear modeling based on a limited amount of experimental data and a simulator built from prior knowledge. The problem of how to best incorporate the data provided by the simulator, possibly biased, into the learning of the model is addressed. This problem, although particular, is very representative of numerous situations met in engine control, and more generally in engineering, where complex models, more or less accurate, exist and where the experimental data which can be used for calibration are difficult or expensive to obtain. The first proposed method constrains the function to fit to the values given by the simulator with a certain accuracy, allowing to take the bias of the simulator into account. The second method constrains the derivatives of the model to fit to the derivatives of a prior model previously estimated on the simulation data. The combination of these two forms of prior knowledge is also possible and considered. These approaches are implemented in the linear programming support vector regression (LP-SVR) framework by the addition, to the optimization problem, of constraints, which are linear with respect to the parameters. Tests are then performed on an engine control application, namely, the estimation of the in-cylinder residual gas fraction in Spark Ignition (SI) engine with Variable Camshaft Timing (VCT). Promising results are obtained on this application. The experiments have also shown the importance of adding potential support vectors in the model when using Gaussian RBF kernels with very few training samples.