Supervised pre-processing approaches in multiple class variables classification for fish recruitment forecasting

  • Authors:
  • Jose A. Fernandes;Jose A. Lozano;Iñaki Inza;Xabier Irigoien;Aritz Pérez;Juan D. Rodríguez

  • Affiliations:
  • AZTI-Tecnalia, Marine Research Division, Herrera Kaia z/g, E-20110 Pasaia (Gipuzkoa), Spain and University of the Basque Country, Department of Computer Science and AI, Intelligent Systems Group ( ...;University of the Basque Country, Department of Computer Science and AI, Intelligent Systems Group (ISG), Paseo Manuel de Lardizabal, 1. E-20018 Donostia - San Sebastián, Spain;University of the Basque Country, Department of Computer Science and AI, Intelligent Systems Group (ISG), Paseo Manuel de Lardizabal, 1. E-20018 Donostia - San Sebastián, Spain;AZTI-Tecnalia, Marine Research Division, Herrera Kaia z/g, E-20110 Pasaia (Gipuzkoa), Spain and King Abdullah University of Science and Technology (KAUST), Chemical and Life Sciences and Engineeri ...;University of the Basque Country, Department of Computer Science and AI, Intelligent Systems Group (ISG), Paseo Manuel de Lardizabal, 1. E-20018 Donostia - San Sebastián, Spain;University of the Basque Country, Department of Computer Science and AI, Intelligent Systems Group (ISG), Paseo Manuel de Lardizabal, 1. E-20018 Donostia - San Sebastián, Spain

  • Venue:
  • Environmental Modelling & Software
  • Year:
  • 2013

Abstract

A multi-species approach to fisheries management must take the interactions between species into account in order to improve recruitment forecasting. Recent advances in Bayesian networks allow learning models in which several interrelated variables are forecast simultaneously. These models are known as multi-dimensional Bayesian network classifiers (MDBNs). Pre-processing steps are critical for the subsequent learning of the model in these kinds of domains. Therefore, in the present study, a set of state-of-the-art uni-dimensional pre-processing methods, within the categories of missing data imputation, feature discretization and feature subset selection, are adapted for use with MDBNs. A framework that combines the proposed multi-dimensional supervised pre-processing methods with an MDBN classifier is tested on synthetic datasets and on the real domain of fish recruitment forecasting. The rate at which all three fish species (anchovy, sardine and hake) are forecast correctly at the same time rises from 17.3% to 29.5% with the multi-dimensional approach compared with mono-species models, a relative gain of roughly 70%. The probability estimates also improve markedly: the average error, measured by the Brier score, falls from 0.35 to 0.27. Finally, these gains exceed those obtained when forecasting the species in pairs.
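
The two evaluation measures mentioned in the abstract can be illustrated with a minimal Python sketch. It assumes that "correct forecasting of three species simultaneously" means an exact match on all class variables for an instance, and that the Brier score is the usual multi-class form (squared difference between predicted probabilities and the one-hot outcome, summed over classes and averaged over instances). The abstract does not state the exact definitions used in the paper, so both are assumptions. The function names `joint_accuracy` and `brier_score` and all data values are hypothetical and serve only as illustration.

```python
from typing import Sequence


def joint_accuracy(y_true: Sequence[Sequence[str]],
                   y_pred: Sequence[Sequence[str]]) -> float:
    """Fraction of instances in which every class variable is predicted correctly.

    Assumption: this exact-match rule is what "simultaneous" correctness means.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    hits = sum(1 for t, p in zip(y_true, y_pred) if tuple(t) == tuple(p))
    return hits / len(y_true)


def brier_score(probs: Sequence[Sequence[float]],
                true_index: Sequence[int]) -> float:
    """Multi-class Brier score, averaged over instances.

    Assumption: standard definition, sum over classes of (p_j - o_j)^2,
    where o_j is 1 for the observed class and 0 otherwise.
    """
    if len(probs) != len(true_index) or len(probs) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for p, k in zip(probs, true_index):
        total += sum((pj - (1.0 if j == k else 0.0)) ** 2
                     for j, pj in enumerate(p))
    return total / len(probs)


# Hypothetical toy labels: (anchovy, sardine, hake) recruitment level per year.
toy_true = [("low", "high", "medium"), ("high", "high", "low"),
            ("medium", "low", "low"), ("low", "low", "high")]
toy_pred = [("low", "high", "medium"), ("high", "low", "low"),
            ("medium", "low", "low"), ("low", "high", "high")]

# Hypothetical predicted class probabilities for one species, two years.
toy_probs = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]
toy_observed = [0, 2]

if __name__ == "__main__":
    print(joint_accuracy(toy_true, toy_pred))
    print(brier_score(toy_probs, toy_observed))
```

Under the exact-match rule a single wrong species makes the whole instance count as an error, which is why joint accuracy over three species is much lower than typical per-species accuracy. A lower Brier score indicates better calibrated probability estimates.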