Bayesian nonlinear regression for large p small n problems

  • Authors:
  • Sounak Chakraborty;Malay Ghosh;Bani K. Mallick

  • Affiliations:
  • Department of Statistics, University of Missouri-Columbia, 209F Middlebush Hall, Columbia, MO 65211, USA;Department of Statistics, University of Florida, 103 Griffin/Floyd Hall, Gainesville, FL 32611-8545, USA;Department of Statistics, Texas A & M University, 415D Blocker Building, College Station, TX 77843-3143, USA

  • Venue:
  • Journal of Multivariate Analysis
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Statistical modeling and inference problems with sample sizes substantially smaller than the number of available covariates are challenging. This is known as large p small n problem. Furthermore, the problem is more complicated when we have multiple correlated responses. We develop multivariate nonlinear regression models in this setup for accurate prediction. In this paper, we introduce a full Bayesian support vector regression model with Vapnik's @e-insensitive loss function, based on reproducing kernel Hilbert spaces (RKHS) under the multivariate correlated response setup. This provides a full probabilistic description of support vector machine (SVM) rather than an algorithm for fitting purposes. We have also introduced a multivariate version of the relevance vector machine (RVM). Instead of the original treatment of the RVM relying on the use of type II maximum likelihood estimates of the hyper-parameters, we put a prior on the hyper-parameters and use Markov chain Monte Carlo technique for computation. We have also proposed an empirical Bayes method for our RVM and SVM. Our methods are illustrated with a prediction problem in the near-infrared (NIR) spectroscopy. A simulation study is also undertaken to check the prediction accuracy of our models.