Definition of Valid Proteomic Biomarkers: A Bayesian Solution

Authors:
Keith Harris;Mark Girolami;Harald Mischak
Affiliations:
Inference Group, Department of Computing Science, University of Glasgow, UK;Inference Group, Department of Computing Science, University of Glasgow, UK;Mosaiques Diagnostics and Therapeutics AG, Hannover, Germany
Venue:
PRIB '09 Proceedings of the 4th IAPR International Conference on Pattern Recognition in Bioinformatics
Year:
2009

Citing 2
Cited 0

Gene selection using a two-level hierarchical Bayesian model

Bioinformatics
An empirical analysis of the probabilistic K-nearest neighbour classifier

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

Clinical proteomics is suffering from high hopes generated by reports on apparent biomarkers, most of which could not be later substantiated via validation. This has brought into focus the need for improved methods of finding a panel of clearly defined biomarkers. To examine this problem, urinary proteome data was collected from healthy adult males and females, and analysed to find biomarkers that differentiated between genders. We believe that models that incorporate sparsity in terms of variables are desirable for biomarker selection, as proteomics data typically contains a huge number of variables (peptides) and few samples making the selection process potentially unstable. This suggests the application of a two-level hierarchical Bayesian probit regression model for variable selection which assumes a prior that favours sparseness. The classification performance of this method is shown to improve that of the Probabilistic K-Nearest Neighbour model.