On the generative-discriminative tradeoff approach: Interpretation, asymptotic efficiency and classification performance

Authors:
Jing-Hao Xue;D. Michael Titterington
Affiliations:
Department of Statistics, University of Glasgow, Glasgow G12 8QQ, UK and Department of Statistical Science, University College London, London WC1E 6BT, UK;Department of Statistics, University of Glasgow, Glasgow G12 8QQ, UK
Venue:
Computational Statistics & Data Analysis
Year:
2010

Abstract

The interpretation of generative, discriminative and hybrid approaches to classification is discussed, in particular for the generative-discriminative tradeoff (GDT), a hybrid approach. The asymptotic efficiency of the GDT, relative to that of its generative or discriminative counterpart, is presented theoretically and, by using linear normal discrimination as an example, numerically. On real and simulated datasets, the classification performance of the GDT is compared with those of normal-based linear discriminant analysis (LDA) and linear logistic regression (LLR). Four arguments are made as follows. First, the GDT is a generative model integrating both discriminative and generative learning. It is therefore subject to model misspecification of the data-generating process and hindered by complex optimisation. Secondly, among the three approaches being compared, the asymptotic efficiency of the GDT is higher than that of the discriminative approach but lower than that of the generative approach, when no model misspecification occurs. Thirdly, without model misspecification, LDA performs the best; with model misspecification, LLR or the GDT with an optimal, large weight on its discriminative component may perform the best. Finally, LLR is affected by the imbalance between groups of data.
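
To make the idea of a weight on the discriminative component concrete, the following is a minimal sketch, not the authors' formulation. It assumes a one-dimensional, two-class normal model with a shared standard deviation (the setting of linear normal discrimination) and assumes the tradeoff objective is a convex combination of the joint log-likelihood (generative, as in LDA) and the conditional log-likelihood (discriminative, as in LLR). The function names, the parameter `lam` and the exact weighting are illustrative assumptions; the abstract does not give the objective's form.

```python
import math
import random


def log_joint(x, y, mu, sigma, prior1):
    """log p(x, y) for a two-class normal model with shared sigma.

    mu is a pair (mu0, mu1); prior1 is P(y = 1).
    """
    prior = prior1 if y == 1 else 1.0 - prior1
    z = (x - mu[y]) / sigma
    return (math.log(prior) - 0.5 * z * z
            - math.log(sigma) - 0.5 * math.log(2.0 * math.pi))


def log_conditional(x, y, mu, sigma, prior1):
    """log p(y | x), derived from the same model by Bayes' rule."""
    l0 = log_joint(x, 0, mu, sigma, prior1)
    l1 = log_joint(x, 1, mu, sigma, prior1)
    m = max(l0, l1)  # log-sum-exp for numerical stability
    log_marginal = m + math.log(math.exp(l0 - m) + math.exp(l1 - m))
    return (l1 if y == 1 else l0) - log_marginal


def tradeoff_objective(data, mu, sigma, prior1, lam):
    """Assumed hybrid objective: (1 - lam) * generative + lam * discriminative.

    lam = 0 gives the joint log-likelihood; lam = 1 gives the
    conditional log-likelihood. This weighting is an illustrative
    assumption, not taken from the paper.
    """
    gen = sum(log_joint(x, y, mu, sigma, prior1) for x, y in data)
    disc = sum(log_conditional(x, y, mu, sigma, prior1) for x, y in data)
    return (1.0 - lam) * gen + lam * disc


def simulate(n, mu, sigma, prior1, seed=0):
    """Draw n labelled points (x, y) from the two-class normal model."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        y = 1 if rng.random() < prior1 else 0
        data.append((rng.gauss(mu[y], sigma), y))
    return data
```

For example, `tradeoff_objective(simulate(200, (0.0, 2.0), 1.0, 0.5), (0.0, 2.0), 1.0, 0.5, lam)` evaluates the assumed objective at the true parameters. Maximising it over `mu`, `sigma` and `prior1` for a fixed `lam` would give a hybrid estimator under these assumptions; the optimiser is omitted here.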