Compressed Fisher linear discriminant analysis: classification of randomly projected data

  • Authors:
  • Robert J. Durrant; Ata Kabán

  • Affiliations:
  • University of Birmingham, Edgbaston, United Kingdom (both authors)

  • Venue:
  • Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
  • Year:
  • 2010

Abstract

We consider random projections in conjunction with classification, specifically the analysis of Fisher's Linear Discriminant (FLD) classifier in randomly projected data spaces. Unlike previous analyses of other classifiers in this setting, we avoid the unnatural effects that arise when one insists that all pairwise distances are approximately preserved under projection. We impose no sparsity or underlying low-dimensional structure constraints on the data; instead, we take advantage of the class structure inherent in the problem. We obtain a reasonably tight upper bound on the estimated misclassification error, on average over the random choice of the projection, which, in contrast to earlier distance-preserving approaches, tightens in a natural way as the number of training examples increases. It follows that, for good generalisation of FLD, the required projection dimension grows logarithmically with the number of classes. We also show that the error contribution of a covariance misspecification is always no worse in the low-dimensional projected space than in the original high-dimensional space. We contrast our findings with previous related work and discuss our insights.
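
To make the projected-FLD setting concrete, the following is a minimal sketch, not the authors' implementation: it projects high-dimensional multi-class data with a Gaussian random matrix and then fits Fisher's Linear Discriminant in the projected space, using scikit-learn. The synthetic dataset, the projected dimension k = 20, and the train/test split are illustrative assumptions, not the paper's experimental setup.

# Minimal sketch: FLD on randomly projected data (illustrative assumptions only).
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.random_projection import GaussianRandomProjection
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

# Synthetic high-dimensional multi-class data: d = 500 features, 5 classes.
X, y = make_classification(n_samples=2000, n_features=500, n_informative=50,
                           n_classes=5, n_clusters_per_class=1, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

# Gaussian random projection R: d -> k, with k chosen small relative to d
# (the paper's bound suggests k need only grow with the log of the number of classes).
k = 20
R = GaussianRandomProjection(n_components=k, random_state=0)
X_tr_proj = R.fit_transform(X_tr)
X_te_proj = R.transform(X_te)

# Fit FLD entirely in the k-dimensional projected space and report test error.
fld = LinearDiscriminantAnalysis()
fld.fit(X_tr_proj, y_tr)
err = 1.0 - fld.score(X_te_proj, y_te)
print(f"Projected dimension k={k}, test error = {err:.3f}")

Sweeping k over a range while holding the data fixed, and averaging over several random projection matrices, gives an empirical feel for how quickly the projected-space error approaches that of FLD trained in the full d-dimensional space.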