On classification with empirically observed statistics and universal data compression

Authors:
J. Ziv
Affiliations:
Dept. of Electr. Eng., Technion-Israel Inst. of Technol., Haifa
Venue:
IEEE Transactions on Information Theory
Year:
2006

Abstract

Classification with empirically observed statistics is studied for finite-alphabet sources. Efficient universal discriminant functions are described and shown to be related to universal data compression. It is demonstrated that if one of the probability measures of the two classes is not known, it is still possible to define a universal discriminant function that performs as well as the optimal (likelihood ratio) discriminant function, which can be evaluated only if the probability measures of both classes are available. If neither probability measure is available but training vectors from at least one of the two classes are available, it is demonstrated that no discriminant function can perform efficiently if the length of the training vectors does not grow at least linearly with the length of the classified vector. A universal discriminant function is introduced and shown to perform efficiently when the length of the training vectors grows linearly with the length of the classified sequence, in the sense that it yields an error exponent arbitrarily close to that of the optimal discriminant function.
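
To make the link between discrimination and compression concrete, the following is a minimal sketch and not the construction from the paper. It assumes memoryless (i.i.d.) sources, a known class-1 measure `p1`, and an unknown class-2 measure. The empirical entropy of the observed sequence stands in for the per-symbol length of a universal code, so the statistic reduces to the empirical divergence from `p1` (a Hoeffding-style test). The function names and the `threshold` parameter are hypothetical and chosen for illustration only.

```python
import math
from collections import Counter


def empirical_entropy(seq):
    """Zeroth-order empirical entropy of seq, in bits per symbol.

    Used here as a stand-in (assumption) for the normalized length
    of a universal code applied to seq.
    """
    if len(seq) == 0:
        raise ValueError("sequence must be non-empty")
    n = len(seq)
    counts = Counter(seq)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def known_code_length(seq, p1):
    """Ideal code length per symbol, in bits, under the known measure p1.

    p1 maps each symbol to its probability under class 1 (memoryless
    model, an assumption of this sketch).
    """
    if len(seq) == 0:
        raise ValueError("sequence must be non-empty")
    total = 0.0
    for symbol in seq:
        q = p1.get(symbol, 0.0)
        if q == 0.0:
            return math.inf  # symbol impossible under class 1
        total -= math.log2(q)
    return total / len(seq)


def discriminant(seq, p1, threshold):
    """Return 1 if seq is accepted as class 1, otherwise 2.

    The gap between the class-1 code length and the empirical entropy
    equals the empirical divergence D(p_hat || p1) for memoryless sources.
    """
    gap = known_code_length(seq, p1) - empirical_entropy(seq)
    return 1 if gap <= threshold else 2
```

For example, with `p1 = {"a": 0.5, "b": 0.5}` and `threshold = 0.1`, the balanced sequence `"abababab"` has zero gap and is accepted as class 1, while `"aaaaaaaa"` has a gap of one bit per symbol and is assigned to class 2. The threshold trades off the two error probabilities; how to set it is outside the scope of this sketch.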