Error Entropy in Classification Problems: A Univariate Data Analysis

  • Authors:
  • Luís M. Silva, Carlos S. Felgueiras, Luís A. Alexandre, J. Marques de Sá

  • Affiliations:
  • lmsilva@fe.up.pt
  • casf@fe.up.pt, Instituto de Engenharia Biomédica, Laboratório Sinal e Imagem Biomédica, 4200-465 Porto, Portugal
  • lfbaa@di.ubi.pt, Departamento de Informática, Universidade da Beira Interior, and Instituto de Telecomunicações, Networks and Multimedia Group, Covilhã, Portugal
  • jmsa@fe.up.pt, Instituto de Engenharia Biomédica, Laboratório Sinal e Imagem Biomédica, and Fac. de Eng. da Univ. do Porto, Dep. de Engenharia Electrotécnica e Computadores, ...

  • Venue:
  • Neural Computation
  • Year:
  • 2006

Abstract

Entropy-based cost functions are enjoying growing popularity in unsupervised and supervised classification tasks, with better performance reported in terms of both error rate and convergence speed. In this letter, we study the principle of error entropy minimization (EEM) from a theoretical point of view. We use Shannon's entropy and study univariate data splitting in two-class problems. In this setting, the error variable is a discrete random variable, which keeps the mathematical analysis of the error entropy tractable. We first show that for uniformly distributed data, the EEM split is equivalent to the optimal classifier. In a more general setting, we prove necessary conditions for this equivalence and show that class configurations exist in which the optimal classifier corresponds to maximum error entropy. These theoretical results yield practical guidelines, which we illustrate with a set of experiments on both real and simulated data sets, comparing the effectiveness of EEM with the usual minimization of the mean square error.
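For concreteness, the following is a minimal Python sketch of the quantity the letter studies, assuming a threshold classifier on univariate data with classes coded as -1 and +1 (the function names and the Gaussian toy data are illustrative assumptions, not the authors' setup). The error E = T - Y is then a discrete variable on {-2, 0, +2}, and the EEM split is the threshold that minimizes its Shannon entropy H(E):

import numpy as np

def error_probabilities(x, y, theta):
    """Empirical distribution of the error E = t - yhat for the rule
    'predict -1 if x < theta, else +1', with labels t in {-1, +1}.
    E = -2: a class -1 point classified as +1;
    E = +2: a class +1 point classified as -1;
    E =  0: a correct classification."""
    yhat = np.where(x < theta, -1, 1)
    e = y - yhat
    p_neg = np.mean(e == -2)
    p_pos = np.mean(e == 2)
    return p_neg, 1.0 - p_neg - p_pos, p_pos

def shannon_error_entropy(x, y, theta):
    """Shannon entropy H(E) of the discrete error variable."""
    return -sum(p * np.log(p)
                for p in error_probabilities(x, y, theta) if p > 0)

# Toy two-class problem: two overlapping Gaussians (illustrative data).
rng = np.random.default_rng(0)
x = np.concatenate([rng.normal(-1.0, 1.0, 500), rng.normal(1.0, 1.0, 500)])
y = np.concatenate([-np.ones(500), np.ones(500)])

# Scan candidate splits; compare the EEM split with the
# minimum-error (optimal empirical) split.
thetas = np.linspace(x.min(), x.max(), 400)
entropies = [shannon_error_entropy(x, y, t) for t in thetas]
error_rates = [np.mean(np.where(x < t, -1, 1) != y) for t in thetas]

print(f"EEM split:       theta = {thetas[np.argmin(entropies)]:.3f}")
print(f"Min-error split: theta = {thetas[np.argmin(error_rates)]:.3f}")

For these symmetric Gaussians, both criteria recover a split near zero. The letter's point is that this agreement is provable for uniformly distributed classes but holds only under certain conditions in general; for some class configurations the optimal split can even maximize H(E).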