Comparison of maximum entropy and minimal mutual information in a nonlinear setting

  • Authors:
  • Fabian J. Theis; Ch. Bauer; Elmar W. Lang

  • Affiliations:
  • Institute of Biophysics, AG Neuro- und Bioinformatik, Institut für Biophysik, University of Regensburg, D-93040 Regensburg, Germany (all three authors)

  • Venue:
  • Signal Processing
  • Year:
  • 2002

Abstract

In blind source separation (BSS), two separation techniques are mainly used: minimal mutual information (MMI), where minimization of the mutual output information yields an independent random vector, and maximum entropy (ME), where the output entropy is maximized. However, it is not yet clear why ME should solve the separation problem, i.e. result in an independent random vector. Yang and Amari [18] have given a partial confirmation for ME in the linear case: they prove that, under the assumption of sources with vanishing expectation, ME does not change the solutions of MMI except for scaling and permutation. In this paper, we generalize Yang and Amari's approach to nonlinear BSS problems, where random vectors are mixed by the output functions of layered neural networks. We show that certain solution points of MMI are kept fixed by ME if no scaling is allowed in any layer. In general, however, ME might also change the scaling in the non-output network layers and thereby leave the MMI solution points. We therefore conclude by suggesting that, in nonlinear ME algorithms, the norms of the weight-matrix rows of each non-output layer be kept fixed during later epochs of network training.
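
For context, MMI and ME are linked by the standard decomposition of mutual information: for an output vector $Y = (Y_1, \dots, Y_n)$,

$$I(Y) = \sum_{i=1}^{n} H(Y_i) - H(Y),$$

which is nonnegative and vanishes exactly when the components $Y_i$ are independent. MMI minimizes the left-hand side, whereas ME maximizes only the joint entropy $H(Y)$; the question addressed here is when the stationary points of the two criteria coincide.

To illustrate the closing suggestion, the following NumPy sketch (hypothetical code, not the authors' implementation) pins the row norms of a non-output weight matrix during later epochs; the layer sizes, the `freeze_epoch` threshold, and the random stand-in for an ME gradient step are all assumptions made for demonstration.

```python
import numpy as np

def fix_row_norms(W, target_norms):
    """Rescale each row of W to a prescribed norm (illustrative helper).

    This removes the per-row scaling freedom that, according to the paper,
    lets ME drift away from the MMI solution points in non-output layers.
    """
    norms = np.linalg.norm(W, axis=1, keepdims=True)
    return W * (target_norms / np.maximum(norms, 1e-12))

# Hypothetical two-layer demo: after `freeze_epoch`, the row norms of the
# non-output (hidden) layer are recorded and re-imposed after every update,
# while the output layer is left unconstrained.
rng = np.random.default_rng(0)
W_hidden = rng.standard_normal((4, 4))  # non-output layer
W_output = rng.standard_normal((4, 4))  # output layer (norms left free)
freeze_epoch, frozen_norms = 80, None
for epoch in range(100):
    # Stand-in updates; a real ME algorithm would use entropy gradients here.
    W_hidden -= 0.01 * rng.standard_normal(W_hidden.shape)
    W_output -= 0.01 * rng.standard_normal(W_output.shape)
    if epoch == freeze_epoch:
        frozen_norms = np.linalg.norm(W_hidden, axis=1, keepdims=True)
    if frozen_norms is not None:
        W_hidden = fix_row_norms(W_hidden, frozen_norms)
```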