Comparison of maximum entropy and minimal mutual information in a nonlinear setting
Author: | Elmar Lang, Fabian J. Theis, Ch. Bauer |
Year of publication: | 2002 |
Subject: | Multivariate random variable, Principle of maximum entropy, Mutual information, Independent component analysis, Blind signal separation, Nonlinear system, Control and Systems Engineering, Signal Processing, Entropy (information theory), Applied mathematics, Computer Vision and Pattern Recognition, Electrical and Electronic Engineering, Algorithm, Scaling, Software, Mathematics |
Source: | Signal Processing. 82:971-980 |
ISSN: | 0165-1684 |
DOI: | 10.1016/s0165-1684(02)00200-1 |
Description: | In blind source separation (BSS), two separation techniques are mainly used: minimal mutual information (MMI), where the mutual information of the outputs is minimized to yield an independent random vector, and maximum entropy (ME), where the output entropy is maximized. However, it is still unclear why ME should solve the separation problem, i.e. result in an independent vector. Yang and Amari gave a partial justification of ME in the linear case in [18], where they prove that, under the assumption of sources with vanishing expectation, ME does not change the solutions of MMI except for scaling and permutation. In this paper, we generalize Yang and Amari's approach to nonlinear BSS problems in which the random vectors are mixed by the output functions of layered neural networks. We show that certain solution points of MMI are kept fixed by ME if scaling is disallowed in all layers. In general, however, ME may also change the scaling in the non-output network layers and thus leave the MMI solution points. We therefore conclude by suggesting that in nonlinear ME algorithms, the norms of all weight-matrix rows of each non-output layer be kept fixed during the later epochs of network training (see the sketches after this record). |
Database: | OpenAIRE |
External link: |
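The link between ME and MMI that the abstract refers to rests on two standard information-theoretic identities, reproduced here for orientation; they are textbook facts, not results specific to this paper.

```latex
% Mutual information of the output vector y = (y_1, ..., y_n):
I(\mathbf{y}) = \sum_{i=1}^{n} H(y_i) - H(\mathbf{y})

% Entropy under an invertible, differentiable map y = f(x) with Jacobian J_f:
H(\mathbf{y}) = H(\mathbf{x}) + \mathbb{E}\big[\log \lvert \det J_f(\mathbf{x}) \rvert\big]
```

Substituting the second identity into the first gives I(y) = Σᵢ H(yᵢ) − H(x) − E[log|det J_f(x)|]. Since H(x) is fixed by the data, maximizing the output entropy acts on the same Jacobian term that appears in the mutual information, up to the marginal entropies, which is why ME can coincide with MMI solutions modulo scaling and permutation.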
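The closing recommendation, keeping the row norms of all non-output weight matrices fixed in later epochs, can be made concrete with a minimal sketch. Everything here (the layer shapes, the stand-in random "gradient" step, the epoch threshold) is hypothetical and only illustrates the row-norm constraint; the paper does not prescribe an implementation.

```python
import numpy as np

def renormalize_rows(W, target_norms):
    """Rescale each row of W back to its stored target norm, so that the
    ME update cannot drift away from MMI solution points by rescaling
    hidden (non-output) layers."""
    norms = np.linalg.norm(W, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # avoid division by zero for all-zero rows
    return W * (target_norms / norms)

# --- hypothetical training loop; names and update rule are illustrative ---
rng = np.random.default_rng(0)
layers = [rng.standard_normal((4, 4)) for _ in range(3)]  # 2 hidden + 1 output
frozen_norms = [np.linalg.norm(W, axis=1, keepdims=True) for W in layers]

for epoch in range(100):
    for i, W in enumerate(layers):
        W = W + 0.01 * rng.standard_normal(W.shape)  # stand-in for an ME gradient step
        if epoch >= 50 and i < len(layers) - 1:      # later epochs, non-output layers only
            W = renormalize_rows(W, frozen_norms[i])
        layers[i] = W
```

The design choice mirrors the paper's argument: the output layer stays free so ME can still fix overall scaling, while the hidden layers are constrained precisely where ME would otherwise introduce scaling changes that leave the MMI solution points.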