Second Order Derivatives for Network Pruning: Optimal Brain Surgeon
Advances in Neural Information Processing Systems 5 (NIPS Conference)
Acceleration Techniques for the Backpropagation Algorithm
Proceedings of the EURASIP Workshop 1990 on Neural Networks
A Linear Non-Gaussian Acyclic Model for Causal Discovery
The Journal of Machine Learning Research
On Model Selection Consistency of Lasso
The Journal of Machine Learning Research
IDEAL'06 Proceedings of the 7th International Conference on Intelligent Data Engineering and Automated Learning
Blind separation of mixture of independent sources through a quasi-maximum likelihood approach
IEEE Transactions on Signal Processing
Causality Discovery with Additive Disturbances: An Information-Theoretical Perspective
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Estimation of a Structural Vector Autoregression Model Using Non-Gaussianity
The Journal of Machine Learning Research
A Bayesian inverse solution using independent component analysis
Neural Networks
When applying independent component analysis (ICA), we sometimes expect the connections between the observed mixtures and the recovered independent components (or the original sources) to be sparse, either to ease interpretation or to reduce random effects in the results. In this paper we propose two methods to tackle this problem. One is based on adaptive Lasso, which exploits the L1 penalty with data-adaptive weights. We show the relationship between this method and classic information criteria such as BIC and AIC. The other is based on optimal brain surgeon, and we show how its stopping criterion is related to the information criteria. This method produces the solution path of the transformation matrix, with different numbers of zero entries. Both methods involve low computational loads. Moreover, in each method the parameter controlling the sparsity level of the transformation matrix has a clear interpretation. By setting these parameters to certain values, the results of the proposed methods are consistent with those produced by the classic information criteria.
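The adaptive-Lasso approach lends itself to a compact illustration. The Python/NumPy sketch below is not the authors' implementation: the function names fit_sparse_ica and soft_threshold, the log-cosh source model, and the proximal-gradient optimizer are assumptions made here for illustration. It starts from an unpenalized ICA estimate of the demixing matrix, builds data-adaptive weights 1/|w|^gamma from it, and re-estimates the matrix under the weighted L1 penalty, so that a single parameter lam controls the sparsity level in the sense discussed in the abstract.

import numpy as np

def soft_threshold(W, thresh):
    # Entry-wise soft-thresholding: the proximal operator of a weighted L1 penalty.
    return np.sign(W) * np.maximum(np.abs(W) - thresh, 0.0)

def neg_loglik_grad(W, X):
    # Gradient of the negative ICA log-likelihood under a log-cosh source model
    # (an assumption of this sketch): -W^{-T} + E[tanh(W X) X^T].
    n = X.shape[1]
    return -np.linalg.inv(W).T + np.tanh(W @ X) @ X.T / n

def fit_sparse_ica(X, W_init, lam=0.05, gamma=1.0, step=0.1, n_iter=500):
    # X      : (d, n) whitened observations
    # W_init : unpenalized ICA estimate of the demixing matrix, used only to
    #          build the data-adaptive weights
    # lam    : overall sparsity level (the interpretable tuning parameter)
    # gamma  : exponent of the adaptive weights
    weights = 1.0 / (np.abs(W_init) ** gamma + 1e-12)   # data-adaptive weights
    W = W_init.copy()
    for _ in range(n_iter):
        W = W - step * neg_loglik_grad(W, X)            # gradient step on -log-likelihood
        W = soft_threshold(W, step * lam * weights)     # proximal step on the weighted L1 term
        # (lam is assumed small enough that W stays invertible along the path)
    return W

Setting lam = 0 recovers plain maximum-likelihood ICA; the abstract's claim is that particular choices of this sparsity parameter make the selected zero pattern of the transformation matrix agree with what BIC- or AIC-type criteria would choose.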