Dynamics of learning near singularities in layered networks

  • Authors:
  • Haikun Wei, Jun Zhang, Florent Cousseau, Tomoko Ozeki, and Shun-ichi Amari

  • Affiliations:
  • Haikun Wei: RIKEN Brain Science Institute, Saitama 351-0198, Japan; Southeast University, Nanjing 210096, China; Kyushu Institute of Technology, Kitakyushu 808-0196, Japan. weihaikun@brain.riken.jp
  • Jun Zhang: RIKEN Brain Science Institute, Saitama 351-0198, Japan; University of Michigan, Ann Arbor, MI 48109, U.S.A. junz@umich.edu
  • Florent Cousseau: RIKEN Brain Science Institute, Saitama 351-0198, Japan; University of Tokyo, Chiba 277-8561, Japan. florent@mns.k.u-tokyo.ac.jp
  • Tomoko Ozeki: RIKEN Brain Science Institute, Saitama 351-0198, Japan; Tokai University, Kanagawa 259-1292, Japan. tozeki@tokai.ac.jp
  • Shun-ichi Amari: RIKEN Brain Science Institute, Saitama 351-0198, Japan. amari@brain.riken.jp

  • Venue:
  • Neural Computation
  • Year:
  • 2008


Abstract

We explicitly analyze the trajectories of learning near singularities in hierarchical networks, such as multilayer perceptrons and radial basis function networks, which include permutation symmetry of hidden nodes, and show their general properties. Such symmetry induces singularities in the parameter space, where the Fisher information matrix degenerates and odd learning behaviors, especially the existence of plateaus in gradient descent learning, arise due to the geometric structure of the singularity. We plot dynamic vector fields to demonstrate the universal trajectories of learning near singularities. The singularity induces two types of plateaus, the on-singularity plateau and the near-singularity plateau, depending on the stability of the singularity and the initial parameters of learning. The results presented in this letter are universally applicable to a wide class of hierarchical models. Detailed stability analyses of the dynamics of learning in radial basis function networks and multilayer perceptrons will be presented in separate work.
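The plateau phenomenon described in the abstract can be illustrated numerically. The following is a minimal sketch, not the paper's own experiment: a 1-2-1 tanh perceptron is trained by batch gradient descent on a teacher that requires two distinct hidden units, with the student's two units initialized almost identically. This places the initial parameters near the overlap singularity (where two hidden units coincide, a point on the permutation-symmetric singular subspace), and the trajectory typically lingers there before the units differentiate. All weight values, the teacher function, and the learning rate are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative setup (not from the paper): student network
#   y(x) = v1*tanh(w1*x) + v2*tanh(w2*x),
# trained by batch gradient descent on a teacher that genuinely
# needs two distinct hidden units.
X = rng.uniform(-3.0, 3.0, size=(400, 1))
Y = np.tanh(2.0 * X[:, 0]) + np.tanh(0.5 * X[:, 0])  # two-unit teacher

# Nearly identical hidden units: the initial point lies close to the
# overlap singularity w1 = w2, where the Fisher information degenerates.
w = np.array([0.60, 0.61])   # hidden weights
v = np.array([0.50, 0.51])   # output weights
eta = 0.05                   # learning rate (assumed)

losses = []
for _ in range(30000):
    H = np.tanh(X * w)                  # (400, 2) hidden activations
    e = H @ v - Y                       # (400,) residuals
    losses.append(0.5 * np.mean(e**2))
    grad_v = H.T @ e / len(e)           # dL/dv
    grad_w = np.mean(e[:, None] * v * (1.0 - H**2) * X, axis=0)  # dL/dw
    v -= eta * grad_v
    w -= eta * grad_w

# Inspecting the loss curve typically shows an early drop, a long flat
# stretch while the two units stay nearly coincident, then a further
# decrease once they split apart.
print(f"initial loss {losses[0]:.4f}, final loss {losses[-1]:.4f}")
```

Plotting `losses` against the iteration index makes the plateau visible directly; the flat stretch corresponds to the trajectory moving slowly along the singular subspace before escaping it.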