Training restricted Boltzmann machines: An introduction

Authors:
Asja Fischer;Christian Igel
Affiliations:
-;-
Venue:
Pattern Recognition
Year:
2014

Abstract

Restricted Boltzmann machines (RBMs) are probabilistic graphical models that can be interpreted as stochastic neural networks. They have attracted much attention as building blocks for the multi-layer learning systems called deep belief networks, and variants and extensions of RBMs have found application in a wide range of pattern recognition tasks. This tutorial introduces RBMs from the viewpoint of Markov random fields, starting with the required concepts of undirected graphical models. Different learning algorithms for RBMs, including contrastive divergence learning and parallel tempering, are discussed. Because sampling from RBMs, and therefore most of their learning algorithms, is based on Markov chain Monte Carlo (MCMC) methods, an introduction to Markov chains and MCMC techniques is provided. Experiments demonstrate relevant aspects of RBM training.
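
To make the contrastive divergence idea mentioned in the abstract more concrete, the following is a minimal sketch of a single CD-1 update for a binary RBM, using one step of block Gibbs sampling. It is not taken from the paper: the function names (`hidden_probs`, `visible_probs`, `cd1_update`), the toy data, the learning rate, and the number of epochs are all illustrative assumptions, and the code uses plain Python lists rather than an optimized library.

```python
import math
import random


def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def hidden_probs(v, W, c):
    # p(h_j = 1 | v) = sigmoid(c_j + sum_i v_i * W[i][j])
    return [sigmoid(c[j] + sum(v[i] * W[i][j] for i in range(len(v))))
            for j in range(len(c))]


def visible_probs(h, W, b):
    # p(v_i = 1 | h) = sigmoid(b_i + sum_j W[i][j] * h_j)
    return [sigmoid(b[i] + sum(W[i][j] * h[j] for j in range(len(h))))
            for i in range(len(b))]


def sample(probs, rng):
    # Draw independent Bernoulli samples from the given probabilities.
    return [1 if rng.random() < p else 0 for p in probs]


def cd1_update(v0, W, b, c, lr, rng):
    # Positive phase: hidden probabilities given the training example.
    ph0 = hidden_probs(v0, W, c)
    h0 = sample(ph0, rng)
    # One block Gibbs step: sample a reconstruction, then re-infer hiddens.
    v1 = sample(visible_probs(h0, W, b), rng)
    ph1 = hidden_probs(v1, W, c)
    # Approximate gradient: data statistics minus reconstruction statistics.
    for i in range(len(b)):
        for j in range(len(c)):
            W[i][j] += lr * (v0[i] * ph0[j] - v1[i] * ph1[j])
        b[i] += lr * (v0[i] - v1[i])
    for j in range(len(c)):
        c[j] += lr * (ph0[j] - ph1[j])
    return v1


# Hypothetical toy setup: 4 visible units, 2 hidden units, two patterns.
rng = random.Random(0)
n_visible, n_hidden = 4, 2
W = [[rng.gauss(0.0, 0.01) for _ in range(n_hidden)] for _ in range(n_visible)]
b = [0.0] * n_visible
c = [0.0] * n_hidden
data = [[1, 1, 0, 0], [0, 0, 1, 1]]

for epoch in range(100):
    for v in data:
        cd1_update(v, W, b, c, lr=0.1, rng=rng)
```

The sketch runs the Gibbs chain for only one step starting from the training example, which is what distinguishes CD-1 from sampling the model distribution to convergence. Running the chain longer, or keeping a persistent chain across updates, would be variations on the same loop.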