Comparative study of stochastic algorithms for system optimization based on gradient approximations

Authors:
D. C. Chin
Affiliations:
Appl. Phys. Lab., Johns Hopkins Univ., Laurel, MD
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Year:
1997

Citing 0
Cited 8

Stochastic optimization and the simultaneous perturbation method

Proceedings of the 31st conference on Winter simulation: Simulation---a bridge to the future - Volume 1
Global random optimization by simultaneous perturbation stochastic approximation

Proceedings of the 33nd conference on Winter simulation
Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization

ACM Transactions on Modeling and Computer Simulation (TOMACS)
An optimization-based approach to special-events traffic signal timing control

Control and Intelligent Systems
Stochastic linear programming to optimize some stochastic systems

ICS'06 Proceedings of the 10th WSEAS international conference on Systems
Methods of multiextremal optimization under constraints for separably quasimonotone functions

Journal of Computer and Systems Sciences International
Distributed model-free stochastic optimization in wireless sensor networks

DCOSS'06 Proceedings of the Second IEEE international conference on Distributed Computing in Sensor Systems
Adaptive Smoothed Functional Algorithms for Optimal Staffing Levels in Service Systems

Service Science

Quantified Score

Hi-index	0.00

Visualization

Abstract

Stochastic approximation (SA) algorithms can be used in system optimization problems for which only noisy measurements of the system are available and the gradient of the loss function is not. This type of problem can be found in adaptive control, neural network training, experimental design, stochastic optimization, and many other areas. This paper studies three types of SA algorithms in a multivariate Kiefer-Wolfowitz setting, which uses only noisy measurements of the loss function (i.e., no loss function gradient measurements). The algorithms considered are: the standard finite-difference SA (FDSA) and two accelerated algorithms, the random directions SA (RDSA) and the simultaneous-perturbation SA (SPSA). RDSA and SPSA use randomized gradient approximations based on (generally) far fewer function measurements than FDSA in each Iteration. This paper describes the asymptotic error distribution for a class of RDSA algorithms, and compares the RDSA, SPSA, and FDSA algorithms theoretically (using mean-square errors computed from asymptotic distributions) and numerically. Based on the theoretical and numerical results, SPSA is the preferable algorithm to use