Learning to act using real-time dynamic programming
Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
Reinforcement Learning
Introduction to Stochastic Dynamic Programming: Probability and Mathematical
Introduction to Stochastic Dynamic Programming: Probability and Mathematical
Optimization over Time
Neuro-Dynamic Programming
Rollout Algorithms for Combinatorial Optimization
Journal of Heuristics
Meta-heuristics: The State of the Art
ECAI '00 Proceedings of the Workshop on Local Search for Planning and Scheduling-Revised Papers
A cooperative parallel rollout algorithm for the sequential ordering problem
Parallel Computing - Special issue: Parallel computing in logistics
Parallel Rollout for Online Solution of Partially Observable Markov Decision Processes
Discrete Event Dynamic Systems
Online pricing for bandwidth provisioning in multi-class networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Sensor management using an active sensing approach
Signal Processing
Parallelization Strategies for Rollout Algorithms
Computational Optimization and Applications
Nonmyopic sensor scheduling and its efficient implementation for target tracking applications
EURASIP Journal on Applied Signal Processing
Computers and Operations Research
Stochastic rollout and justification to solve the resource-constrained project scheduling problem
Proceedings of the 39th conference on Winter simulation: 40 years! The best is yet to come
A survey on metaheuristics for stochastic combinatorial optimization
Natural Computing: an international journal
An improved quantum genetic algorithm for stochastic job shop problem
Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation
A POMDP framework for coordinated guidance of autonomous UAVs for multitarget tracking
EURASIP Journal on Advances in Signal Processing - Special issue on signal processing advances in robots and autonomy
Partially Observable Markov Decision Process Approximations for Adaptive Sensing
Discrete Event Dynamic Systems
Approximate stochastic dynamic programming for sensor scheduling to track multiple targets
Digital Signal Processing
Online planning algorithms for POMDPs
Journal of Artificial Intelligence Research
Rollout strategy-based probabilistic causal model approach for the multiple fault diagnosis
Robotics and Computer-Integrated Manufacturing
Opportunistic Fair Scheduling in Wireless Networks: An Approximate Dynamic Programming Approach
Mobile Networks and Applications
Brief paper: Sensor control for multi-object state-space estimation using random finite sets
Automatica (Journal of IFAC)
TECHNICAL NOTE---The Adaptive Knapsack Problem with Stochastic Rewards
Operations Research
A Monte-Carlo AIXI approximation
Journal of Artificial Intelligence Research
Online stochastic weighted matching: improved approximation algorithms
WINE'11 Proceedings of the 7th international conference on Internet and Network Economics
Approximate planning and verification for large markov decision processes
Proceedings of the 27th Annual ACM Symposium on Applied Computing
Potential-based reward shaping for POMDPs
Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Analysis of watson's strategies for playing Jeopardy!
Journal of Artificial Intelligence Research
Hi-index | 0.00 |
Stochasticscheduling problems are difficult stochastic control problemswith combinatorial decision spaces. In this paper we focus ona class of stochastic scheduling problems, the quiz problem andits variations. We discuss the use of heuristics for their solution,and we propose rollout algorithms based on these heuristics whichapproximate the stochastic dynamic programming algorithm. Weshow how the rollout algorithms can be implemented efficiently,with considerable savings in computation over optimal algorithms.We delineate circumstances under which the rollout algorithmsare guaranteed to perform better than the heuristics on whichthey are based. We also show computational results which suggestthat the performance of the rollout policies is near-optimal,and is substantially better than the performance of their underlyingheuristics.