Feed-Forward Learning: Fast Reinforcement Learning of Controllers
IWINAC '07 Proceedings of the 2nd international work-conference on Nature Inspired Problem-Solving Methods in Knowledge Engineering: Interplay Between Natural and Artificial Computation, Part II
A Retrograde Approximation Algorithm for Multi-player Can't Stop
CG '08 Proceedings of the 6th international conference on Computers and Games
Value Function Based Reinforcement Learning in Changing Markovian Environments
The Journal of Machine Learning Research
Queueing Systems: Theory and Applications
New Error Bounds for Approximations from Projected Linear Equations
Recent Advances in Reinforcement Learning
Projected equation methods for approximate solution of large linear systems
Journal of Computational and Applied Mathematics
Gaussian process dynamic programming
Neurocomputing
Scheduling with limited information in wireless systems
Proceedings of the tenth ACM international symposium on Mobile ad hoc networking and computing
OPEDo: a tool for the optimization of performance and dependability models
ACM SIGMETRICS Performance Evaluation Review
Linear Bellman combination for control of character animation
ACM SIGGRAPH 2009 papers
Congestion management in delay tolerant networks
Proceedings of the 4th Annual International Conference on Wireless Internet
A POMDP framework for coordinated guidance of autonomous UAVs for multitarget tracking
EURASIP Journal on Advances in Signal Processing - Special issue on signal processing advances in robots and autonomy
Partially Observable Markov Decision Process Approximations for Adaptive Sensing
Discrete Event Dynamic Systems
Intervention in context-sensitive probabilistic Boolean networks revisited
EURASIP Journal on Bioinformatics and Systems Biology - Special issue on applications of signal procesing techniques to bioinformatics, genomics, and proteomics
Feature Selection for Value Function Approximation Using Bayesian Model Selection
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Optimal and approximate Q-value functions for decentralized POMDPs
Journal of Artificial Intelligence Research
Infinite-horizon policy-gradient estimation
Journal of Artificial Intelligence Research
Reinforcement Learning in RoboCup KeepAway with Partial Observability
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
An analytic modelling approach for network routing algorithms that use "ant-like" mobile agents
Computer Networks: The International Journal of Computer and Telecommunications Networking
An elective surgery scheduling problem considering patient priority
Computers and Operations Research
A Survey of Motion Planning Algorithms from the Perspective of Autonomous UAV Guidance
Journal of Intelligent and Robotic Systems
Coding and control for communication networks
Queueing Systems: Theory and Applications
Efficient optimization of building emergency evacuation considering social bond of evacuees
CASE'09 Proceedings of the fifth annual IEEE international conference on Automation science and engineering
An optimization framework for heterogeneous access management
WCNC'09 Proceedings of the 2009 IEEE conference on Wireless Communications & Networking Conference
GameNets'09 Proceedings of the First ICST international conference on Game Theory for Networks
Dynamic adaptation of user migration policies in distributed virtual environments
Dynamic adaptation of user migration policies in distributed virtual environments
Limited feedback for multi-carrier beamforming: a rate-distortion approach
ISIT'09 Proceedings of the 2009 IEEE international conference on Symposium on Information Theory - Volume 1
Approximate dynamic programming using Bellman residual elimination and Gaussian process regression
ACC'09 Proceedings of the 2009 conference on American Control Conference
Efficient suboptimal solutions of switched LQR problems
ACC'09 Proceedings of the 2009 conference on American Control Conference
Option pricing for inventory management and control
ACC'09 Proceedings of the 2009 conference on American Control Conference
Adaptive intervention in probabilistic boolean networks
ACC'09 Proceedings of the 2009 conference on American Control Conference
icLQG: combining local and global optimization for control in information space
ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
On the connections between PCTL and dynamic programming
Proceedings of the 13th ACM international conference on Hybrid systems: computation and control
IEEE Transactions on Signal Processing
ACM SIGGRAPH 2010 papers
Optimal sleep patterns for serving delay-tolerant jobs
Proceedings of the 1st International Conference on Energy-Efficient Computing and Networking
Optimal state estimation in the presence of communication costs and packet drops
Allerton'09 Proceedings of the 47th annual Allerton conference on Communication, control, and computing
Media Revenue Management with Audience Uncertainty: Balancing Upfront and Spot Market Sales
Manufacturing & Service Operations Management
Passive discovery of IEEE 802.15.4-based body sensor networks
Ad Hoc Networks
Minimum-length scheduling for multicast traffic under channel uncertainty
GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
Minimizing delay and maximizing lifetime for wireless sensor networks with anycast
IEEE/ACM Transactions on Networking (TON)
Cost and target-based scheduling for switch power control
ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Event-driven optimal feedback control for multiantenna beamforming
IEEE Transactions on Signal Processing
Delay-optimal power and subcarrier allocation for OFDMA systems via stochastic approximation
IEEE Transactions on Wireless Communications
Energy optimal transmission scheduling in wireless sensor networks
IEEE Transactions on Wireless Communications
Zero-error feedback capacity of channels with state information via dynamic programming
IEEE Transactions on Information Theory
Traffic-aware optimization of heterogeneous access management
IEEE Transactions on Communications
Learning to optimally exploit multi-channel diversity in wireless systems
INFOCOM'10 Proceedings of the 29th conference on Information communications
Rate adaptation games in wireless LANs: nash equilibrium and price of anarchy
INFOCOM'10 Proceedings of the 29th conference on Information communications
WONS'10 Proceedings of the 7th international conference on Wireless on-demand network systems and services
Error Bounds for Approximations from Projected Linear Equations
Mathematics of Operations Research
PAC-MDP learning with knowledge-based admissible models
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Minimum-length scheduling and rate control for time-varying wireless networks
MILCOM'09 Proceedings of the 28th IEEE conference on Military communications
Energy efficient multi-object tracking in sensor networks
IEEE Transactions on Signal Processing
Numerical analysis of continuous time Markov decision processes over finite horizons
Computers and Operations Research
Distributive stochastic learning for delay-optimal OFDMA power and subband allocation
IEEE Transactions on Signal Processing
Selection policy-induced reduction mappings for Boolean networks
IEEE Transactions on Signal Processing
Dynamic admission and service rate control of a queue
Queueing Systems: Theory and Applications
Stochastic real-time games with qualitative timed automata objectives
CONCUR'10 Proceedings of the 21st international conference on Concurrency theory
Adaptive bases for reinforcement learning
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Optimal OFDMA downlink scheduling under a control signaling cost constraint
IEEE Transactions on Communications
IEEE Transactions on Wireless Communications
Towards analysis of semi-Markov decision processes
AICI'10 Proceedings of the 2010 international conference on Artificial intelligence and computational intelligence: Part I
Scheduling in Wireless Networks
Foundations and Trends® in Networking
Adaptive modulation with smoothed flow utility
EURASIP Journal on Wireless Communications and Networking
Hybrid metaheuristics in combinatorial optimization: A survey
Applied Soft Computing
Reinforcement learning techniques for the control of wastewater treatment plants
IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation: new challenges on bioinspired applications - Volume Part II
Efficient planning under uncertainty with macro-actions
Journal of Artificial Intelligence Research
Theoretical considerations of potential-based reward shaping for multi-agent systems
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
CASC'11 Proceedings of the 13th international conference on Computer algebra in scientific computing
Optimal anycast technique for delay-sensitive energy-constrained asynchronous sensor networks
IEEE/ACM Transactions on Networking (TON)
Survey of Motion Planning Literature in the Presence of Uncertainty: Considerations for UAV Guidance
Journal of Intelligent and Robotic Systems
Lagrangian relaxation and constraint generation for allocation and advanced scheduling
Computers and Operations Research
Segment-based packet combining: how to schedule a dense relayer cluster?
Wireless Networks
Optimal multi-layered congestion based pricing schemes for enhanced QoS
Computer Networks: The International Journal of Computer and Telecommunications Networking
Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
Mathematics of Operations Research
Online stochastic weighted matching: improved approximation algorithms
WINE'11 Proceedings of the 7th international conference on Internet and Network Economics
Analyzing dynamic fitness landscapes of the targeting problem of chaotic systems
EvoApplications'12 Proceedings of the 2012t European conference on Applications of Evolutionary Computation
Approximate Dynamic Programming via a Smoothed Linear Program
Operations Research
Dynamic potential-based reward shaping
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Analysis of methods for solving MDPs
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
A comparative study of reinforcement learning techniques on dialogue management
EACL '12 Proceedings of the Student Research Workshop at the 13th Conference of the European Chapter of the Association for Computational Linguistics
Hybrid metaheuristics in combinatorial optimization: a tutorial
TPNC'12 Proceedings of the First international conference on Theory and Practice of Natural Computing
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Bayesian Learning of Noisy Markov Decision Processes
ACM Transactions on Modeling and Computer Simulation (TOMACS) - Special Issue on Monte Carlo Methods in Statistics
Pathwise Optimization for Optimal Stopping Problems
Management Science
Optimization via simulation with Bayesian statistics and dynamic programming
Proceedings of the Winter Simulation Conference
Proceedings of the Winter Simulation Conference
Diagnostic Accuracy Under Congestion
Management Science
Stochastic optimal control as a theory of brain-machine interface operation
Neural Computation
American option pricing with randomized quasi-Monte Carlo simulations
Proceedings of the Winter Simulation Conference
Robust Markov Decision Processes
Mathematics of Operations Research
Multivariate context collection in mobile sensor networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
TEStore: exploiting thermal and energy storage to cut the electricity bill for datacenter cooling
Proceedings of the 8th International Conference on Network and Service Management
Reliable approximations of probability-constrained stochastic linear-quadratic control
Automatica (Journal of IFAC)
Stochastic game for wireless network virtualization
IEEE/ACM Transactions on Networking (TON)
Finite-sample analysis of least-squares policy iteration
The Journal of Machine Learning Research
The Journal of Machine Learning Research
On sample size control in sample average approximations for solving smooth stochastic programs
Computational Optimization and Applications
Dynamic analysis of naive adaptive brain-machine interfaces
Neural Computation
Dynamic analysis of naive adaptive brain-machine interfaces
Neural Computation
Approximate Linear Programming for Average Cost MDPs
Mathematics of Operations Research
Dynamic fluid-based scheduling in a multi-class abandonment queue
Performance Evaluation
Construction of approximation spaces for reinforcement learning
The Journal of Machine Learning Research
Optimal forwarding in delay-tolerant networks with multiple destinations
IEEE/ACM Transactions on Networking (TON)
Hi-index | 0.08 |
A major revision of the second volume of a textbook on the far-ranging algorithmic methododogy of Dynamic Programming, which can be used for optimal control, Markovian decision problems, planning and sequential decision making under uncertainty, and discrete/combinatorial optimization. The second volume is oriented towards mathematical analysis and computation, and treats infinite horizon problems extensively. New features of the 3rd edition are: 1) A major enlargement in size and scope: the length has increased by more than 50%, and most of the old material has been restructured and/or revised. 2) Extensive coverage (more than 100 pages) of recent research on simulation-based approximate dynamic programming (neuro-dynamic programming), which allow the practical application of dynamic programming to large and complex problems. 3) An in-depth development of the average cost problem (more than 100 pages), including a full analysis of multichain problems, and an extensive analysis of infinite-spaces problems. 4) An introduction to infinite state space stochastic shortest path problems. 5) Expansion of the theory and use of contraction mappings in infinite state space problems and in neuro-dynamic programming. 6) A substantive appendix on the mathematical measure-theoretic issues that must be addressed for a rigorous theory of stochastic dynamic programming. Much supplementary material can be found in the book's web page: http://www.athenasc.com/dpbook.html