Dynamic Programming and Optimal Control, Two Volume Set

Authors:
Dimitri P. Bertsekas
Affiliations:
-
Venue:
Dynamic Programming and Optimal Control, Two Volume Set
Year:
1995

Citing 0
Cited 62

Learning curve bounds for a Markov decision process with undiscounted rewards

COLT '96 Proceedings of the ninth annual conference on Computational learning theory
Dynamic power management of electronic systems

Proceedings of the 1998 IEEE/ACM international conference on Computer-aided design
Computational challenges in portfolio management

Computing in Science and Engineering
Pricing in multiservice loss networks: static pricing, asymptotic optimality, and demand substitution effects

IEEE/ACM Transactions on Networking (TON)
Planning and Control in Artificial Intelligence: A Unifying Perspective

Applied Intelligence
The Relations Among Potentials, Perturbation Analysis, and Markov Decision Processes

Discrete Event Dynamic Systems
Kernel-Based Reinforcement Learning

Machine Learning
Risk-Sensitive Reinforcement Learning

Machine Learning
From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning

Discrete Event Dynamic Systems
Approximate Gradient Methods in Policy-Space Optimization of Markov Reward Processes

Discrete Event Dynamic Systems
Towards a Universal Theory of Artificial Intelligence Based on Algorithmic Probability and Sequential Decisions

ECML '01 Proceedings of the 12th European Conference on Machine Learning
On the Use of Option Policies for Autonomous Robot Navigation

IBERAMIA-SBIA '00 Proceedings of the International Joint Conference, 7th Ibero-American Conference on AI: Advances in Artificial Intelligence
Optimizing Average Reward Using Discounted Rewards

COLT '01/EuroCOLT '01 Proceedings of the 14th Annual Conference on Computational Learning Theory and 5th European Conference on Computational Learning Theory
Using Markovian decision problems to analyze animal performance in random and variable ratio schedules of reinforcement

ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Greedy linear value-approximation for factored Markov decision processes

Eighteenth national conference on Artificial intelligence
Analysis of a Rollout Approach to Sequencing Problems with Stochastic Routing Applications

Journal of Heuristics
Learning Generalized Policies from Planning Examples Using Concept Languages

Applied Intelligence
Optimal Stock Allocation for a Capacitated Supply System

Management Science
A Dynamic Programming Procedure for Pricing American-Style Asian Options

Management Science
Adaptive Inventory Control for Nonstationary Demand and Partial Information

Management Science
Supply Chain Management with Guaranteed Delivery

Management Science
Convergence of Simulation-Based Policy Iteration

Probability in the Engineering and Informational Sciences
Policy iteration type algorithms for recurrent state Markov decision processes

Computers and Operations Research
Optimality of Four-Threshold Policies in Inventory Systems with Customer Returns and Borrowing/Storage Options

Probability in the Engineering and Informational Sciences
Basic Ideas for Event-Based Optimization of Markov Systems

Discrete Event Dynamic Systems
Bayesian sparse sampling for on-line reward optimization

ICML '05 Proceedings of the 22nd international conference on Machine learning
On the Introduction of an Agile, Temporary Workforce into a Tandem Queueing System

Queueing Systems: Theory and Applications
An analytic modelling approach for network routing algorithms that use "ant-like" mobile agents

Computer Networks: The International Journal of Computer and Telecommunications Networking
Computing the Optimally Fitted Spike Train for a Synapse

Neural Computation
Efficient computation of time-bounded reachability probabilities in uniform continuous-time Markov decision processes

Theoretical Computer Science - Tools and algorithms for the construction and analysis of systems (TACAS 2004)
Model checking discounted temporal properties

Theoretical Computer Science - Tools and algorithms for the construction and analysis of systems (TACAS 2004)
On The Structure Of Optimal Ordering Policies For Stochastic Inventory Systems With Minimum Order Quantity

Probability in the Engineering and Informational Sciences
Admission Control With Incomplete Information To A Finite Buffer Queue

Probability in the Engineering and Informational Sciences
Dynamic Control of a Multiclass Queue with Thin Arrival Streams

Operations Research
Revenue Management for a Multiclass Single-Server Queue via a Fluid Model Analysis

Operations Research
Quantitative verification: models, techniques and tools

Proceedings of the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
Motion segmentation and retrieval for 3D video based on modified shape distribution

EURASIP Journal on Applied Signal Processing
Quantitative verification: models, techniques and tools

The 6th Joint Meeting on European software engineering conference and the ACM SIGSOFT symposium on the foundations of software engineering: companion papers
Comparison-based algorithms are robust and randomized algorithms are anytime

Evolutionary Computation
Brief paper: Policy iteration based feedback control

Automatica (Journal of IFAC)
Fault-tolerant control of a distributed database system

Journal of Control Science and Engineering - Robustness Issues in Fault Diagnosis and Fault Tolerant Control
A New Natural Policy Gradient by Stationary Distribution Metric

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
A survey on metaheuristics for stochastic combinatorial optimization

Natural Computing: an international journal
Compact, convex upper bound iteration for approximate POMDP planning

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 2
An algorithm better than AO*?

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Faster heuristic search algorithms for planning with uncertainty and full feedback

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Solving POMDPs: RTDP-bel vs. point-based algorithms

IJCAI'09 Proceedings of the 21st international joint conference on Artificial intelligence
Sensor maneuver design for microwave source localization

ACC'09 Proceedings of the 2009 conference on American Control Conference
New Algorithms for Solving Simple Stochastic Games

Electronic Notes in Theoretical Computer Science (ENTCS)
Conformant plans and beyond: Principles and complexity

Artificial Intelligence
On-Line Policy Gradient Estimation with Multi-Step Sampling

Discrete Event Dynamic Systems
Bounded parameter Markov decision processes with average reward criterion

COLT'07 Proceedings of the 20th annual conference on Learning theory
Deterministic POMDPs revisited

UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Sampled fictitious play for approximate dynamic programming

Computers and Operations Research
Optimal path planning for flexible redundant robot manipulators

CONTROL'05 Proceedings of the 2005 WSEAS international conference on Dynamical systems and control
Non-deterministic policies in Markovian decision processes

Journal of Artificial Intelligence Research
Qualitative MDPs and POMDPs: an order-of-magnitude approximation

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
A characterization of meaningful schedulers for continuous-time Markov decision processes

FORMATS'06 Proceedings of the 4th international conference on Formal Modeling and Analysis of Timed Systems
Innovation and Price Competition in a Two-Sided Market

Journal of Management Information Systems
Model checking interactive Markov chains

TACAS'10 Proceedings of the 16th international conference on Tools and Algorithms for the Construction and Analysis of Systems
External estimates of the reachability sets of nonlinear controlled systems

Automation and Remote Control
Technical Communique: A unified approach to Markov decision problems and performance sensitivity analysis

Automatica (Journal of IFAC)

Quantified Score

Hi-index	0.01
