Algorithms for sequential decision-making

Authors:
Michael Lederman Littman
Affiliations:
-
Venue:
Algorithms for sequential decision-making
Year:
1996

Citing 0
Cited 61

Module-Based Reinforcement Learning: Experiments with a Real Robot

Machine Learning - Special issue on learning in autonomous robots
Co-Evolution in the Successful Learning of Backgammon Strategy

Machine Learning
Colearning in Differential Games

Machine Learning
Elevator Group Control Using Multiple Reinforcement Learning Agents

Machine Learning
Reinforcement learning and mistake bounded algorithms

COLT '99 Proceedings of the twelfth annual conference on Computational learning theory
On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Complexity of finite-horizon Markov decision process problems

Journal of the ACM (JACM)
A heuristic approach for solving decentralized-POMDP: assessment on the pursuit problem

Proceedings of the 2002 ACM symposium on Applied computing
Module-Based Reinforcement Learning: Experiments with a Real Robot

Autonomous Robots
Module Based Reinforcement Learning: An Application to a Real Robot

EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Integration of Biologically Inspired Temporal Mechanisms into a Cortical Framework fo Sequence Processing

Sequence Learning - Paradigms, Algorithms, and Applications
Sequential Decision Making Based on Direct Search

Sequence Learning - Paradigms, Algorithms, and Applications
On policy iteration as a Newton's method and polynomial policy iteration algorithms

Eighteenth national conference on Artificial intelligence
On the undecidability of probabilistic planning and related stochastic optimization problems

Artificial Intelligence - special issue on planning with uncertainty and incomplete information
A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis

Machine Learning
An online POMDP algorithm for complex multiagent environments

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms

Neural Computation
Dynamic multiagent probabilistic inference

International Journal of Approximate Reasoning
Dynamics based control with PSRs

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Planning and Learning in Environments with Delayed Feedback

ECML '07 Proceedings of the 18th European conference on Machine Learning
Value Iteration

25 Years of Model Checking
United We Stand: Population Based Methods for Solving Unknown POMDPs

Recent Advances in Reinforcement Learning
Learning and planning in environments with delayed feedback

Autonomous Agents and Multi-Agent Systems
Discounted deterministic Markov decision processes and discounted all-pairs shortest paths

SODA '09 Proceedings of the twentieth Annual ACM-SIAM Symposium on Discrete Algorithms
Learning partially observable action schemas

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Purely epistemic markov decision processes

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Value-function approximations for partially observable Markov decision processes

Journal of Artificial Intelligence Research
Speeding up the convergence of value iteration in partially observable Markov decision processes

Journal of Artificial Intelligence Research
Anytime point-based approximations for large POMDPs

Journal of Artificial Intelligence Research
Online planning algorithms for POMDPs

Journal of Artificial Intelligence Research
Learning partially observable deterministic action models

Journal of Artificial Intelligence Research
AEMS: an anytime online search algorithm for approximate policy refinement in large POMDPs

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
A planning algorithm for predictive state representations

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Learning partially observable deterministic action models

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Greedy algorithms for sequential sensing decisions

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
The Complexity of Solving Stochastic Games on Graphs

ISAAC '09 Proceedings of the 20th International Symposium on Algorithms and Computation
Discounted deterministic Markov decision processes and discounted all-pairs shortest paths

ACM Transactions on Algorithms (TALG)
Review article: Synergizing reinforcement learning and game theory-A new direction for control

Applied Soft Computing
Deterministic POMDPs revisited

UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Quasi deterministic POMDPs and DecPOMDPs

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Qualitative analysis of partially-observable Markov decision processes

MFCS'10 Proceedings of the 35th international conference on Mathematical foundations of computer science
Cost-based query answering in action probabilistic logic programs

SUM'10 Proceedings of the 4th international conference on Scalable uncertainty management
On model checking techniques for randomized distributed systems

IFM'10 Proceedings of the 8th international conference on Integrated formal methods
Requirements and initial model for KnowLang: a language for knowledge representation in autonomic service-component ensembles

Proceedings of The Fourth International C* Conference on Computer Science and Software Engineering
Information technology for healthcare transformation

IBM Journal of Research and Development
On the complexity of policy iteration

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Approximate planning for factored POMDPs using belief state simplification

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Polynomial value iteration algorithms for deterministic MDPs

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Reinforcement learning with partially known world dynamics

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Probabilistic ω-automata

Journal of the ACM (JACM)
Algorithms for omega-regular games with imperfect information

CSL'06 Proceedings of the 20th international conference on Computer Science Logic
A pumping algorithm for ergodic stochastic mean payoff games with perfect information

IPCO'10 Proceedings of the 14th international conference on Integer Programming and Combinatorial Optimization
Learning to make predictions in partially observable environments without a generative model

Journal of Artificial Intelligence Research
Survey A survey of computational complexity results in systems and control

Automatica (Journal of IFAC)
Event-learning and robust policy heuristics

Cognitive Systems Research
Finding patterns in an unknown graph

AI Communications - The Symposium on Combinatorial Search
Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor

Journal of the ACM (JACM)
A survey of point-based POMDP solvers

Autonomous Agents and Multi-Agent Systems
Parallel Abductive Query Answering in Probabilistic Logic Programs

ACM Transactions on Computational Logic (TOCL)
A pseudo-polynomial algorithm for mean payoff stochastic games with perfect information and a few random positions

ICALP'13 Proceedings of the 40th international conference on Automata, Languages, and Programming - Volume Part I
An intelligent broker agent for energy trading: an MDP approach

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence

Quantified Score

Hi-index	0.00

Algorithms for sequential decision-making

Quantified Score

Visualization

Abstract