Module-Based Reinforcement Learning: Experiments with a Real Robot
Machine Learning - Special issue on learning in autonomous robots
Co-Evolution in the Successful Learning of Backgammon Strategy
Machine Learning
Colearning in Differential Games
Machine Learning
Elevator Group Control Using Multiple Reinforcement Learning Agents
Machine Learning
Reinforcement learning and mistake bounded algorithms
COLT '99 Proceedings of the twelfth annual conference on Computational learning theory
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference
Complexity of finite-horizon Markov decision process problems
Journal of the ACM (JACM)
A heuristic approach for solving decentralized-POMDP: assessment on the pursuit problem
Proceedings of the 2002 ACM symposium on Applied computing
Module-Based Reinforcement Learning: Experiments with a Real Robot
Autonomous Robots
Module Based Reinforcement Learning: An Application to a Real Robot
EWLR-6 Proceedings of the 6th European Workshop on Learning Robots
Sequence Learning - Paradigms, Algorithms, and Applications
Sequential Decision Making Based on Direct Search
Sequence Learning - Paradigms, Algorithms, and Applications
On policy iteration as a Newton's method and polynomial policy iteration algorithms
Eighteenth national conference on Artificial intelligence
On the undecidability of probabilistic planning and related stochastic optimization problems
Artificial Intelligence - special issue on planning with uncertainty and incomplete information
An online POMDP algorithm for complex multiagent environments
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms
Neural Computation
Dynamic multiagent probabilistic inference
International Journal of Approximate Reasoning
Dynamics based control with PSRs
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
Planning and Learning in Environments with Delayed Feedback
ECML '07 Proceedings of the 18th European conference on Machine Learning
25 Years of Model Checking
United We Stand: Population Based Methods for Solving Unknown POMDPs
Recent Advances in Reinforcement Learning
Learning and planning in environments with delayed feedback
Autonomous Agents and Multi-Agent Systems
Discounted deterministic Markov decision processes and discounted all-pairs shortest paths
SODA '09 Proceedings of the twentieth Annual ACM-SIAM Symposium on Discrete Algorithms
Learning partially observable action schemas
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Purely epistemic Markov decision processes
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Value-function approximations for partially observable Markov decision processes
Journal of Artificial Intelligence Research
Speeding up the convergence of value iteration in partially observable Markov decision processes
Journal of Artificial Intelligence Research
Anytime point-based approximations for large POMDPs
Journal of Artificial Intelligence Research
Online planning algorithms for POMDPs
Journal of Artificial Intelligence Research
Learning partially observable deterministic action models
Journal of Artificial Intelligence Research
AEMS: an anytime online search algorithm for approximate policy refinement in large POMDPs
IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence
A planning algorithm for predictive state representations
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Learning partially observable deterministic action models
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Greedy algorithms for sequential sensing decisions
IJCAI'09 Proceedings of the 21st international joint conference on Artificial intelligence
The Complexity of Solving Stochastic Games on Graphs
ISAAC '09 Proceedings of the 20th International Symposium on Algorithms and Computation
Discounted deterministic Markov decision processes and discounted all-pairs shortest paths
ACM Transactions on Algorithms (TALG)
Deterministic POMDPs revisited
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Quasi deterministic POMDPs and DecPOMDPs
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Qualitative analysis of partially-observable Markov decision processes
MFCS'10 Proceedings of the 35th international conference on Mathematical foundations of computer science
Cost-based query answering in action probabilistic logic programs
SUM'10 Proceedings of the 4th international conference on Scalable uncertainty management
On model checking techniques for randomized distributed systems
IFM'10 Proceedings of the 8th international conference on Integrated formal methods
Proceedings of The Fourth International C* Conference on Computer Science and Software Engineering
Information technology for healthcare transformation
IBM Journal of Research and Development
On the complexity of policy iteration
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Approximate planning for factored POMDPs using belief state simplification
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Polynomial value iteration algorithms for deterministic MDPs
UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Reinforcement learning with partially known world dynamics
UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Journal of the ACM (JACM)
Algorithms for omega-regular games with imperfect information
CSL'06 Proceedings of the 20th international conference on Computer Science Logic
A pumping algorithm for ergodic stochastic mean payoff games with perfect information
IPCO'10 Proceedings of the 14th international conference on Integer Programming and Combinatorial Optimization
Learning to make predictions in partially observable environments without a generative model
Journal of Artificial Intelligence Research
A survey of computational complexity results in systems and control
Automatica (Journal of IFAC)
Event-learning and robust policy heuristics
Cognitive Systems Research
Finding patterns in an unknown graph
AI Communications - The Symposium on Combinatorial Search
A survey of point-based POMDP solvers
Autonomous Agents and Multi-Agent Systems
Parallel Abductive Query Answering in Probabilistic Logic Programs
ACM Transactions on Computational Logic (TOCL)
ICALP'13 Proceedings of the 40th international conference on Automata, Languages, and Programming - Volume Part I
An intelligent broker agent for energy trading: an MDP approach
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence