Learning curve bounds for a Markov decision process with undiscounted rewards
COLT '96 Proceedings of the ninth annual conference on Computational learning theory
Dynamic power management of electronic systems
Proceedings of the 1998 IEEE/ACM international conference on Computer-aided design
Computational challenges in portfolio management
Computing in Science and Engineering
IEEE/ACM Transactions on Networking (TON)
Planning and Control in Artificial Intelligence: A Unifying Perspective
Applied Intelligence
The Relations Among Potentials, Perturbation Analysis,and Markov Decision Processes
Discrete Event Dynamic Systems
Kernel-Based Reinforcement Learning
Machine Learning
Risk-Sensitive Reinforcement Learning
Machine Learning
From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning
Discrete Event Dynamic Systems
Approximate Gradient Methods in Policy-Space Optimization of Markov Reward Processes
Discrete Event Dynamic Systems
EMCL '01 Proceedings of the 12th European Conference on Machine Learning
On the Use of Option Policies for Autonomous Robot Navigation
IBERAMIA-SBIA '00 Proceedings of the International Joint Conference, 7th Ibero-American Conference on AI: Advances in Artificial Intelligence
Optimizing Average Reward Using Discounted Rewards
COLT '01/EuroCOLT '01 Proceedings of the 14th Annual Conference on Computational Learning Theory and and 5th European Conference on Computational Learning Theory
ICSAB Proceedings of the seventh international conference on simulation of adaptive behavior on From animals to animats
Greedy linear value-approximation for factored Markov decision processes
Eighteenth national conference on Artificial intelligence
Learning Generalized Policies from Planning Examples Using Concept Languages
Applied Intelligence
Optimal Stock Allocation for a Capacitated Supply System
Management Science
A Dynamic Programming Procedure for Pricing American-Style Asian Options
Management Science
Adaptive Inventory Control for Nonstationary Demand and Partial Information
Management Science
Supply Chain Management with Guaranteed Delivery
Management Science
CONVERGENCE OF SIMULATION-BASED POLICY ITERATION
Probability in the Engineering and Informational Sciences
Policy iteration type algorithms for recurrent state Markov decision processes
Computers and Operations Research
Probability in the Engineering and Informational Sciences
Basic Ideas for Event-Based Optimization of Markov Systems
Discrete Event Dynamic Systems
Bayesian sparse sampling for on-line reward optimization
ICML '05 Proceedings of the 22nd international conference on Machine learning
On the Introduction of an Agile, Temporary Workforce into a Tandem Queueing System
Queueing Systems: Theory and Applications
An analytic modelling approach for network routing algorithms that use "ant-like" mobile agents
Computer Networks: The International Journal of Computer and Telecommunications Networking
Computing the Optimally Fitted Spike Train for a Synapse
Neural Computation
Theoretical Computer Science - Tools and algorithms for the construction and analysis of systems (TACAS 2004)
Model checking discounted temporal properties
Theoretical Computer Science - Tools and algorithms for the construction and analysis of systems (TACAS 2004)
Probability in the Engineering and Informational Sciences
Admission Control With Incomplete Information To A Finite Buffer Queue
Probability in the Engineering and Informational Sciences
Dynamic Control of a Multiclass Queue with Thin Arrival Streams
Operations Research
Quantitative verification: models techniques and tools
Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
Motion segmentation and retrieval for 3D video based on modified shape distribution
EURASIP Journal on Applied Signal Processing
Quantitative verification: models, techniques and tools
The 6th Joint Meeting on European software engineering conference and the ACM SIGSOFT symposium on the foundations of software engineering: companion papers
Comparison-based algorithms are robust and randomized algorithms are anytime
Evolutionary Computation
Brief paper: Policy iteration based feedback control
Automatica (Journal of IFAC)
Fault-tolerant control of a distributed database system
Journal of Control Science and Engineering - Robustness Issues in Fault Diagnosis and Fault Tolerant Control
A New Natural Policy Gradient by Stationary Distribution Metric
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
A survey on metaheuristics for stochastic combinatorial optimization
Natural Computing: an international journal
Compact, convex upper bound iteration for approximate POMDP planning
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Faster heuristic search algorithms for planning with uncertainty and full feedback
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Solving POMDPs: RTDP-bel vs. point-based algorithms
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Sensor maneuver design for microwave source localization
ACC'09 Proceedings of the 2009 conference on American Control Conference
New Algorithms for Solving Simple Stochastic Games
Electronic Notes in Theoretical Computer Science (ENTCS)
Conformant plans and beyond: Principles and complexity
Artificial Intelligence
On-Line Policy Gradient Estimation with Multi-Step Sampling
Discrete Event Dynamic Systems
Bounded parameter Markov decision processes with average reward criterion
COLT'07 Proceedings of the 20th annual conference on Learning theory
Deterministic POMDPs revisited
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Sampled fictitious play for approximate dynamic programming
Computers and Operations Research
Optimal path planning for flexible redundant robot manipulators
CONTROL'05 Proceedings of the 2005 WSEAS international conference on Dynamical systems and control
Non-deterministic policies in Markovian decision processes
Journal of Artificial Intelligence Research
Qualitative MDPs and POMDPs: an order-of-magnitude approximation
UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
A characterization of meaningful schedulers for continuous-time markov decision processes
FORMATS'06 Proceedings of the 4th international conference on Formal Modeling and Analysis of Timed Systems
Innovation and Price Competition in a Two-Sided Market
Journal of Management Information Systems
Model checking interactive markov chains
TACAS'10 Proceedings of the 16th international conference on Tools and Algorithms for the Construction and Analysis of Systems
External estimates of the reachability sets of nonlinear controlled systems
Automation and Remote Control
Automatica (Journal of IFAC)
Hi-index | 0.01 |