On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies
Mathematics of Operations Research
Simulation-based optimization of Markov decision processes: An empirical process theory approach
Automatica (Journal of IFAC)
Fast convergence to state-action frequency polytopes for MDPs
Operations Research Letters