Finite-time Analysis of the Multiarmed Bandit Problem
Machine Learning
On the Convergence Rate of Good-Turing Estimators
COLT '00 Proceedings of the Thirteenth Annual Conference on Computational Learning Theory
Always Good Turing: Asymptotically Optimal Probability Estimation
FOCS '03 Proceedings of the 44th Annual IEEE Symposium on Foundations of Computer Science
Concentration inequalities for the missing mass and for histogram rule error
The Journal of Machine Learning Research
Hi-index | 0.00 |
We consider an original problem that arises from the issue of security analysis of a power system and that we name optimal discovery with probabilistic expert advice. We address it with an algorithm based on the optimistic paradigm and on the Good-Turing missing mass estimator. We prove two different regret bounds on the performance of this algorithm under weak assumptions on the probabilistic experts. Under more restrictive hypotheses, we also prove a macroscopic optimality result, comparing the algorithm both with an oracle strategy and with uniform sampling. Finally, we provide numerical experiments illustrating these theoretical findings.