搜索结果: 1-1 共查到“统计学 multi-armed bandits”相关记录1条 . 查询时间(0.062 秒)
A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences
Finite-Time Multi-armed Bandits Problems Kullback-Leibler Divergences
2011/6/20
We consider a Kullback-Leibler-based algorithmfor the stochastic multi-armed bandit prob-
lem in the case of distributions with finite supports (not necessarily known beforehand),
whose asymptotic r...