搜索结果: 1-1 共查到“管理学 Multi-Armed Bandit Problems”相关记录1条 . 查询时间(0.084 秒)
Deterministic Sequencing of Exploration and Exploitation for Multi-Armed Bandit Problems
Deterministic Sequencing Exploration Exploitation Multi-Armed Bandit Problems
2011/7/7
In the Multi-Armed Bandit (MAB) problem, there are a given set of arms with unknown reward distributions. At each time, a player selects one arm to play, aiming to maximize the total expected reward o...