COMMENTS ON "FINITE-TIME ANALYSIS OF THE MULTIARMED BANDIT PROBLEM"
Abstract: For the article [1], a tighter upper bound on the expected regret can be obtained in Theorems 1 and 4. The proof of Theorem 2 also contains several critical incorrect statements; this comment corrects them and presents a corrected version of Theorem 2.
Keywords: Finite-time analysis; Multiarmed bandit problem; Expected regret
ISSN: 2160-133X
Volume, issue, pages: Volume 2019-July
Publication date: 2019-01-01
Journal tier (Chinese Academy of Sciences tier for SCI journals): None
Indexed in: EI (Engineering Index)
Published in: International Conference on Machine Learning and Cybernetics
Contributing authors: Li, Wei-Min; Ito, Nobuyasu
Corresponding author: 张鲁宁
First authors: 左信, 刘建伟
Paper type: Conference paper
Citation: 张鲁宁, 左信, 刘建伟, Li, Wei-Min, Ito, Nobuyasu. COMMENTS ON "FINITE-TIME ANALYSIS OF THE MULTIARMED BANDIT PROBLEM". International Conference on Machine Learning and Cybernetics, 2019, Volume 2019-July.
Paper title: COMMENTS ON "FINITE-TIME ANALYSIS OF THE MULTIARMED BANDIT PROBLEM"