Publications

COMMENTS ON "FINITE-TIME ANALYSIS OF THE MULTIARMED BANDIT PROBLEM"

Abstract: For the article [1], a tighter upper bound on the expected regret can be obtained in Theorems 1 and 4. The proof of Theorem 2 also contains some critical incorrect statements; this comment corrects them and presents a corrected version of Theorem 2.

Keywords: Finite-time analysis; Multiarmed bandit problem; Expected regret

ISSN: 2160-133X

Volume, issue, pages: Volume 2019-July

Publication date: 2019-01-01

Journal ranking (Chinese Academy of Sciences partition for SCI journals): None

Indexed in: EI (Engineering Index)

Published in: International Conference on Machine Learning and Cybernetics

Co-authors: Li, Wei-Min; Ito, Nobuyasu

Corresponding author: 张鲁宁

First authors: 左信; 刘建伟

Paper type: Conference paper

Summary: 张鲁宁; 左信; 刘建伟; Li, Wei-Min; Ito, Nobuyasu. COMMENTS ON "FINITE-TIME ANALYSIS OF THE MULTIARMED BANDIT PROBLEM". International Conference on Machine Learning and Cybernetics, 2019, Volume 2019-July.

Paper title: COMMENTS ON "FINITE-TIME ANALYSIS OF THE MULTIARMED BANDIT PROBLEM"