论文成果

Comments on 'Finite-Time Analysis of the Multiarmed Bandit Problem'

摘要:In the article [1], we can get a tighter upper bound of expected regret in Theorem 1 and 4, there are also some critical incorrect statements in the proof of Theorem 2, we modified the incorrect statements in this comment and a correction version of Theorem 2 is also presented.
© 2019 IEEE.

ISSN号:2160-133X

卷、期、页:v 2019-July,

发表日期:2019-07-01

期刊分区(SCI为中科院分区):无

收录情况:EI(工程索引)

发表期刊名称:Proceedings - International Conference on Machine Learning and Cybernetics

参与作者:Li, Wei-Min,Ito, Nobuyasu

通讯作者:张鲁宁

第一作者:左信,刘建伟

论文类型:会议论文

论文概要:张鲁宁,左信,刘建伟,Li, Wei-Min,Ito, Nobuyasu,Comments on 'Finite-Time Analysis of the Multiarmed Bandit Problem',Proceedings - International Conference on Machine Learning and Cybernetics,2019,v 2019-July,

论文题目:Comments on 'Finite-Time Analysis of the Multiarmed Bandit Problem'