LEARNING

Optimistic reinforcement learning by forward Kullback–Leibler divergence optimization

Taisuke Kobayashi

Publication year
2022
Citations
20

Keywords

Reinforcement learning, Divergence (linguistics), Computer science, Kullback–Leibler divergence, Mathematical optimization, Hyperparameter, Optimization problem, Artificial intelligence, Bellman equation, Markov decision process
