
Optimistic reinforcement learning by forward Kullback–Leibler divergence optimization

Taisuke Kobayashi

Year: 2022
Citations: 20

Keywords

Reinforcement learning, Divergence (linguistics), Computer science, Kullback–Leibler divergence, Mathematical optimization, Hyperparameter, Optimization problem, Artificial intelligence, Bellman equation, Markov decision process
