首页 /研究 /Hierarchical relative entropy policy search
LEARNING

Hierarchical relative entropy policy search

DanielChristian, NeumannGerhard, KroemerOliver, PetersJan

发表年份
2016
引用次数
32

摘要

Many reinforcement learning (RL) tasks, especially in robotics, consist of multiple sub-tasks that are strongly structured. Such task structures can be exploited by incorporating hierarchical polic...

关键词

Kullback–Leibler divergenceComputer scienceMathematicsEntropy (arrow of time)Artificial intelligencePhysicsThermodynamics

相关论文

查看 LEARNING 分类全部论文