Hierarchical relative entropy policy search

Christian Daniel, Gerhard Neumann, Oliver Kroemer, Jan Peters

Year of publication: 2016
Citations: 32

Abstract

Many reinforcement learning (RL) tasks, especially in robotics, consist of multiple sub-tasks that are strongly structured. Such task structures can be exploited by incorporating hierarchical polic...

Keywords

Kullback–Leibler divergence, Computer science, Mathematics, Entropy (arrow of time), Artificial intelligence, Physics, Thermodynamics

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Fractional Differential Equations

Applied Nonlinear Control