LEARNING
Hierarchical relative entropy policy search
Christian Daniel, Gerhard Neumann, Oliver Kroemer, Jan Peters
- Publication year
- 2016
- Citations
- 32
Abstract
Many reinforcement learning (RL) tasks, especially in robotics, consist of multiple sub-tasks that are strongly structured. Such task structures can be exploited by incorporating hierarchical polic...
Keywords
Kullback–Leibler divergence, Computer science, Mathematics, Entropy (arrow of time), Artificial intelligence, Physics, Thermodynamics
Related papers
OTHER
📊 26,957 citations
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
PERCEPTION
📊 22,245 citations
Artificial intelligence: a modern approach
1995
OTHER
Open access 📊 20,501 citations
Fractional Differential Equations
Igor Podlubný
2025
OTHER
📊 18,993 citations
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991