首页 /研究 /Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations

LEARNING

Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations

Sean Gillen, Katie Byl

发表年份: 2022
访问权限: 开放获取

摘要

In recent years, fully differentiable rigid body physics simulators have been developed, which can be used to simulate a wide range of robotic systems. In the context of reinforcement learning for control, these simulators theoretically allow algorithms to be applied directly to analytic gradients of the reward function. However, to date, these gradients have proved extremely challenging to use, and are outclassed by algorithms using no gradient information at all. In this work we present a novel algorithm, cross entropy analytic policy gradients, that is able to leverage these gradients to outperform state of art deep reinforcement learning on a set of challenging nonlinear control problems.

关键词

cs.LGcs.ROeess.SY

Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations

摘要

关键词

相关论文

面向学习与规划的并行可微可达性：具有认证神经动力学与控制器的系统

人工智能增强的智能焊接岛：基础模型革新制造业

基于深度强化学习和动态图神经网络的多任务机器人调度代理

基于微调与AAS增强检索的LLM驱动自动化DFA评估