Home /Research /Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations

LEARNING

Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations

Sean Gillen, Katie Byl

Year: 2022
Access: Open access

Abstract

In recent years, fully differentiable rigid body physics simulators have been developed, which can be used to simulate a wide range of robotic systems. In the context of reinforcement learning for control, these simulators theoretically allow algorithms to be applied directly to analytic gradients of the reward function. However, to date, these gradients have proved extremely challenging to use, and are outclassed by algorithms using no gradient information at all. In this work we present a novel algorithm, cross entropy analytic policy gradients, that is able to leverage these gradients to outperform state of art deep reinforcement learning on a set of challenging nonlinear control problems.

Keywords

cs.LGcs.ROeess.SY

Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations

Abstract

Keywords

Related papers

Parallel Differentiable Reachability for Learning and Planning with Certified Neural Dynamics and Controllers

Artificial Intelligence enhanced smart welding islands: Foundation models revolutionizing manufacturing

A deep reinforcement learning and a dynamic graph neural network-based scheduling agent to control a multi-task robot

LLM Agent-driven Automated DFA Assessment with Fine-tuning and AAS-based RAG