Customisable Control Policy Learning for Robotics

Ce Guo, Wayne Luk, Stanley Qing Shui Loh, Alexander Warren, Joshua A. Levine

发表年份: 2019
引用次数: 13

摘要

Deep reinforcement learning algorithms integrate deep neural networks with traditional reinforcement learning methodologies. These techniques have been developed and used for various applications to produce exciting results in many fields, including robotics. However, physical robots require a large amount of training episodes which can damage the robot if directed by immature policies. Training using simulations can serve as a viable alternative before a robot is deployed in the field. This study addresses a computational challenge of deep reinforcement learning by developing a hardware architecture for the Deep Deterministic Policy Gradient (DDPG) algorithm. Additionally, we identify the customisation opportunities for a full-stack development framework with reinforcement learning to discover control policies for robotic arms. Finally, we transfer policies encoded in fixed-point numbers from our FPGA DDPG implementation to a robotic arm to evaluate the feasibility of our learning platform.

关键词

Reinforcement learningArtificial intelligenceRoboticsComputer scienceDeep learningRobotField (mathematics)Artificial neural networkRobot learningTransfer of learning

Customisable Control Policy Learning for Robotics

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory