首页 /研究 /Reinforcement Learning-Based Algorithm to Avoid Obstacles by the Anthropomorphic Robotic Arm
LEARNING

Reinforcement Learning-Based Algorithm to Avoid Obstacles by the Anthropomorphic Robotic Arm

Tymoteusz Lindner, Andrzej Milecki

发表年份
2022
引用次数
27
访问权限
开放获取

摘要

In this paper, the application of the policy gradient Reinforcement Learning-based (RL) method for obstacle avoidance is proposed. This method was successfully used to control the movements of a robot using trial-and-error interactions with its environment. In this paper, an approach based on a Deep Deterministic Policy Gradient (DDPG) algorithm combined with a Hindsight Experience Replay (HER) algorithm for avoiding obstacles has been investigated. In order to ensure that the robot avoids obstacles and reaches the desired position as quickly and as accurately as possible, a special approach to the training and architecture of two RL agents working simultaneously was proposed. The implementation of this RL-based approach was first implemented in a simulation environment, which was used to control the 6-axis robot simulation model. Then, the same algorithm was used to control a real 6-DOF (degrees of freedom) robot. The results obtained in the simulation were compared with results obtained in laboratory conditions.

关键词

Reinforcement learningHindsight biasComputer scienceRobotObstacleArtificial intelligencePosition (finance)Obstacle avoidanceControl (management)Control theory (sociology)

相关论文

查看 LEARNING 分类全部论文