首页 /研究 /Deep Deterministic Policy Gradient for Navigation of Mobile Robots
LEARNING

Deep Deterministic Policy Gradient for Navigation of Mobile Robots

Junior Costa de Jesus, Jair Augusto Bottega, Marco Antônio de Souza Leite Cuadros, Daniel Fernando Tello Gamarra

发表年份
2020
引用次数
18

摘要

This article describes the use of the Deep Deterministic Policy Gradient network, a deep reinforcement learning algorithm, for mobile robot navigation. The neural network structure has as inputs laser range findings, angular and linear velocities of the robot, and position and orientation of the mobile robot with respect to a goal position. The outputs of the network will be the angular and linear velocities used as control signals for the robot. The experiments demonstrated that deep reinforcement learning’s techniques that uses continuous actions, are efficient for decision-making in a mobile robot. Nevertheless, the design of the reward functions constitutes an important issue in the performance of deep reinforcement learning algorithms. In order to show the performance of the Deep Reinforcement Learning algorithm, we have applied successfully the proposed architecture in simulated environments and in experiments with a real robot.

关键词

Reinforcement learningMobile robotComputer scienceArtificial intelligencePosition (finance)RobotDeep learningArtificial neural networkComputer vision

相关论文

查看 LEARNING 分类全部论文