首页 /研究 /Mapless Navigation with Deep Reinforcement Learning based on The Convolutional Proximal Policy Optimization Network

LEARNING

Mapless Navigation with Deep Reinforcement Learning based on The Convolutional Proximal Policy Optimization Network

Kim Gon Woo

发表年份: 2021
引用次数: 23

摘要

In recent years, mapless navigation with a deep reinforcement learning approach in Autonomous Mobile Robot has shown a considerable benefit in improving robot behavior flexibility. Specifically, the Robot adapts to complex constraints and performs well in various environments without the need for a predetermined map and route plan. However, several previous studies show a lack of stability in the training of deep reinforcement learning networks for mapless navigation. Moreover, the exploration and exploitation in a specific work environment play an essential role in improving mapless navigation performance, which needs to be carefully considered. From the aforementioned issues, as well as inspired by the Proximal Policy optimization algorithm has shown that it does not only evaluates the advantages and disadvantages of policy but also prevents the policy from changing too much after each time weight update. In this paper, we propose to build a Convolutional Proximal Policy optimization network for the Mapless Navigation problem. Furthermore, the use of Boltzmann's policy to help balance exploration and exploitation also contributes to the Robot's ability to explore more deeply in complex environments, and the performance of the mapless navigation problem is also significantly improved.

关键词

Reinforcement learningFlexibility (engineering)Computer scienceArtificial intelligenceMobile robot navigationMobile robotStability (learning theory)RobotDeep learningPlan (archaeology)

Mapless Navigation with Deep Reinforcement Learning based on The Convolutional Proximal Policy Optimization Network

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory