Discrete Deep Reinforcement Learning for Mapless Navigation

Enrico Marchesini, Alessandro Farinelli

发表年份: 2020
引用次数: 73

摘要

Our goal is to investigate whether discrete state space algorithms are a viable solution to continuous alternatives for mapless navigation. To this end we present an approach based on Double Deep Q-Network and employ parallel asynchronous training and a multi-batch Priority Experience Replay to reduce the training time. Experiments show that our method trains faster and outperforms both the continuous Deep Deterministic Policy Gradient and Proximal Policy Optimization algorithms. Moreover, we train the models in a custom environment built on the recent Unity learning toolkit and show that they can be exported on the TurtleBot3 simulator and to the real robot without further training. Overall our optimized method is 40% faster compared to the original discrete algorithm. This setting significantly reduces the training times with respect to the continuous algorithms, maintaining a similar level of success rate hence being a viable alternative for mapless navigation.

关键词

Computer scienceReinforcement learningTrainAsynchronous communicationTraining (meteorology)RobotArtificial intelligenceState spaceReal-time computingDeep learning

Discrete Deep Reinforcement Learning for Mapless Navigation

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory