首页 /研究 /Deep Reinforcement Learning-based Continuous Control for Multicopter Systems

LEARNING

Deep Reinforcement Learning-based Continuous Control for Multicopter Systems

Anush Manukyan, Miguel Olivares-Mendez, Matthieu Geist, Holger Voos

发表年份: 2019
引用次数: 9

摘要

In this paper we apply deep reinforcement learning techniques on a multicopter for learning a stable hovering task in a continuous state action environment. We present a framework based on OpenAI GYM, Gazebo, Robotic Operating System and RotorS MAV simulator, used for successfully training different agents to perform various tasks. The deep reinforcement learning method used for the training is a model-free, on-policy, actor-critic based algorithm called Trust Region Policy Optimization (TRPO). Two neural networks have been used as nonlinear function approximators. Our experiments show that such learning approach achieves successful results, and facilitates the process of controller design.

关键词

Reinforcement learningComputer scienceTask (project management)Artificial intelligenceProcess (computing)Controller (irrigation)State (computer science)Artificial neural networkNonlinear systemFunction (biology)

Deep Reinforcement Learning-based Continuous Control for Multicopter Systems

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory