首页 /研究 /Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning

LEARNING

Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning

Guilherme Cano Lopes, Murillo Ferreira, Alexandre da Silva Simões, Esther Luna Colombini

发表年份: 2018
引用次数: 52

摘要

Aerial platforms, such as quadrotors, are inherently unstable systems. Generally, the task of stabilizing the flight of a quadrotor is approached by techniques based on classic and modern control algorithms. However, recent model-free reinforcement learning algorithms have been successfully used for controlling drones. In this work we show the feasibility of applying reinforcement learning methods to optimize a stochastic control policy (during training), in order to perform the position control of the "model-free" quadrotor. This process is achieved while maintaining a good sampling efficiency, allowing fast convergence even when using computationally expensive off-the-shelf simulators for robotics and without the necessity of any additional exploration strategy. We used the Proximal Policy Optimization (PPO) algorithm to make the agent learn a reliable control policy. The experiments for the resultant intelligent controller were performed using the V-REP simulator and the Vortex physics engine.

关键词

Reinforcement learningComputer scienceConvergence (economics)Process (computing)Controller (irrigation)Task (project management)Artificial intelligenceRoboticsPosition (finance)Drone

Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory