Home /Research /Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning
LEARNING

Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning

Guilherme Cano Lopes, Murillo Ferreira, Alexandre da Silva Simões, Esther Luna Colombini

Year
2018
Citations
52

Abstract

Aerial platforms, such as quadrotors, are inherently unstable systems. Generally, the task of stabilizing the flight of a quadrotor is approached by techniques based on classic and modern control algorithms. However, recent model-free reinforcement learning algorithms have been successfully used for controlling drones. In this work we show the feasibility of applying reinforcement learning methods to optimize a stochastic control policy (during training), in order to perform the position control of the "model-free" quadrotor. This process is achieved while maintaining a good sampling efficiency, allowing fast convergence even when using computationally expensive off-the-shelf simulators for robotics and without the necessity of any additional exploration strategy. We used the Proximal Policy Optimization (PPO) algorithm to make the agent learn a reliable control policy. The experiments for the resultant intelligent controller were performed using the V-REP simulator and the Vortex physics engine.

Keywords

Reinforcement learningComputer scienceConvergence (economics)Process (computing)Controller (irrigation)Task (project management)Artificial intelligenceRoboticsPosition (finance)Drone

Related papers

Browse all LEARNING papers