首页 /研究 /Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning
LEARNING

Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning

Guilherme Cano Lopes, Murillo Ferreira, Alexandre da Silva Simões, Esther Luna Colombini

发表年份
2018
引用次数
52

摘要

Aerial platforms, such as quadrotors, are inherently unstable systems. Generally, the task of stabilizing the flight of a quadrotor is approached by techniques based on classic and modern control algorithms. However, recent model-free reinforcement learning algorithms have been successfully used for controlling drones. In this work we show the feasibility of applying reinforcement learning methods to optimize a stochastic control policy (during training), in order to perform the position control of the "model-free" quadrotor. This process is achieved while maintaining a good sampling efficiency, allowing fast convergence even when using computationally expensive off-the-shelf simulators for robotics and without the necessity of any additional exploration strategy. We used the Proximal Policy Optimization (PPO) algorithm to make the agent learn a reliable control policy. The experiments for the resultant intelligent controller were performed using the V-REP simulator and the Vortex physics engine.

关键词

Reinforcement learningComputer scienceConvergence (economics)Process (computing)Controller (irrigation)Task (project management)Artificial intelligenceRoboticsPosition (finance)Drone

相关论文

查看 LEARNING 分类全部论文