首页 /研究 /Hierarchical Reinforcement Learning for Air Combat at DARPA's AlphaDogfight Trials

LEARNING

Hierarchical Reinforcement Learning for Air Combat at DARPA's AlphaDogfight Trials

Adrian P. Pope, Jaime S. Ide, Daria Mićović, Henry Díaz, Jason C. Twedt, Kevin Alcedo, Thayne T. Walker, David Rosenbluth, Lee Ritholtz, D. Javorsek

发表年份: 2022
引用次数: 87

摘要

Autonomous control in high-dimensional, continuous state spaces is a persistent and important challenge in the fields of robotics and artificial intelligence. Because of high risk and complexity, the adoption of AI for autonomous combat systems has been a long-standing difficulty. In order to address these issues, DARPA's AlphaDogfight Trials (ADT) program sought to vet the feasibility of and increase trust in AI for autonomously piloting an F-16 in simulated air-to-air combat. Our submission to ADT solves the high-dimensional, continuous control problem using a novel hierarchical deep reinforcement learning approach consisting of a high-level policy selector and a set of separately trained low-level policies specialized for excelling in specific regions of the state space. Both levels of the hierarchy are trained using off-policy, maximum entropy methods with expert knowledge integrated through reward shaping. Our approach outperformed human expert pilots and achieved a second-place rank in the ADT championship event.

关键词

Reinforcement learningArtificial intelligenceRoboticsHierarchyState spaceComputer scienceChampionshipMachine learningHigh dimensionalOperations research

Hierarchical Reinforcement Learning for Air Combat at DARPA's AlphaDogfight Trials

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory