首页 /研究 /Hierarchical Reinforcement Learning for Air Combat at DARPA's AlphaDogfight Trials
LEARNING

Hierarchical Reinforcement Learning for Air Combat at DARPA's AlphaDogfight Trials

Adrian P. Pope, Jaime S. Ide, Daria Mićović, Henry Díaz, Jason C. Twedt, Kevin Alcedo, Thayne T. Walker, David Rosenbluth, Lee Ritholtz, D. Javorsek

发表年份
2022
引用次数
87

摘要

Autonomous control in high-dimensional, continuous state spaces is a persistent and important challenge in the fields of robotics and artificial intelligence. Because of high risk and complexity, the adoption of AI for autonomous combat systems has been a long-standing difficulty. In order to address these issues, DARPA's AlphaDogfight Trials (ADT) program sought to vet the feasibility of and increase trust in AI for autonomously piloting an F-16 in simulated air-to-air combat. Our submission to ADT solves the high-dimensional, continuous control problem using a novel hierarchical deep reinforcement learning approach consisting of a high-level policy selector and a set of separately trained low-level policies specialized for excelling in specific regions of the state space. Both levels of the hierarchy are trained using off-policy, maximum entropy methods with expert knowledge integrated through reward shaping. Our approach outperformed human expert pilots and achieved a second-place rank in the ADT championship event.

关键词

Reinforcement learningArtificial intelligenceRoboticsHierarchyState spaceComputer scienceChampionshipMachine learningHigh dimensionalOperations research

相关论文

查看 LEARNING 分类全部论文