Hierarchical Landmark Policy Optimization for Visual Indoor Navigation

Aleksei Staroverov, Aleksandr I. Panov

Publication year: 2022
Citations: 9
Access: Open access

Abstract

In this paper, we study the problem of visual indoor navigation to an object defined by its semantic category. Recent works have reported significant progress with both end-to-end reinforcement learning and modular systems. However, both approaches still need a large step forward to become robust and practically applicable. To address insufficient exploration of scenes and to make exploration more semantically meaningful, we extend the standard task formulation and give the agent easily accessible landmarks in the form of room locations and their types. The availability of landmarks allows the agent to build a hierarchical policy structure and achieve a success rate of 63% on validation scenes in the photo-realistic Habitat simulator. In this hierarchy, the low level consists of separately trained RL skills, and the high level is a deterministic policy that decides which skill is needed at a given moment. We also show that a trained policy can be transferred to a real robot. After brief additional training on a reconstruction of the real scene, the robot achieves up to 79% SPL when navigating to an arbitrary object.
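
The abstract reports results in SPL without defining it. Assuming the authors use the standard definition of Success weighted by Path Length common in embodied navigation benchmarks, the metric averages, over episodes, the success indicator multiplied by the ratio of the shortest-path length to the larger of the agent's path length and the shortest-path length. The following is a minimal sketch of that calculation; the function name `spl` and its argument names are illustrative and do not come from the paper.

```python
from typing import Sequence


def spl(
    successes: Sequence[bool],
    shortest_lengths: Sequence[float],
    path_lengths: Sequence[float],
) -> float:
    """Success weighted by Path Length, averaged over episodes.

    Assumes the standard definition:
        SPL = (1 / N) * sum_i S_i * l_i / max(p_i, l_i)
    where S_i is the success indicator, l_i the shortest-path length
    from start to goal, and p_i the length of the path the agent took.
    """
    n = len(successes)
    if n == 0:
        raise ValueError("at least one episode is required")
    if len(shortest_lengths) != n or len(path_lengths) != n:
        raise ValueError("all inputs must have the same length")

    total = 0.0
    for success, shortest, taken in zip(successes, shortest_lengths, path_lengths):
        denominator = max(taken, shortest)
        # Failed episodes contribute zero; the denominator guard covers
        # the degenerate case where start and goal coincide.
        if success and denominator > 0:
            total += shortest / denominator
    return total / n


# Three hypothetical episodes: an optimal success, a success with a path
# twice the optimal length, and a failure.
example = spl([True, True, False], [10.0, 10.0, 10.0], [10.0, 20.0, 5.0])
```

Under this definition, an agent scores 1.0 on an episode only if it succeeds along a shortest path, so an SPL of 79% reflects both how often the robot reaches the goal and how direct its routes are.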

Keywords

Landmark, Computer science, Task (project management), Reinforcement learning, Modular design, Robot, Artificial intelligence, Hierarchy, Object (grammar), Scheme (mathematics)
