PERCEPTION

Attentional Separation-and-Aggregation Network for Self-supervised Depth-Pose Learning in Dynamic Scenes

Feng Gao, Jincheng Yu, Hao Shen, Yu Wang, Huazhong Yang

Year published: 2020
Citations: 7
Access: Open access

Abstract

Learning depth and ego-motion from unlabeled videos via self-supervision from epipolar projection can improve the robustness and accuracy of 3D perception and localization for vision-based robots. However, the rigid projection computed from ego-motion cannot represent all scene points, such as points on moving objects, which leads to false guidance in these regions. To address this problem, we propose an Attentional Separation-and-Aggregation Network (ASANet), which learns to distinguish and extract the scene's static and dynamic characteristics via the attention mechanism. We further propose a novel MotionNet with an ASANet as the encoder, followed by two separate decoders, to estimate the camera's ego-motion and the scene's dynamic motion field. We then introduce an auto-selecting approach that automatically detects moving objects for dynamic-aware learning. Experiments demonstrate that our method achieves state-of-the-art performance on the KITTI benchmark.
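
The rigid projection mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `rigid_reproject`, its arguments, and the pinhole camera model are assumptions made for illustration only. Given a per-pixel depth map, camera intrinsics, and a relative camera pose, it computes where each pixel would land in the other view if the whole scene were static.

```python
import numpy as np

def rigid_reproject(depth, K, T):
    """Reproject every pixel into another view, assuming a fully static scene.

    Hypothetical helper for illustration (pinhole camera assumed).
    depth: (H, W) per-pixel depth of the source frame
    K:     (3, 3) camera intrinsics
    T:     (4, 4) relative camera pose, source frame -> target frame
    Returns an (H, W, 2) array of pixel coordinates in the target view.
    """
    H, W = depth.shape
    u, v = np.meshgrid(np.arange(W), np.arange(H))
    # Homogeneous pixel coordinates, shape (H, W, 3)
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).astype(np.float64)
    # Back-project pixels to 3D points in the source camera frame
    pts = (pix @ np.linalg.inv(K).T) * depth[..., None]
    # Apply the camera's rigid motion (rotation, then translation)
    pts_t = pts @ T[:3, :3].T + T[:3, 3]
    # Project into the target image plane and normalize by depth
    proj = pts_t @ K.T
    return proj[..., :2] / proj[..., 2:3]
```

A point on an independently moving object also shifts between frames because of its own motion, so its true location in the target view differs from the coordinates this function returns. That mismatch is the "false guidance" the abstract describes, and it is what a separately estimated dynamic motion field is meant to account for.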

Keywords

Artificial intelligence, Computer science, Computer vision, Robustness (evolution), Epipolar geometry, Benchmark (surveying), Encoder, Perception, Robot, Motion (physics)
