Patch-Wise Attention Network for Monocular Depth Estimation

Sihaeng Lee, Janghyeon Lee, Byungju Kim, Eojindl Yi, Junmo Kim

Year of publication: 2021
Citations: 66

Abstract

In computer vision, monocular depth estimation is the problem of obtaining a high-quality depth map from a two-dimensional image. This map provides information on three-dimensional scene geometry, which is necessary for various applications in academia and industry, such as robotics and autonomous driving. Recent studies based on convolutional neural networks achieved impressive results for this task. However, most previous studies did not consider the relationships between the neighboring pixels in a local area of the scene. To overcome the drawbacks of existing methods, we propose a patch-wise attention method for focusing on each local area. After extracting patches from an input feature map, our module generates attention maps for each local patch, using two attention modules for each patch along the channel and spatial dimensions. Subsequently, the attention maps return to their initial positions and merge into one attention feature. Our method is straightforward but effective. The experimental results on two challenging datasets, KITTI and NYU Depth V2, demonstrate that the proposed method achieves significant performance. Furthermore, our method outperforms other state-of-the-art methods on the KITTI depth estimation benchmark.
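The patch-wise pipeline described in the abstract (extract patches, gate each patch along the channel and spatial dimensions, then return the results to their original positions) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes non-overlapping square patches, and it replaces the learned attention modules with parameter-free sigmoid gates over mean activations, applied in parallel. The function name `patch_wise_attention` and the `patch` parameter are hypothetical, chosen only for this illustration.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def patch_wise_attention(feat, patch):
    """Apply per-patch channel and spatial gating to a feature map.

    feat  : nested list of floats with shape [C][H][W]
    patch : side length of each square patch (must divide H and W)

    Assumption: the gates are parameter-free stand-ins (sigmoid of a mean)
    for the learned attention modules mentioned in the abstract.
    """
    C, H, W = len(feat), len(feat[0]), len(feat[0][0])
    if H % patch or W % patch:
        raise ValueError("patch size must divide height and width")

    out = [[[0.0] * W for _ in range(H)] for _ in range(C)]
    area = patch * patch

    for top in range(0, H, patch):
        for left in range(0, W, patch):
            rows = range(top, top + patch)
            cols = range(left, left + patch)

            # Channel gate: one weight per channel, computed only from
            # the activations inside this patch.
            ch_gate = [
                sigmoid(sum(feat[c][y][x] for y in rows for x in cols) / area)
                for c in range(C)
            ]

            for y in rows:
                for x in cols:
                    # Spatial gate: one weight per position, computed from
                    # the mean across channels at that position.
                    sp_gate = sigmoid(sum(feat[c][y][x] for c in range(C)) / C)
                    for c in range(C):
                        # Writing to (c, y, x) returns the gated value to
                        # its original position in the merged output.
                        out[c][y][x] = feat[c][y][x] * ch_gate[c] * sp_gate

    return out


# Usage: a 2-channel 4x4 map split into four 2x2 patches.
feature_map = [[[1.0] * 4 for _ in range(4)] for _ in range(2)]
attended = patch_wise_attention(feature_map, patch=2)
```

Because each gate is computed only from values inside its own patch, changing an activation in one patch leaves the output of every other patch unchanged, which is the locality property the abstract emphasizes.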

Keywords

Artificial intelligence; Computer science; Monocular; Merge (version control); Computer vision; Convolutional neural network; Benchmark (surveying); Depth map; Feature (linguistics); Pattern recognition (psychology)
