首页 /研究 /Super-Resolution for Monocular Depth Estimation With Multi-Scale Sub-Pixel Convolutions and a Smoothness Constraint
PERCEPTION

Super-Resolution for Monocular Depth Estimation With Multi-Scale Sub-Pixel Convolutions and a Smoothness Constraint

Shiyu Zhao, Lin Zhang, Ying Shen, Shengjie Zhao, Huijuan Zhang

发表年份
2019
引用次数
21
访问权限
开放获取

摘要

Depth estimation from a monocular image is of paramount importance in various vision tasks, such as obstacle detection, robot navigation, and 3D reconstruction. However, how to get an accurate depth map with clear details and a fine resolution remains an unresolved issue. As an attempt to solve this problem, we exploit image super-resolution concepts and techniques for monocular depth estimation and propose a novel CNN-based approach, namely <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$MSCN_{NS}$ </tex-math></inline-formula> , which involves multi-scale sub-pixel convolutions and a neighborhood smoothness constraint. Specifically, <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$MSCN_{NS}$ </tex-math></inline-formula> makes use of sub-pixel convolutions with multi-scale fusions to retrieve a high-resolution depth map with fine details of the scene. Different from previous multi-scale fusion strategies, those multi-scale features come from supervised scale branches of the network. Furthermore, <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$MSCN_{NS}$ </tex-math></inline-formula> incorporates a neighborhood smoothness regularization term to make sure that spatially closer pixels with similar features would have close depth values. The effectiveness and efficiency of <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$MSCN_{NS}$ </tex-math></inline-formula> have been corroborated through extensive experiments conducted on benchmark datasets.

关键词

PixelScale (ratio)Artificial intelligenceSmoothnessComputer scienceMonocularHessian matrixComputer visionResolution (logic)Algorithm

相关论文

查看 PERCEPTION 分类全部论文