CPS: Class-level 6D Pose and Shape Estimation From Monocular Images

Fabian Manhardt, Manuel Nickel, Sven Meier, Luca Minciullo, Nassir Navab

发表年份: 2020
引用次数: 14

摘要

Contemporary monocular 6D pose estimation methods can only cope with a handful of object instances. This naturally limits possible applications as, for instance, robots need to work with hundreds of different objects in a real environment. In this paper, we propose the first deep learning approach for class-wise monocular 6D pose estimation, coupled with metric shape retrieval. We propose a new loss formulation which directly optimizes over all parameters, i.e. 3D orientation, translation, scale and shape at the same time. Instead of decoupling each parameter, we transform the regressed shape, in the form of a point cloud, to 3D and directly measure its metric misalignment. We experimentally demonstrate that we can retrieve precise metric point clouds from a single image, which can also be further processed for e.g. subsequent rendering. Moreover, we show that our new 3D point cloud loss outperforms all baselines and gives overall good results despite the inherent ambiguity due to monocular data.

关键词

Artificial intelligencePoint cloudMonocularComputer scienceComputer visionRendering (computer graphics)PoseAmbiguityMetric (unit)Monocular vision

CPS: Class-level 6D Pose and Shape Estimation From Monocular Images

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory