Home /Research /GCCN: Geometric Constraint Co-attention Network for 6D Object Pose Estimation
LEARNING

GCCN: Geometric Constraint Co-attention Network for 6D Object Pose Estimation

Yongming Wen, Yiquan Fang, Junhao Cai, Kimwa Tung, Hui Cheng

Year
2021
Citations
13

Abstract

In 6D object pose estimation task, object models are usually available and represented as the point cloud set in canonical object frame, which are important references for estimating object poses to the camera frame. However, directly introducing object models as the prior knowledge (i.e., object model point cloud) will cause potential perturbations and even degenerate pose estimation performance. To make the most of object model priors and eliminate the problem, we present an end-to-end deep learning approach called the Geometric Constraint Co-attention Network (GCCN) for 6D object pose estimation. GCCN is designed to explicitly leverage the object model priors effectively with the co-attention mechanism. We add explicit geometric constraints to a co-attention module to inform the geometric correspondence relationships between points in the scene and object model priors and develop a novel geometric constraint loss to guide the training. In this manner, our method effectively eliminates the side effect of directly introducing the object model priors into the network. Experiments on the YCB-Video and LineMOD datasets demonstrate that our GCCN substantially improves the performance of pose estimation and is robust against heavy occlusions. We also demonstrate that GCCN is accurate and robust enough to be deployed in real-world robotic tasks.

Keywords

PoseArtificial intelligenceComputer scienceComputer visionPrior probabilityLeverage (statistics)Object (grammar)Object model3D pose estimationPoint cloud

Related papers

Browse all LEARNING papers