Home /Research /Deep Learning based Iterative 6D Pose Estimation
PERCEPTION

Deep Learning based Iterative 6D Pose Estimation

Peng Chen, Zhusui Zheng, Wang Dai

Year
2021
Citations
7

Abstract

Estimating the 6D object pose from a single RGB image is a challenging problem in computer vision. But it has various applications in robotics and virtual or augmented reality. Current solutions can be divided into two categories. The first one try to regress the pose parameters directly, but the estimation accuracy is usually not satisfied. The other one try to firstly extract local features and then estimate the pose through RANSAC based PnP algorithm. Since the network is designed for local feature extraction, it is not end-to-end trainable. Meanwhile, the RANSAC process is also time consuming with the presence of noise and outliers. A new architecture for object 6D pose estimation is proposed in this work. It is composed of correspondences extraction part, pose estimation part and the orthogonal iteration part. The first tw'o parts can be treated as an end-to-end trainable network, while the last two parts are used to find the optimal pose parameters without including the RANSAC process. Experimental results show <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">:</sup> that the architecture has persuasive advantage over the commonly applied 6D pose estimation solutions.

Keywords

RANSACPoseArtificial intelligenceOutlierComputer science3D pose estimationObject (grammar)Computer visionProcess (computing)Feature extraction

Related papers

Browse all PERCEPTION papers