Home /Research /Joint object recognition and pose estimation using a nonlinear view-invariant latent generative model
PERCEPTION

Joint object recognition and pose estimation using a nonlinear view-invariant latent generative model

Amr Bakry, Tarek El-Gaaly, Mohamed Elhoseiny, Ahmed Elgammal

Year
2016
Citations
7

Abstract

Object recognition and pose estimation are two fundamental problems in the field of computer vision. Recognizing objects and their poses/viewpoints are critical components of ample vision and robotic systems. Multiple viewpoints of an object lie on an intrinsic low-dimensional manifold in the input space (i.e. descriptor space). Different objects captured from the same set of viewpoints have manifolds with a common topology. In this paper we utilize this common topology between object manifolds by learning a low-dimensional latent space which non-linearly maps between a common unified manifold and the object manifold in the input space. Using a supervised embedding approach, the latent space is computed and used to jointly infer the category and pose of objects. We empirically validate our model by using multiple inference approaches and testing on multiple challenging datasets. We compare our results with the state-of-the-art and present our increased category recognition and pose estimation accuracy.

Keywords

Artificial intelligencePoseManifold (fluid mechanics)Cognitive neuroscience of visual object recognitionComputer scienceInvariant (physics)EmbeddingObject (grammar)Computer visionPattern recognition (psychology)

Related papers

Browse all PERCEPTION papers