首页 /研究 /RGB-D object recognition and pose estimation based on pre-trained convolutional neural network features

MANIPULATION

RGB-D object recognition and pose estimation based on pre-trained convolutional neural network features

Max Schwarz, Hannes Schulz, Sven Behnke

发表年份: 2015
引用次数: 321

摘要

Object recognition and pose estimation from RGB-D images are important tasks for manipulation robots which can be learned from examples. Creating and annotating datasets for learning is expensive, however. We address this problem with transfer learning from deep convolutional neural networks (CNN) that are pre-trained for image categorization and provide a rich, semantically meaningful feature set. We incorporate depth information, which the CNN was not trained with, by rendering objects from a canonical perspective and colorizing the depth channel according to distance from the object center. We evaluate our approach on the Washington RGB-D Objects dataset, where we find that the generated feature set naturally separates classes and instances well and retains pose manifolds. We outperform state-of-the-art on a number of subtasks and show that our approach can yield superior results when only little training data is available.

关键词

Artificial intelligenceComputer scienceConvolutional neural networkPattern recognition (psychology)PoseRGB color modelCategorizationRendering (computer graphics)Computer visionCognitive neuroscience of visual object recognition

RGB-D object recognition and pose estimation based on pre-trained convolutional neural network features

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory