首页 /研究 /SMSnet: Semantic motion segmentation using deep convolutional neural networks

LEARNING

SMSnet: Semantic motion segmentation using deep convolutional neural networks

Johan Vertens, Abhinav Valada, Wolfram Burgard

发表年份: 2017
引用次数: 69

摘要

Interpreting the semantics and motion of objects are prerequisites for autonomous robots that enable them to reason and operate in dynamic real-world environments. Existing approaches that tackle the problem of semantic motion segmentation consist of long multistage pipelines and typically require several seconds to process each frame. In this paper, we present a novel convolutional neural network architecture that learns to predict both the object label and motion status of each pixel in an image. Given a pair of consecutive images, the network learns to fuse features from self-generated optical flow maps and semantic segmentation kernels to yield pixel-wise semantic motion labels. We also introduce the Cityscapes-Motion dataset which contains over 2,900 manually annotated semantic motion labels, which is the largest dataset of its kind so far. We demonstrate that our network outperforms existing approaches achieving state-of-the-art performance on the KITTI dataset, as well as in the more challenging Cityscapes-Motion dataset while being substantially faster than existing techniques.

关键词

Computer scienceArtificial intelligenceConvolutional neural networkSegmentationSemantics (computer science)Motion (physics)Computer visionFuse (electrical)Optical flowFrame (networking)

SMSnet: Semantic motion segmentation using deep convolutional neural networks

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory