首页 /研究 /SMSnet: Semantic motion segmentation using deep convolutional neural networks
LEARNING

SMSnet: Semantic motion segmentation using deep convolutional neural networks

Johan Vertens, Abhinav Valada, Wolfram Burgard

发表年份
2017
引用次数
69

摘要

Interpreting the semantics and motion of objects are prerequisites for autonomous robots that enable them to reason and operate in dynamic real-world environments. Existing approaches that tackle the problem of semantic motion segmentation consist of long multistage pipelines and typically require several seconds to process each frame. In this paper, we present a novel convolutional neural network architecture that learns to predict both the object label and motion status of each pixel in an image. Given a pair of consecutive images, the network learns to fuse features from self-generated optical flow maps and semantic segmentation kernels to yield pixel-wise semantic motion labels. We also introduce the Cityscapes-Motion dataset which contains over 2,900 manually annotated semantic motion labels, which is the largest dataset of its kind so far. We demonstrate that our network outperforms existing approaches achieving state-of-the-art performance on the KITTI dataset, as well as in the more challenging Cityscapes-Motion dataset while being substantially faster than existing techniques.

关键词

Computer scienceArtificial intelligenceConvolutional neural networkSegmentationSemantics (computer science)Motion (physics)Computer visionFuse (electrical)Optical flowFrame (networking)

相关论文

查看 LEARNING 分类全部论文