首页 /研究 /Sensor fusion for semantic segmentation of urban scenes
PERCEPTION

Sensor fusion for semantic segmentation of urban scenes

Richard Zhang, Stefan A. Candra, K. Vetter, Avideh Zakhor

发表年份
2015
引用次数
113

摘要

Semantic understanding of environments is an important problem in robotics in general and intelligent autonomous systems in particular. In this paper, we propose a semantic segmentation algorithm which effectively fuses information from images and 3D point clouds. The proposed method incorporates information from multiple scales in an intuitive and effective manner. A late-fusion architecture is proposed to maximally leverage the training data in each modality. Finally, a pairwise Conditional Random Field (CRF) is used as a post-processing step to enforce spatial consistency in the structured prediction. The proposed algorithm is evaluated on the publicly available KITTI dataset [1] [2], augmented with additional pixel and point-wise semantic labels for building, sky, road, vegetation, sidewalk, car, pedestrian, cyclist, sign/pole, and fence regions. A per-pixel accuracy of 89.3% and average class accuracy of 65.4% is achieved, well above current state-of-the-art [3].

关键词

Leverage (statistics)Conditional random fieldComputer scienceArtificial intelligencePoint cloudSegmentationPixelComputer visionPairwise comparisonConsistency (knowledge bases)

相关论文

查看 PERCEPTION 分类全部论文