首页 /研究 /Multi Modal Semantic Segmentation using Synthetic Data
LEARNING

Multi Modal Semantic Segmentation using Synthetic Data

Kartik Srivastava, Akash Kumar Singh, Guruprasad M. Hegde

发表年份
2019
访问权限
开放获取

摘要

Semantic understanding of scenes in three-dimensional space (3D) is a quintessential part of robotics oriented applications such as autonomous driving as it provides geometric cues such as size, orientation and true distance of separation to objects which are crucial for taking mission critical decisions. As a first step, in this work we investigate the possibility of semantically classifying different parts of a given scene in 3D by learning the underlying geometric context in addition to the texture cues BUT in the absence of labelled real-world datasets. To this end we generate a large number of synthetic scenes, their pixel-wise labels and corresponding 3D representations using CARLA software framework. We then build a deep neural network that learns underlying category specific 3D representation and texture cues from color information of the rendered synthetic scenes. Further on we apply the learned model on different real world datasets to evaluate its performance. Our preliminary investigation of results show that the neural network is able to learn the geometric context from synthetic scenes and effectively apply this knowledge to classify each point of a 3D representation of a scene in real-world.

关键词

cs.CVcs.AI

相关论文

查看 LEARNING 分类全部论文