首页 /研究 /WideSegNeXt: Semantic Image Segmentation Using Wide Residual Network and NeXt Dilated Unit
LEARNING

WideSegNeXt: Semantic Image Segmentation Using Wide Residual Network and NeXt Dilated Unit

Yoshiki Nakayama, Huimin Lu, Yujie Li, Tohru Kamiya

发表年份
2020
引用次数
58

摘要

Semantic segmentation is widely applied in autonomous driving, in robotic picking, and for medical purposes. Due to the breakthrough of deep learning in recent years, the fully convolutional network (FCN)-based method has become the de facto standard in semantic segmentation. However, the simple FCN has difficulty in capturing global context information, since the local receptive field is small. Furthermore, there is a problem of low image resolution because of the existence of the pooling layer. In this paper, we address the shortcomings of the FCN by proposing a new architecture called WideSegNeXt, which captures the image context on various spatial scales and is effective in identifying small objects. In addition, there is little loss of position information, since there are no pooling layers in the structure. The proposed method achieves a mean intersection over union (MIoU) of 72.5% and a global accuracy (GA) of 92.4% on the CamVid dataset and achieves higher performance than previous methods without additional input datasets.

关键词

Computer sciencePoolingArtificial intelligenceSegmentationContext (archaeology)ResidualIntersection (aeronautics)Image segmentationPattern recognition (psychology)Deep learning

相关论文

查看 LEARNING 分类全部论文