Home /Research /Realtime Global Attention Network for Semantic Segmentation
PERCEPTION

Realtime Global Attention Network for Semantic Segmentation

Xi Mo, Xiangyu Chen

Year
2022
Citations
5

Abstract

In this letter, we proposed an end-to-end realtime global attention neural network (RGANet) for the challenging task of semantic segmentation. Different from the encoding strategy deployed by self-attention paradigms, the proposed global attention module encodes global attention via depthwise convolution and affine transformations. The integration of these global attention modules into a hierarchical architecture maintains high inferential performance. In addition, an improved evaluation metric, namely MGRID, is proposed to alleviate the negative effect of non-convex, widely scattered ground-truth areas. Results from extensive experiments on state-of-the-art architectures for suction region segmentation manifest the leading performance of proposed approaches for robotic monocular visual perception.

Keywords

SegmentationComputer scienceMetric (unit)Task (project management)Affine transformationArtificial intelligenceMonocularConvolution (computer science)Encoding (memory)Artificial neural network

Related papers

Browse all PERCEPTION papers