PERCEPTION

Transformer-based Adaptive Interactive Promotion Network for RGB-T Salient Object Detection

Jinchao Zhu, Xiaoyu Zhang, Feng Dong, Siyu Yan, Xianbang Meng, Yuehua Li, Panlong Tan

Publication year
2022
Citations
6

Abstract

RGB-Thermal salient object detection (RGB-T SOD) aims to segment the most salient objects by combining visible-light and thermal infrared images. Adding thermal infrared images helps improve the accuracy of robot decision-making in complex visual tasks. How to exploit multi-modal complementarity, identify the dominant modality's information, and better localize objects remains an open problem. In this paper, we propose an adaptive interaction promotion network (AIPNet). Specifically, we design a modal interaction module (MIM) with two parallel units to fuse the two modal features extracted by the encoders. The spatial interaction unit (SIU) directly performs modal interaction and integration. The self-reinforcement unit (SRU) enhances the two single-modal features and amplifies the role of the dominant modal features. In addition, we apply a query-location module (QLM) to high-level features to accurately locate salient objects. Finally, we adopt a re-calibration dual branch decoder (RCDB) to integrate the output features. Extensive experiments on RGB-T and RGB-D SOD datasets demonstrate that the proposed method performs favorably against 13 other state-of-the-art methods.
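
The abstract gives no implementation details for the MIM, so the following is only an illustrative sketch of the general idea of fusing two modal feature maps through two parallel units. Every function name and every formula here is an assumption made for illustration (a shared spatial attention map standing in for the SIU, and per-modality channel gating plus softmax dominance weights standing in for the SRU); it should not be read as the authors' architecture.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def spatial_interaction(rgb, thermal):
    """Hypothetical stand-in for the SIU (assumed form, not from the paper).

    Builds one spatial attention map from the element-wise product of the
    two modalities and uses it to gate their sum. Inputs are (C, H, W).
    """
    attn = sigmoid((rgb * thermal).mean(axis=0, keepdims=True))  # (1, H, W)
    return attn * (rgb + thermal)


def channel_gate(x):
    """Assumed self-enhancement: scale each channel by a gate in (0, 1)."""
    gate = sigmoid(x.mean(axis=(1, 2), keepdims=True))  # (C, 1, 1)
    return x * gate


def self_reinforcement(rgb, thermal):
    """Hypothetical stand-in for the SRU (assumed form, not from the paper).

    Enhances each modality on its own, then weights the two by a softmax
    over their mean absolute activation, so the stronger ("dominant")
    modality contributes more.
    """
    rgb_e, thermal_e = channel_gate(rgb), channel_gate(thermal)
    scores = np.array([np.abs(rgb_e).mean(), np.abs(thermal_e).mean()])
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights[0] * rgb_e + weights[1] * thermal_e, weights


def modal_interaction(rgb, thermal):
    """Run both units in parallel and sum their outputs (assumed merge rule)."""
    fused = spatial_interaction(rgb, thermal)
    reinforced, weights = self_reinforcement(rgb, thermal)
    return fused + reinforced, weights


# Toy usage: random features, with the thermal map given a larger magnitude.
rng = np.random.default_rng(0)
rgb_feat = rng.normal(size=(4, 8, 8))
thermal_feat = 3.0 * rng.normal(size=(4, 8, 8))
out, dominance = modal_interaction(rgb_feat, thermal_feat)
```

In this toy setup the thermal features have the larger magnitude, so the second dominance weight comes out larger; a trained network would learn such weights rather than derive them from activation magnitudes.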

Keywords

Computer science, RGB color model, Artificial intelligence, Computer vision, Salient, Modal, Pattern recognition (psychology)
