首页 /研究 /SANet: A Pure Vision Strip-Aware Network with PSSCA and Multistage Fusion for Weld Seam Detection
PERCEPTION

SANet: A Pure Vision Strip-Aware Network with PSSCA and Multistage Fusion for Weld Seam Detection

Zhijian Zhu, Yang Zhao, Lijie Zhao, Guoli Song, Qinghui Wang

发表年份
2025
引用次数
1
访问权限
开放获取

摘要

Weld seam detection is a fundamental prerequisite for robotic welding automation, yet it remains challenging due to the elongated shape of welds, weak contrast against metallic backgrounds, and significant environmental interference in industrial scenarios. To address these challenges, we propose a novel deep neural network architecture termed SANet (Strip-Aware Network). The model is constructed upon a U-shaped backbone and integrates strip-aware feature modeling with multistage supervision. It mainly consists of two complementary modules: the Paralleled Strip and Spatial Context-Aware (PSSCA) module and the Multistage Fusion (MF) module. The PSSCA module enhances the extraction of elongated strip-like features by combining parallel strip perception with spatial context modeling, thereby improving fine-grained weld seam representation. In addition, SANet integrates the StripPooling attention mechanism as an auxiliary component to enlarge the receptive field along strip directions and enhance feature discrimination under complex backgrounds. Meanwhile, the MF module performs cross-stage feature fusion by aggregating encoder and decoder features at multiple levels, ensuring accurate boundary recovery and robust global-to-local interaction. The weld seam detection task is formulated as a two-dimensional segmentation problem and evaluated on a self-built dataset consisting of over 4000 weld seam images covering diverse industrial scenarios such as pipe joints, trusses, elbows, and furnace structures. Experimental results show that SANet achieves an IoU of 96.23% and a Dice coefficient of 98.07%, surpassing all compared models and demonstrating its superior performance in weld seam detection. These findings validate the effectiveness of the proposed architecture and highlight its potential as a low-cost, flexible, and reliable pure vision solution for intelligent welding applications.

关键词

WeldingSegmentationEncoderWeld poolContext (archaeology)Feature (linguistics)Artificial neural networkFeature extractionPattern recognition (psychology)

相关论文

查看 PERCEPTION 分类全部论文