Research on weld start point detection and localization using deep learning techniques

IF 7.5 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-07-22 DOI:10.1016/j.engappai.2025.111833

Mingda Huang, Linyuxuan Li, Yixing Meng, Guanhua Zhu, Hu Wu, Xianhai Yang

{"title":"Research on weld start point detection and localization using deep learning techniques","authors":"Mingda Huang, Linyuxuan Li, Yixing Meng, Guanhua Zhu, Hu Wu, Xianhai Yang","doi":"10.1016/j.engappai.2025.111833","DOIUrl":null,"url":null,"abstract":"<div><div>In industrial automated welding, precise positioning of the weld start point is crucial for ensuring weld stability and quality. To address this challenge, this paper proposes an improved weld start point recognition algorithm based on the Real-Time Detection Transformer (RT-DETR), enabling the welding robot to adaptively recognize and initially position the weld start point. Images of the weld start point are captured using a vision system based on the D435i camera, and a dataset is constructed for deep learning training. The algorithm is based on the RT-DETR model, utilizing a Residual Network architecture with 18 layers (RT-DETR-R18), integrating the Context Guided Block (CGBlock) from the Context Guided Network (CGNet) with the BasicBlock of Residual Network 18 (ResNet18) to form a new architecture, Context Guided Residual Network 18 (CG-ResNet18), enhancing the model's global perception and accuracy. Additionally, a Large Separable Kernel Attention (LSKA) mechanism is incorporated into the backbone network to improve applicability and robustness. The hybrid encoder of RT-DETR is further optimized using Group Separable Convolution (GSConv) and Efficient Cross Stage Partial Network Block (VoV-GSCSP) modules, enhancing feature fusion and reducing computational load. Experimental results show that the improved model achieves a mean Average Precision (mAP) of 99.4 %, a 5 % increase from the original, with a 32 % reduction in parameters and a 31 % reduction in computational load. The detection error of the weld start point is below 0.4 mm, meeting the real-time detection and positioning requirements of the welding robot.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111833"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625018354","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

In industrial automated welding, precise positioning of the weld start point is crucial for ensuring weld stability and quality. To address this challenge, this paper proposes an improved weld start point recognition algorithm based on the Real-Time Detection Transformer (RT-DETR), enabling the welding robot to adaptively recognize and initially position the weld start point. Images of the weld start point are captured using a vision system based on the D435i camera, and a dataset is constructed for deep learning training. The algorithm is based on the RT-DETR model, utilizing a Residual Network architecture with 18 layers (RT-DETR-R18), integrating the Context Guided Block (CGBlock) from the Context Guided Network (CGNet) with the BasicBlock of Residual Network 18 (ResNet18) to form a new architecture, Context Guided Residual Network 18 (CG-ResNet18), enhancing the model's global perception and accuracy. Additionally, a Large Separable Kernel Attention (LSKA) mechanism is incorporated into the backbone network to improve applicability and robustness. The hybrid encoder of RT-DETR is further optimized using Group Separable Convolution (GSConv) and Efficient Cross Stage Partial Network Block (VoV-GSCSP) modules, enhancing feature fusion and reducing computational load. Experimental results show that the improved model achieves a mean Average Precision (mAP) of 99.4 %, a 5 % increase from the original, with a 32 % reduction in parameters and a 31 % reduction in computational load. The detection error of the weld start point is below 0.4 mm, meeting the real-time detection and positioning requirements of the welding robot.

查看原文本刊更多论文

基于深度学习技术的焊缝起始点检测与定位研究

在工业自动化焊接中，焊接起点的精确定位是保证焊接稳定性和焊接质量的关键。针对这一挑战，本文提出了一种改进的基于实时检测变压器（RT-DETR）的焊缝起始点识别算法，使焊接机器人能够自适应识别焊缝起始点并进行初始定位。使用基于D435i相机的视觉系统捕获焊缝起点图像，并构建数据集进行深度学习训练。该算法以RT-DETR模型为基础，利用18层残差网络架构（RT-DETR- r18），将Context Guided Network （CGNet）中的Context Guided Block （CGBlock）与残差网络18的BasicBlock （ResNet18）相结合，形成新的架构Context Guided Residual Network 18 (CG-ResNet18)，增强了模型的全局感知和精度。此外，在骨干网中引入了大可分离核注意（Large可分离核注意，LSKA）机制，以提高其适用性和鲁棒性。利用群可分卷积（GSConv）和高效跨阶段部分网络块（VoV-GSCSP）模块对RT-DETR混合编码器进行了进一步优化，增强了特征融合，减少了计算量。实验结果表明，改进模型的平均精度（mAP）达到99.4%，比原模型提高了5%，参数减少了32%，计算量减少了31%。焊缝起点检测误差在0.4 mm以下，满足焊接机器人实时检测定位要求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.