Research on weld start point detection and localization using deep learning techniques

IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Mingda Huang, Linyuxuan Li, Yixing Meng, Guanhua Zhu, Hu Wu, Xianhai Yang
{"title":"Research on weld start point detection and localization using deep learning techniques","authors":"Mingda Huang,&nbsp;Linyuxuan Li,&nbsp;Yixing Meng,&nbsp;Guanhua Zhu,&nbsp;Hu Wu,&nbsp;Xianhai Yang","doi":"10.1016/j.engappai.2025.111833","DOIUrl":null,"url":null,"abstract":"<div><div>In industrial automated welding, precise positioning of the weld start point is crucial for ensuring weld stability and quality. To address this challenge, this paper proposes an improved weld start point recognition algorithm based on the Real-Time Detection Transformer (RT-DETR), enabling the welding robot to adaptively recognize and initially position the weld start point. Images of the weld start point are captured using a vision system based on the D435i camera, and a dataset is constructed for deep learning training. The algorithm is based on the RT-DETR model, utilizing a Residual Network architecture with 18 layers (RT-DETR-R18), integrating the Context Guided Block (CGBlock) from the Context Guided Network (CGNet) with the BasicBlock of Residual Network 18 (ResNet18) to form a new architecture, Context Guided Residual Network 18 (CG-ResNet18), enhancing the model's global perception and accuracy. Additionally, a Large Separable Kernel Attention (LSKA) mechanism is incorporated into the backbone network to improve applicability and robustness. The hybrid encoder of RT-DETR is further optimized using Group Separable Convolution (GSConv) and Efficient Cross Stage Partial Network Block (VoV-GSCSP) modules, enhancing feature fusion and reducing computational load. Experimental results show that the improved model achieves a mean Average Precision (mAP) of 99.4 %, a 5 % increase from the original, with a 32 % reduction in parameters and a 31 % reduction in computational load. The detection error of the weld start point is below 0.4 mm, meeting the real-time detection and positioning requirements of the welding robot.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111833"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625018354","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In industrial automated welding, precise positioning of the weld start point is crucial for ensuring weld stability and quality. To address this challenge, this paper proposes an improved weld start point recognition algorithm based on the Real-Time Detection Transformer (RT-DETR), enabling the welding robot to adaptively recognize and initially position the weld start point. Images of the weld start point are captured using a vision system based on the D435i camera, and a dataset is constructed for deep learning training. The algorithm is based on the RT-DETR model, utilizing a Residual Network architecture with 18 layers (RT-DETR-R18), integrating the Context Guided Block (CGBlock) from the Context Guided Network (CGNet) with the BasicBlock of Residual Network 18 (ResNet18) to form a new architecture, Context Guided Residual Network 18 (CG-ResNet18), enhancing the model's global perception and accuracy. Additionally, a Large Separable Kernel Attention (LSKA) mechanism is incorporated into the backbone network to improve applicability and robustness. The hybrid encoder of RT-DETR is further optimized using Group Separable Convolution (GSConv) and Efficient Cross Stage Partial Network Block (VoV-GSCSP) modules, enhancing feature fusion and reducing computational load. Experimental results show that the improved model achieves a mean Average Precision (mAP) of 99.4 %, a 5 % increase from the original, with a 32 % reduction in parameters and a 31 % reduction in computational load. The detection error of the weld start point is below 0.4 mm, meeting the real-time detection and positioning requirements of the welding robot.
基于深度学习技术的焊缝起始点检测与定位研究
在工业自动化焊接中,焊接起点的精确定位是保证焊接稳定性和焊接质量的关键。针对这一挑战,本文提出了一种改进的基于实时检测变压器(RT-DETR)的焊缝起始点识别算法,使焊接机器人能够自适应识别焊缝起始点并进行初始定位。使用基于D435i相机的视觉系统捕获焊缝起点图像,并构建数据集进行深度学习训练。该算法以RT-DETR模型为基础,利用18层残差网络架构(RT-DETR- r18),将Context Guided Network (CGNet)中的Context Guided Block (CGBlock)与残差网络18的BasicBlock (ResNet18)相结合,形成新的架构Context Guided Residual Network 18 (CG-ResNet18),增强了模型的全局感知和精度。此外,在骨干网中引入了大可分离核注意(Large可分离核注意,LSKA)机制,以提高其适用性和鲁棒性。利用群可分卷积(GSConv)和高效跨阶段部分网络块(VoV-GSCSP)模块对RT-DETR混合编码器进行了进一步优化,增强了特征融合,减少了计算量。实验结果表明,改进模型的平均精度(mAP)达到99.4%,比原模型提高了5%,参数减少了32%,计算量减少了31%。焊缝起点检测误差在0.4 mm以下,满足焊接机器人实时检测定位要求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信