{"title":"基于多尺度网络和加性注意机制的金属表面缺陷检测轻量化RT-DETR模型","authors":"Zongchen Hao, Bo Liu, Binrui Xu","doi":"10.1007/s10921-025-01251-4","DOIUrl":null,"url":null,"abstract":"<div><p>In the industrial production of metals, surface defect detection is crucial for ensuring product quality and optimizing production line efficiency. Although deep learning algorithms are effective for detecting metal surface defects, their complexity can often slow down the detection process. To achieve a balance between detection accuracy and efficiency, this study proposes an enhanced and lightweight Real-Time Detection Transformer (RT-DETR) network and incorporates a multi-scale residual feature extraction (MSRFE) module, termed as MSRFE-RTDETR. The MSRFE module is specifically designed to manage varying defect shapes while reducing the parameter count. To further enhance detection accuracy, a context feature information fusion (CFIF) module is introduced, which integrates deep and shallow features to prevent information loss. Additionally, an efficient encoder based on additive attention (EEAA) is employed to overcome the limitations of matrix multiplication inherent in traditional multi-head attention mechanisms, thereby increasing the model's detection speed. Compared to the baseline model, the proposed algorithm improves the average precision on the public NEU-DET dataset by 2.4%, increases detection speed by 39.69 FPS, and enhances all lightweight metrics. Its generalization is validated on GC10-DET and ASSDD datasets, demonstrating superior performance.</p></div>","PeriodicalId":655,"journal":{"name":"Journal of Nondestructive Evaluation","volume":"44 3","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Lightweight RT-DETR Model for Metal Surface Defect Detection Using Multi-Scale Network and Additive Attention Mechanism\",\"authors\":\"Zongchen Hao, Bo Liu, Binrui Xu\",\"doi\":\"10.1007/s10921-025-01251-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In the industrial production of metals, surface defect detection is crucial for ensuring product quality and optimizing production line efficiency. Although deep learning algorithms are effective for detecting metal surface defects, their complexity can often slow down the detection process. To achieve a balance between detection accuracy and efficiency, this study proposes an enhanced and lightweight Real-Time Detection Transformer (RT-DETR) network and incorporates a multi-scale residual feature extraction (MSRFE) module, termed as MSRFE-RTDETR. The MSRFE module is specifically designed to manage varying defect shapes while reducing the parameter count. To further enhance detection accuracy, a context feature information fusion (CFIF) module is introduced, which integrates deep and shallow features to prevent information loss. Additionally, an efficient encoder based on additive attention (EEAA) is employed to overcome the limitations of matrix multiplication inherent in traditional multi-head attention mechanisms, thereby increasing the model's detection speed. Compared to the baseline model, the proposed algorithm improves the average precision on the public NEU-DET dataset by 2.4%, increases detection speed by 39.69 FPS, and enhances all lightweight metrics. Its generalization is validated on GC10-DET and ASSDD datasets, demonstrating superior performance.</p></div>\",\"PeriodicalId\":655,\"journal\":{\"name\":\"Journal of Nondestructive Evaluation\",\"volume\":\"44 3\",\"pages\":\"\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Nondestructive Evaluation\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10921-025-01251-4\",\"RegionNum\":3,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, CHARACTERIZATION & TESTING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Nondestructive Evaluation","FirstCategoryId":"88","ListUrlMain":"https://link.springer.com/article/10.1007/s10921-025-01251-4","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, CHARACTERIZATION & TESTING","Score":null,"Total":0}
A Lightweight RT-DETR Model for Metal Surface Defect Detection Using Multi-Scale Network and Additive Attention Mechanism
In the industrial production of metals, surface defect detection is crucial for ensuring product quality and optimizing production line efficiency. Although deep learning algorithms are effective for detecting metal surface defects, their complexity can often slow down the detection process. To achieve a balance between detection accuracy and efficiency, this study proposes an enhanced and lightweight Real-Time Detection Transformer (RT-DETR) network and incorporates a multi-scale residual feature extraction (MSRFE) module, termed as MSRFE-RTDETR. The MSRFE module is specifically designed to manage varying defect shapes while reducing the parameter count. To further enhance detection accuracy, a context feature information fusion (CFIF) module is introduced, which integrates deep and shallow features to prevent information loss. Additionally, an efficient encoder based on additive attention (EEAA) is employed to overcome the limitations of matrix multiplication inherent in traditional multi-head attention mechanisms, thereby increasing the model's detection speed. Compared to the baseline model, the proposed algorithm improves the average precision on the public NEU-DET dataset by 2.4%, increases detection speed by 39.69 FPS, and enhances all lightweight metrics. Its generalization is validated on GC10-DET and ASSDD datasets, demonstrating superior performance.
期刊介绍:
Journal of Nondestructive Evaluation provides a forum for the broad range of scientific and engineering activities involved in developing a quantitative nondestructive evaluation (NDE) capability. This interdisciplinary journal publishes papers on the development of new equipment, analyses, and approaches to nondestructive measurements.