Rail Surface Defects Detection Based on Yolo v5 Integrated with Transformer

Qian-mao Hu, Bo Tang, Lin Jiang, Faxun Zhu, Xiaoke Zhao
{"title":"Rail Surface Defects Detection Based on Yolo v5 Integrated with Transformer","authors":"Qian-mao Hu, Bo Tang, Lin Jiang, Faxun Zhu, Xiaoke Zhao","doi":"10.1109/icet55676.2022.9824255","DOIUrl":null,"url":null,"abstract":"The traditional machine vision detection method needs to manually design the characteristics of the target, the feature expression ability is insufficient and the generalization ability is not strong. Deep learning can automatically learn high-level feature information, improve the efficiency and accuracy of image recognition, and has better adaptability and universality. Transformer abandons the structure of CNN with deep neural network mainly based on self-attention mechanism, which can be processed in parallel and has global information. This paper combines CNN with Transformer and integrates transformer’s attention mechanism into Yolo V5 network structure to detect rail surface defects. The AP (average precision) of Type-I and Type-II rail defects reached 99.5% and 97.8% respectively, and FPS (frame per second) reaches 76.92 on RSDDs dataset.","PeriodicalId":166358,"journal":{"name":"2022 IEEE 5th International Conference on Electronics Technology (ICET)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 5th International Conference on Electronics Technology (ICET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icet55676.2022.9824255","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The traditional machine vision detection method needs to manually design the characteristics of the target, the feature expression ability is insufficient and the generalization ability is not strong. Deep learning can automatically learn high-level feature information, improve the efficiency and accuracy of image recognition, and has better adaptability and universality. Transformer abandons the structure of CNN with deep neural network mainly based on self-attention mechanism, which can be processed in parallel and has global information. This paper combines CNN with Transformer and integrates transformer’s attention mechanism into Yolo V5 network structure to detect rail surface defects. The AP (average precision) of Type-I and Type-II rail defects reached 99.5% and 97.8% respectively, and FPS (frame per second) reaches 76.92 on RSDDs dataset.
基于Yolo v5集成变压器的钢轨表面缺陷检测
传统的机器视觉检测方法需要人工设计目标的特征,特征表达能力不足,泛化能力不强。深度学习可以自动学习高级特征信息,提高图像识别的效率和准确性,具有较好的适应性和通用性。变压器摒弃了CNN的结构,采用以自关注机制为主的深度神经网络,可以并行处理,具有全局信息。本文将CNN与变压器相结合,将变压器的注意机制集成到Yolo V5网络结构中,检测钢轨表面缺陷。在RSDDs数据集上,i型和ii型钢轨缺陷的平均精度AP分别达到99.5%和97.8%,FPS(帧/秒)达到76.92。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信