Rail Surface Defects Detection Based on Yolo v5 Integrated with Transformer

2022 IEEE 5th International Conference on Electronics Technology (ICET) Pub Date : 2022-05-13 DOI:10.1109/icet55676.2022.9824255

Qian-mao Hu, Bo Tang, Lin Jiang, Faxun Zhu, Xiaoke Zhao

引用次数: 0

Abstract

The traditional machine vision detection method needs to manually design the characteristics of the target, the feature expression ability is insufficient and the generalization ability is not strong. Deep learning can automatically learn high-level feature information, improve the efficiency and accuracy of image recognition, and has better adaptability and universality. Transformer abandons the structure of CNN with deep neural network mainly based on self-attention mechanism, which can be processed in parallel and has global information. This paper combines CNN with Transformer and integrates transformer’s attention mechanism into Yolo V5 network structure to detect rail surface defects. The AP (average precision) of Type-I and Type-II rail defects reached 99.5% and 97.8% respectively, and FPS (frame per second) reaches 76.92 on RSDDs dataset.

查看原文本刊更多论文

基于Yolo v5集成变压器的钢轨表面缺陷检测

传统的机器视觉检测方法需要人工设计目标的特征，特征表达能力不足，泛化能力不强。深度学习可以自动学习高级特征信息，提高图像识别的效率和准确性，具有较好的适应性和通用性。变压器摒弃了CNN的结构，采用以自关注机制为主的深度神经网络，可以并行处理，具有全局信息。本文将CNN与变压器相结合，将变压器的注意机制集成到Yolo V5网络结构中，检测钢轨表面缺陷。在RSDDs数据集上，i型和ii型钢轨缺陷的平均精度AP分别达到99.5%和97.8%，FPS(帧/秒)达到76.92。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE 5th International Conference on Electronics Technology (ICET)

自引率

0.00%

发文量