Terminal strip detection and recognition based on improved YOLOv7-tiny and MAH-CRNN+CTC models

Zhijun Guo, Weiming Luo, Qiujie Chen, Hongbo Zou
{"title":"基于改进的 YOLOv7-tiny 和 MAH-CRNN+CTC 模型的端子条检测和识别","authors":"Zhijun Guo, Weiming Luo, Qiujie Chen, Hongbo Zou","doi":"10.3389/fenrg.2024.1345574","DOIUrl":null,"url":null,"abstract":"For substation secondary circuit terminal strip wiring, low efficiency, less easy fault detection and inspection, and a variety of other issues, this study proposes a text detection and identification model based on improved YOLOv7-tiny and MAH-CRNN+CTC terminal lines. First, the YOLOv7-tiny target detection model is improved by the introduction of the spatially invariant multi-attention mechanism (SimAM) and the weighted bidirectional feature pyramid network (BiFPN). This also improves the feature enhancements and feature fusion ability of the model, balances various scales of characteristic information, and increases the positioning accuracy of the text test box. Then, a multi-head attention hybrid (MAH) mechanism is implemented to optimize the convolutional recurrent neural network with connectionist temporal classification (CRNN+CTC) so that the model could learn data features with larger weights and increase the recognition accuracy of the model. The findings indicate that the enhanced YOLOv7-tiny model achieves 97.39%, 98.62%, and 95.07% of precision, recall, and mean average precision (mAP), respectively, on the detection dataset. The improved MAH-CRNN+CTC model achieves 91.2% character recognition accuracy in the recognition dataset.","PeriodicalId":503838,"journal":{"name":"Frontiers in Energy Research","volume":"61 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Terminal strip detection and recognition based on improved YOLOv7-tiny and MAH-CRNN+CTC models\",\"authors\":\"Zhijun Guo, Weiming Luo, Qiujie Chen, Hongbo Zou\",\"doi\":\"10.3389/fenrg.2024.1345574\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For substation secondary circuit terminal strip wiring, low efficiency, less easy fault detection and inspection, and a variety of other issues, this study proposes a text detection and identification model based on improved YOLOv7-tiny and MAH-CRNN+CTC terminal lines. First, the YOLOv7-tiny target detection model is improved by the introduction of the spatially invariant multi-attention mechanism (SimAM) and the weighted bidirectional feature pyramid network (BiFPN). This also improves the feature enhancements and feature fusion ability of the model, balances various scales of characteristic information, and increases the positioning accuracy of the text test box. Then, a multi-head attention hybrid (MAH) mechanism is implemented to optimize the convolutional recurrent neural network with connectionist temporal classification (CRNN+CTC) so that the model could learn data features with larger weights and increase the recognition accuracy of the model. The findings indicate that the enhanced YOLOv7-tiny model achieves 97.39%, 98.62%, and 95.07% of precision, recall, and mean average precision (mAP), respectively, on the detection dataset. 
The improved MAH-CRNN+CTC model achieves 91.2% character recognition accuracy in the recognition dataset.\",\"PeriodicalId\":503838,\"journal\":{\"name\":\"Frontiers in Energy Research\",\"volume\":\"61 4\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Energy Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fenrg.2024.1345574\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Energy Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fenrg.2024.1345574","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

To address the low efficiency, difficult fault detection, and inconvenient inspection of terminal strip wiring in substation secondary circuits, this study proposes a terminal strip text detection and recognition model based on an improved YOLOv7-tiny and MAH-CRNN+CTC. First, the YOLOv7-tiny object detection model is improved by introducing the spatially invariant multi-attention mechanism (SimAM) and the weighted bidirectional feature pyramid network (BiFPN), which strengthens the model's feature enhancement and feature fusion ability, balances feature information across scales, and increases the positioning accuracy of the text detection boxes. Then, a multi-head attention hybrid (MAH) mechanism is used to optimize the convolutional recurrent neural network with connectionist temporal classification (CRNN+CTC), so that the model learns data features with larger weights and achieves higher recognition accuracy. The results show that the improved YOLOv7-tiny model achieves 97.39% precision, 98.62% recall, and 95.07% mean average precision (mAP) on the detection dataset, and the improved MAH-CRNN+CTC model achieves 91.2% character recognition accuracy on the recognition dataset.
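The abstract names SimAM only at a high level. As a rough, illustrative PyTorch sketch (not the authors' implementation), a parameter-free SimAM block of the kind typically inserted into a YOLOv7-tiny backbone can be written as follows; the regularization constant `e_lambda` is an assumed default.

```python
import torch
import torch.nn as nn

class SimAM(nn.Module):
    """Parameter-free attention: re-weights each activation by an energy-based
    saliency score computed over the spatial dimensions of its channel."""

    def __init__(self, e_lambda: float = 1e-4):  # regularizer; value assumed here
        super().__init__()
        self.e_lambda = e_lambda

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, height, width)
        n = x.shape[2] * x.shape[3] - 1
        # Squared deviation of each activation from its channel-wise spatial mean
        d = (x - x.mean(dim=(2, 3), keepdim=True)).pow(2)
        # Inverse energy: activations that stand out get larger attention weights
        v = d.sum(dim=(2, 3), keepdim=True) / n
        e_inv = d / (4 * (v + self.e_lambda)) + 0.5
        return x * torch.sigmoid(e_inv)

if __name__ == "__main__":
    feat = torch.randn(1, 128, 40, 40)   # a hypothetical YOLOv7-tiny feature map
    print(SimAM()(feat).shape)           # torch.Size([1, 128, 40, 40])
```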
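Likewise, the scale-balancing behaviour attributed to BiFPN comes from its fast normalized fusion, in which each input scale is mixed with a learnable non-negative weight. A minimal sketch of that fusion step, assuming the inputs have already been resized to a common shape:

```python
import torch
import torch.nn as nn

class WeightedFusion(nn.Module):
    """Fast normalized fusion used at a BiFPN node: learnable non-negative
    weights decide how much each input scale contributes."""

    def __init__(self, num_inputs: int, eps: float = 1e-4):
        super().__init__()
        self.weights = nn.Parameter(torch.ones(num_inputs))
        self.eps = eps

    def forward(self, inputs):
        # inputs: list of tensors already resized to the same (B, C, H, W) shape
        w = torch.relu(self.weights)        # keep the fusion weights non-negative
        w = w / (w.sum() + self.eps)        # normalize so they sum to roughly 1
        return sum(wi * xi for wi, xi in zip(w, inputs))
```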
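For the recognition branch, the abstract describes a CRNN+CTC pipeline augmented with a multi-head attention hybrid (MAH) mechanism. The sketch below shows one plausible arrangement (CNN feature extractor, BiLSTM, `nn.MultiheadAttention`, CTC loss); the layer sizes, the residual placement of the attention block, and the 37-class alphabet are assumptions, not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class MAHCRNN(nn.Module):
    """CRNN with a multi-head attention block before the CTC output layer."""

    def __init__(self, num_classes: int, hidden: int = 256, heads: int = 8):
        super().__init__()
        # Convolutional feature extractor (reduced CRNN-style stack; sizes assumed)
        self.cnn = nn.Sequential(
            nn.Conv2d(1, 64, 3, 1, 1), nn.ReLU(), nn.MaxPool2d(2, 2),
            nn.Conv2d(64, 128, 3, 1, 1), nn.ReLU(), nn.MaxPool2d(2, 2),
            nn.Conv2d(128, 256, 3, 1, 1), nn.ReLU(),
            nn.AdaptiveAvgPool2d((1, None)),   # collapse height -> sequence of column features
        )
        self.rnn = nn.LSTM(256, hidden, num_layers=2, bidirectional=True, batch_first=True)
        self.attn = nn.MultiheadAttention(2 * hidden, heads, batch_first=True)
        self.fc = nn.Linear(2 * hidden, num_classes)   # class 0 reserved as the CTC blank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        f = self.cnn(x)                     # (B, 256, 1, W')
        f = f.squeeze(2).permute(0, 2, 1)   # (B, W', 256): one feature per horizontal step
        seq, _ = self.rnn(f)                # (B, W', 2*hidden)
        ctx, _ = self.attn(seq, seq, seq)   # self-attention re-weights informative time steps
        logits = self.fc(seq + ctx)         # residual combination, then per-step class scores
        return logits.log_softmax(-1)       # CTC expects log-probabilities

# Training-step sketch with CTC loss (labels and lengths are placeholders)
model = MAHCRNN(num_classes=37)
images = torch.randn(4, 1, 32, 160)
log_probs = model(images).permute(1, 0, 2)              # nn.CTCLoss wants (T, B, C)
targets = torch.randint(1, 37, (4, 10))
input_lens = torch.full((4,), log_probs.size(0), dtype=torch.long)
target_lens = torch.full((4,), 10, dtype=torch.long)
loss = nn.CTCLoss(blank=0)(log_probs, targets, input_lens, target_lens)
```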