Image format pipeline and instrument diagram recognition method based on deep learning

Guanqun Su, Shuai Zhao, Tao Li, Shengyong Liu, Yaqi Li, Guanglong Zhao, Zhongtao Li
Journal: Biomimetic Intelligence and Robotics, Volume 4, Issue 1, Article 100142
Published: 2023-12-08
DOI: 10.1016/j.birob.2023.100142
Source: https://www.sciencedirect.com/science/article/pii/S2667379723000566
Citations: 0

Abstract


This study proposes a recognition method based on deep artificial neural networks that identifies the elements of piping and instrumentation diagrams (P&IDs) stored in image formats, namely symbols, text, and pipelines. At present, P&ID images are interpreted manually, which leads to a high recognition error rate, so automating this process is an important problem for the process plant industry. The image set used in this study was provided by the China National Offshore Petrochemical Engineering Co. and contains 51 P&ID drawings in PDF format. We converted the PDF drawings to PNG images of 8410 × 5940 pixels, annotated them with labeling software, divided the dataset into training and test sets in a 3:1 ratio, and applied a deep neural network for recognition. The proposed method has three steps. The first step segments the images and recognizes symbols using YOLOv5 + SE. The second step locates text regions using Character Region Awareness for Text Detection (CRAFT) and recognizes the characters within each region using optical character recognition (OCR). The third step recognizes pipelines using YOLOv5 + SE. Symbol recognition achieved an accuracy of 94.52% and a recall of 93.27%. The text positioning stage achieved an accuracy of 97.26% and a recall of 90.27%. The character recognition stage achieved an accuracy of 90.03% and a recall of 91.87%. Pipeline recognition achieved an accuracy of 92.9% and a recall of 90.36%.
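
The abstract does not say how the 8410 × 5940 images are segmented or whether the 3:1 split is applied to whole drawings or to image patches. The following Python sketch shows one plausible reading, under stated assumptions: each drawing is cut into overlapping square tiles small enough for a detector input, and the 51 drawings are shuffled and split 3:1. The tile size (1280), the overlap (128), the file names, and the function names `tile_boxes` and `split_3_to_1` are all hypothetical and are not taken from the paper.

```python
import random
from typing import Iterator, List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels


def tile_boxes(width: int, height: int,
               tile: int = 1280, overlap: int = 128) -> Iterator[Box]:
    """Yield overlapping tile boxes that together cover a width x height image.

    The tile size and overlap are illustrative assumptions, not values
    reported in the abstract.
    """
    if not 0 <= overlap < tile:
        raise ValueError("overlap must satisfy 0 <= overlap < tile")
    stride = tile - overlap

    def starts(size: int) -> List[int]:
        # A single tile is enough when the image side fits inside one tile.
        if size <= tile:
            return [0]
        positions = list(range(0, size - tile, stride))
        positions.append(size - tile)  # last tile sits flush with the far edge
        return positions

    for top in starts(height):
        for left in starts(width):
            yield (left, top, min(left + tile, width), min(top + tile, height))


def split_3_to_1(items: List[str], seed: int = 0) -> Tuple[List[str], List[str]]:
    """Shuffle reproducibly, then split into training and test sets at 3:1."""
    shuffled = items[:]
    random.Random(seed).shuffle(shuffled)
    cut = (len(shuffled) * 3) // 4
    return shuffled[:cut], shuffled[cut:]


if __name__ == "__main__":
    tiles = list(tile_boxes(8410, 5940))
    print(len(tiles), tiles[0], tiles[-1])

    # Hypothetical file names standing in for the 51 converted drawings.
    drawings = [f"pid_{i:02d}.png" for i in range(51)]
    train, test = split_3_to_1(drawings)
    print(len(train), len(test))
```

The overlap is there so that a symbol cut by one tile boundary still appears whole in a neighbouring tile. Detections from overlapping tiles would then need to be mapped back to full-image coordinates and de-duplicated, a step this sketch leaves out.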
