Guanqun Su , Shuai Zhao , Tao Li , Shengyong Liu , Yaqi Li , Guanglong Zhao , Zhongtao Li
{"title":"基于深度学习的图像格式管道和仪器图识别方法","authors":"Guanqun Su , Shuai Zhao , Tao Li , Shengyong Liu , Yaqi Li , Guanglong Zhao , Zhongtao Li","doi":"10.1016/j.birob.2023.100142","DOIUrl":null,"url":null,"abstract":"<div><p>In this study, we proposed a recognition method based on deep artificial neural networks to identify various elements in pipelines and instrumentation diagrams (P&ID) in image formats, such as symbols, texts, and pipelines. Presently, the P&ID image format is recognized manually, and there is a problem with a high recognition error rate; therefore, automation of the above process is an important issue in the processing plant industry. The China National Offshore Petrochemical Engineering Co. provided the image set used in this study, which contains 51 P&ID drawings in the PDF. We converted the PDF P&ID drawings to PNG P&IDs with an image size of 8410 × 5940. In addition, we used labeling software to annotate the images, divided the dataset into training and test sets in a 3:1 ratio, and deployed a deep neural network for recognition. The method proposed in this study is divided into three steps. The first step segments the images and recognizes symbols using YOLOv5 + SE. The second step determines text regions using character region awareness for text detection, and performs character recognition within the text region using the optical character recognition technique. The third step is pipeline recognition using YOLOv5 + SE. The symbol recognition accuracy was 94.52%, and the recall rate was 93.27%. The recognition accuracy in the text positioning stage was 97.26% and the recall rate was 90.27%. The recognition accuracy in the character recognition stage was 90.03% and the recall rate was 91.87%. The pipeline identification accuracy was 92.9%, and the recall rate was 90.36%.</p></div>","PeriodicalId":100184,"journal":{"name":"Biomimetic Intelligence and Robotics","volume":"4 1","pages":"Article 100142"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667379723000566/pdfft?md5=9d3473b5d2acdf3a606cb65e7ef087e9&pid=1-s2.0-S2667379723000566-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Image format pipeline and instrument diagram recognition method based on deep learning\",\"authors\":\"Guanqun Su , Shuai Zhao , Tao Li , Shengyong Liu , Yaqi Li , Guanglong Zhao , Zhongtao Li\",\"doi\":\"10.1016/j.birob.2023.100142\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In this study, we proposed a recognition method based on deep artificial neural networks to identify various elements in pipelines and instrumentation diagrams (P&ID) in image formats, such as symbols, texts, and pipelines. Presently, the P&ID image format is recognized manually, and there is a problem with a high recognition error rate; therefore, automation of the above process is an important issue in the processing plant industry. The China National Offshore Petrochemical Engineering Co. provided the image set used in this study, which contains 51 P&ID drawings in the PDF. We converted the PDF P&ID drawings to PNG P&IDs with an image size of 8410 × 5940. In addition, we used labeling software to annotate the images, divided the dataset into training and test sets in a 3:1 ratio, and deployed a deep neural network for recognition. The method proposed in this study is divided into three steps. The first step segments the images and recognizes symbols using YOLOv5 + SE. The second step determines text regions using character region awareness for text detection, and performs character recognition within the text region using the optical character recognition technique. The third step is pipeline recognition using YOLOv5 + SE. The symbol recognition accuracy was 94.52%, and the recall rate was 93.27%. The recognition accuracy in the text positioning stage was 97.26% and the recall rate was 90.27%. The recognition accuracy in the character recognition stage was 90.03% and the recall rate was 91.87%. The pipeline identification accuracy was 92.9%, and the recall rate was 90.36%.</p></div>\",\"PeriodicalId\":100184,\"journal\":{\"name\":\"Biomimetic Intelligence and Robotics\",\"volume\":\"4 1\",\"pages\":\"Article 100142\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2667379723000566/pdfft?md5=9d3473b5d2acdf3a606cb65e7ef087e9&pid=1-s2.0-S2667379723000566-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomimetic Intelligence and Robotics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2667379723000566\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomimetic Intelligence and Robotics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667379723000566","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在这项研究中,我们提出了一种基于深度人工神经网络的识别方法,用于识别符号、文本和管道等图像格式中管道和仪表图(P&ID)的各种元素。目前,P&ID 图像格式需要人工识别,存在识别错误率高的问题,因此,上述过程的自动化是加工厂行业的一个重要问题。中国海洋石油化工工程有限公司提供了本研究使用的图像集,其中包含 51 张 PDF 格式的 P&ID 图纸。我们将 PDF 格式的 P&ID 图纸转换为 PNG 格式的 P&ID 图纸,图像大小为 8410 × 5940。此外,我们使用标注软件对图像进行标注,将数据集按 3:1 的比例分为训练集和测试集,并部署深度神经网络进行识别。本研究提出的方法分为三个步骤。第一步使用 YOLOv5 + SE 对图像进行分割并识别符号。第二步使用字符区域感知确定文本区域,进行文本检测,并使用光学字符识别技术在文本区域内进行字符识别。第三步是使用 YOLOv5 + SE 进行流水线识别。符号识别准确率为 94.52%,召回率为 93.27%。文本定位阶段的识别准确率为 97.26%,召回率为 90.27%。字符识别阶段的识别准确率为 90.03%,召回率为 91.87%。管道识别准确率为 92.9%,召回率为 90.36%。
Image format pipeline and instrument diagram recognition method based on deep learning
In this study, we proposed a recognition method based on deep artificial neural networks to identify various elements in pipelines and instrumentation diagrams (P&ID) in image formats, such as symbols, texts, and pipelines. Presently, the P&ID image format is recognized manually, and there is a problem with a high recognition error rate; therefore, automation of the above process is an important issue in the processing plant industry. The China National Offshore Petrochemical Engineering Co. provided the image set used in this study, which contains 51 P&ID drawings in the PDF. We converted the PDF P&ID drawings to PNG P&IDs with an image size of 8410 × 5940. In addition, we used labeling software to annotate the images, divided the dataset into training and test sets in a 3:1 ratio, and deployed a deep neural network for recognition. The method proposed in this study is divided into three steps. The first step segments the images and recognizes symbols using YOLOv5 + SE. The second step determines text regions using character region awareness for text detection, and performs character recognition within the text region using the optical character recognition technique. The third step is pipeline recognition using YOLOv5 + SE. The symbol recognition accuracy was 94.52%, and the recall rate was 93.27%. The recognition accuracy in the text positioning stage was 97.26% and the recall rate was 90.27%. The recognition accuracy in the character recognition stage was 90.03% and the recall rate was 91.87%. The pipeline identification accuracy was 92.9%, and the recall rate was 90.36%.