Petra Svobodova, Antonin Tomecek, J. Krátký, H. Špačková, Miroslav Klus
{"title":"the Usage of artificial intelligence in recognition of embossed numbers on billet","authors":"Petra Svobodova, Antonin Tomecek, J. Krátký, H. Špačková, Miroslav Klus","doi":"10.37904/metal.2021.4286","DOIUrl":null,"url":null,"abstract":"The article is focused on the recognition and localization of numbers on steel billets. Serial numbers are embossed to each billet. Our automated solution allows product identification without human interaction. There are several problems caused by embossing to the hot steel. First, the numbers are not clearly visible. There is a lot of noise around the serial number which causes shadows and reflections. Next, the surface of the billet is rough with grooves and ridges. These issues affect object detection. As a part of the 4 th Industrial Revolution, artificial intelligence and neural networks are used to automate production. Object recognition identifies which numbers are presented in the image. Another problem occurs when the serial number is located anywhere on the billet surface. The aim is to detect multiple objects in the scene using a single neural network. Our proposed solution is based on an extremely fast and accurate model from a class of deep learning algorithms. To localize and identify individual numbers, the You Only Look Once (YOLO) algorithm is implemented. It predicts bounding boxes and assigns classes from category 0 – 9. The approach is fully automatic and detects embossed numbers in real time. A custom dataset and annotations for train the model is created. Due to the lack of training images, data augmentation is used to extend a dataset by increasing the amount of data.","PeriodicalId":266696,"journal":{"name":"METAL 2021 Conference Proeedings","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"METAL 2021 Conference Proeedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37904/metal.2021.4286","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The article is focused on the recognition and localization of numbers on steel billets. Serial numbers are embossed to each billet. Our automated solution allows product identification without human interaction. There are several problems caused by embossing to the hot steel. First, the numbers are not clearly visible. There is a lot of noise around the serial number which causes shadows and reflections. Next, the surface of the billet is rough with grooves and ridges. These issues affect object detection. As a part of the 4 th Industrial Revolution, artificial intelligence and neural networks are used to automate production. Object recognition identifies which numbers are presented in the image. Another problem occurs when the serial number is located anywhere on the billet surface. The aim is to detect multiple objects in the scene using a single neural network. Our proposed solution is based on an extremely fast and accurate model from a class of deep learning algorithms. To localize and identify individual numbers, the You Only Look Once (YOLO) algorithm is implemented. It predicts bounding boxes and assigns classes from category 0 – 9. The approach is fully automatic and detects embossed numbers in real time. A custom dataset and annotations for train the model is created. Due to the lack of training images, data augmentation is used to extend a dataset by increasing the amount of data.
本文主要研究钢坯上数字的识别与定位问题。每个钢坯上都印有序列号。我们的自动化解决方案允许无需人工交互的产品识别。热钢压花产生的几个问题。首先,这些数字并不清晰可见。序列号周围有很多噪声,会产生阴影和反射。其次,坯料表面粗糙,有沟槽和脊。这些问题影响目标检测。作为第四次工业革命的一部分,人工智能和神经网络被用于自动化生产。物体识别识别哪些数字出现在图像中。当序列号位于钢坯表面的任何位置时,会出现另一个问题。目标是使用单个神经网络检测场景中的多个物体。我们提出的解决方案是基于一个非常快速和准确的模型,从一类深度学习算法。为了定位和识别单个数字,实现了You Only Look Once (YOLO)算法。它预测边界框并从类别0 - 9分配类。该方法是全自动的,并实时检测压纹数字。创建用于训练模型的自定义数据集和注释。由于缺乏训练图像,数据增强是通过增加数据量来扩展数据集。