{"title":"Research on scene text detection algorithm based on modified YOLOv5","authors":"Yong Luo, Chunyi Zhao, Fei Zhang","doi":"10.1117/12.2672998","DOIUrl":null,"url":null,"abstract":"Aiming at the problems of low precision, slow speed and the detection problems when the text lines are arranged in any direction in the traditional natural scene text detection method, based on the target detection algorithm YOLOv5, a rotated text detection method with angle classification is proposed——YOLOv5-R, by defining the representation of the rotating rectangle, calculating the method of rotating the IoU, and designing a new loss function to achieve accurate detection of horizontal and oblique text, and tested the effectiveness on the scene text datasets ICDAR2013 and ICDAR2015, after the transformation The algorithm realizes the function of rotating target detection, but there is still some room for improvement in arbitrary shape detection.","PeriodicalId":290902,"journal":{"name":"International Conference on Mechatronics Engineering and Artificial Intelligence","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Mechatronics Engineering and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2672998","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Aiming at the problems of low precision, slow speed and the detection problems when the text lines are arranged in any direction in the traditional natural scene text detection method, based on the target detection algorithm YOLOv5, a rotated text detection method with angle classification is proposed——YOLOv5-R, by defining the representation of the rotating rectangle, calculating the method of rotating the IoU, and designing a new loss function to achieve accurate detection of horizontal and oblique text, and tested the effectiveness on the scene text datasets ICDAR2013 and ICDAR2015, after the transformation The algorithm realizes the function of rotating target detection, but there is still some room for improvement in arbitrary shape detection.