Mladen Džida, Davorin Vukadin, M. Šilić, G. Delač, Klemo Vladimir
{"title":"An Overview of State-of-the-art Solutions for Scene Text Detection","authors":"Mladen Džida, Davorin Vukadin, M. Šilić, G. Delač, Klemo Vladimir","doi":"10.23919/MIPRO57284.2023.10159700","DOIUrl":null,"url":null,"abstract":"Scene text detection is a task of identifying text regions and labeling them with bounding boxes in a complex background. It has received a lot of attention recently and has become far from unsolvable due to progress of deep learning for computer vision and also due to rapid development of computer hardware which is able to process complex neural networks. Some of the most common challenges that make this task difficult are irregular text shapes, text interferences, very complex background, different text sizes and low image quality. This paper presents an overview of state-of-the-art solutions for scene text detection where ICDAR 2015 was used as a benchmark dataset. We compare solutions with respect to precision, recall and F-score.","PeriodicalId":177983,"journal":{"name":"2023 46th MIPRO ICT and Electronics Convention (MIPRO)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 46th MIPRO ICT and Electronics Convention (MIPRO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/MIPRO57284.2023.10159700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Scene text detection is a task of identifying text regions and labeling them with bounding boxes in a complex background. It has received a lot of attention recently and has become far from unsolvable due to progress of deep learning for computer vision and also due to rapid development of computer hardware which is able to process complex neural networks. Some of the most common challenges that make this task difficult are irregular text shapes, text interferences, very complex background, different text sizes and low image quality. This paper presents an overview of state-of-the-art solutions for scene text detection where ICDAR 2015 was used as a benchmark dataset. We compare solutions with respect to precision, recall and F-score.