Mladen Džida, Davorin Vukadin, M. Šilić, G. Delač, Klemo Vladimir
{"title":"场景文本检测的最新解决方案概述","authors":"Mladen Džida, Davorin Vukadin, M. Šilić, G. Delač, Klemo Vladimir","doi":"10.23919/MIPRO57284.2023.10159700","DOIUrl":null,"url":null,"abstract":"Scene text detection is a task of identifying text regions and labeling them with bounding boxes in a complex background. It has received a lot of attention recently and has become far from unsolvable due to progress of deep learning for computer vision and also due to rapid development of computer hardware which is able to process complex neural networks. Some of the most common challenges that make this task difficult are irregular text shapes, text interferences, very complex background, different text sizes and low image quality. This paper presents an overview of state-of-the-art solutions for scene text detection where ICDAR 2015 was used as a benchmark dataset. We compare solutions with respect to precision, recall and F-score.","PeriodicalId":177983,"journal":{"name":"2023 46th MIPRO ICT and Electronics Convention (MIPRO)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Overview of State-of-the-art Solutions for Scene Text Detection\",\"authors\":\"Mladen Džida, Davorin Vukadin, M. Šilić, G. Delač, Klemo Vladimir\",\"doi\":\"10.23919/MIPRO57284.2023.10159700\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Scene text detection is a task of identifying text regions and labeling them with bounding boxes in a complex background. It has received a lot of attention recently and has become far from unsolvable due to progress of deep learning for computer vision and also due to rapid development of computer hardware which is able to process complex neural networks. Some of the most common challenges that make this task difficult are irregular text shapes, text interferences, very complex background, different text sizes and low image quality. This paper presents an overview of state-of-the-art solutions for scene text detection where ICDAR 2015 was used as a benchmark dataset. We compare solutions with respect to precision, recall and F-score.\",\"PeriodicalId\":177983,\"journal\":{\"name\":\"2023 46th MIPRO ICT and Electronics Convention (MIPRO)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 46th MIPRO ICT and Electronics Convention (MIPRO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/MIPRO57284.2023.10159700\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 46th MIPRO ICT and Electronics Convention (MIPRO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/MIPRO57284.2023.10159700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Overview of State-of-the-art Solutions for Scene Text Detection
Scene text detection is a task of identifying text regions and labeling them with bounding boxes in a complex background. It has received a lot of attention recently and has become far from unsolvable due to progress of deep learning for computer vision and also due to rapid development of computer hardware which is able to process complex neural networks. Some of the most common challenges that make this task difficult are irregular text shapes, text interferences, very complex background, different text sizes and low image quality. This paper presents an overview of state-of-the-art solutions for scene text detection where ICDAR 2015 was used as a benchmark dataset. We compare solutions with respect to precision, recall and F-score.