Leon Landeka, R. Grbić, Matteo Brisinello, M. Herceg
{"title":"Improving text detection by generating images with curved text instances","authors":"Leon Landeka, R. Grbić, Matteo Brisinello, M. Herceg","doi":"10.1109/ZINC58345.2023.10174175","DOIUrl":null,"url":null,"abstract":"Modern text detection algorithms rely on deep neural networks, which are trained on labeled datasets to achieve high performance. Despite the increasing popularity of text detection, accurate detection of text in natural images remains a challenging problem due to variations in text size, shape, color, and font. In particular, curved text instances present a unique challenge for detection algorithms, yet they are seldom found in existing text detection datasets. In this paper, we present an approach to improve curved text detection performance by generating synthetic images with curved text instances and polygon bounding regions as annotations. We train a deep neural network-based text detector on these synthetic images and evaluate its performance on test sets. Our findings highlight the importance of utilizing diverse and realistic datasets for training robust text detection systems.","PeriodicalId":383771,"journal":{"name":"2023 Zooming Innovation in Consumer Technologies Conference (ZINC)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Zooming Innovation in Consumer Technologies Conference (ZINC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ZINC58345.2023.10174175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Modern text detection algorithms rely on deep neural networks, which are trained on labeled datasets to achieve high performance. Despite the increasing popularity of text detection, accurate detection of text in natural images remains a challenging problem due to variations in text size, shape, color, and font. In particular, curved text instances present a unique challenge for detection algorithms, yet they are seldom found in existing text detection datasets. In this paper, we present an approach to improve curved text detection performance by generating synthetic images with curved text instances and polygon bounding regions as annotations. We train a deep neural network-based text detector on these synthetic images and evaluate its performance on test sets. Our findings highlight the importance of utilizing diverse and realistic datasets for training robust text detection systems.