Leon Landeka, R. Grbić, Matteo Brisinello, M. Herceg
{"title":"通过生成具有弯曲文本实例的图像来改进文本检测","authors":"Leon Landeka, R. Grbić, Matteo Brisinello, M. Herceg","doi":"10.1109/ZINC58345.2023.10174175","DOIUrl":null,"url":null,"abstract":"Modern text detection algorithms rely on deep neural networks, which are trained on labeled datasets to achieve high performance. Despite the increasing popularity of text detection, accurate detection of text in natural images remains a challenging problem due to variations in text size, shape, color, and font. In particular, curved text instances present a unique challenge for detection algorithms, yet they are seldom found in existing text detection datasets. In this paper, we present an approach to improve curved text detection performance by generating synthetic images with curved text instances and polygon bounding regions as annotations. We train a deep neural network-based text detector on these synthetic images and evaluate its performance on test sets. Our findings highlight the importance of utilizing diverse and realistic datasets for training robust text detection systems.","PeriodicalId":383771,"journal":{"name":"2023 Zooming Innovation in Consumer Technologies Conference (ZINC)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving text detection by generating images with curved text instances\",\"authors\":\"Leon Landeka, R. Grbić, Matteo Brisinello, M. Herceg\",\"doi\":\"10.1109/ZINC58345.2023.10174175\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Modern text detection algorithms rely on deep neural networks, which are trained on labeled datasets to achieve high performance. Despite the increasing popularity of text detection, accurate detection of text in natural images remains a challenging problem due to variations in text size, shape, color, and font. In particular, curved text instances present a unique challenge for detection algorithms, yet they are seldom found in existing text detection datasets. In this paper, we present an approach to improve curved text detection performance by generating synthetic images with curved text instances and polygon bounding regions as annotations. We train a deep neural network-based text detector on these synthetic images and evaluate its performance on test sets. Our findings highlight the importance of utilizing diverse and realistic datasets for training robust text detection systems.\",\"PeriodicalId\":383771,\"journal\":{\"name\":\"2023 Zooming Innovation in Consumer Technologies Conference (ZINC)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 Zooming Innovation in Consumer Technologies Conference (ZINC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ZINC58345.2023.10174175\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Zooming Innovation in Consumer Technologies Conference (ZINC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ZINC58345.2023.10174175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving text detection by generating images with curved text instances
Modern text detection algorithms rely on deep neural networks, which are trained on labeled datasets to achieve high performance. Despite the increasing popularity of text detection, accurate detection of text in natural images remains a challenging problem due to variations in text size, shape, color, and font. In particular, curved text instances present a unique challenge for detection algorithms, yet they are seldom found in existing text detection datasets. In this paper, we present an approach to improve curved text detection performance by generating synthetic images with curved text instances and polygon bounding regions as annotations. We train a deep neural network-based text detector on these synthetic images and evaluate its performance on test sets. Our findings highlight the importance of utilizing diverse and realistic datasets for training robust text detection systems.