{"title":"Arbitrary Scene Text Detection with Bezier Proposal","authors":"Yuan-Po Chen, Yihong Li","doi":"10.1109/acait53529.2021.9731235","DOIUrl":null,"url":null,"abstract":"Scene text detection is widely studied in natural language processing since 2016, in which arbitrary scene text detection is always the difficulty. At present, to deal with the problem of how to detect arbitrary shape text, the semantic segmentation-based methods are widely used, but the post-processing and label generation operations are complex. Sparse R-CNN is a novel object detection framework with simple process and high accuracy, which can simplify post-processing by bipartite graph matching loss. Therefore, an arbitrary shape text detect method without any post-process based on Bezier proposal with Sparse R-CNN is proposed. Firstly, the feature pyramid network with attention mechanism is used to extract features, and then the processed features go into the Sparse R-CNN detection head to get the score and coordinates, and finally the detection results are visualized according to the score. The results on ICDAR2015 and CTW1500 datasets show that our method can detect arbitrary text effectively, and our method have higher accuracy and higher speed than other methods.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/acait53529.2021.9731235","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Scene text detection is widely studied in natural language processing since 2016, in which arbitrary scene text detection is always the difficulty. At present, to deal with the problem of how to detect arbitrary shape text, the semantic segmentation-based methods are widely used, but the post-processing and label generation operations are complex. Sparse R-CNN is a novel object detection framework with simple process and high accuracy, which can simplify post-processing by bipartite graph matching loss. Therefore, an arbitrary shape text detect method without any post-process based on Bezier proposal with Sparse R-CNN is proposed. Firstly, the feature pyramid network with attention mechanism is used to extract features, and then the processed features go into the Sparse R-CNN detection head to get the score and coordinates, and finally the detection results are visualized according to the score. The results on ICDAR2015 and CTW1500 datasets show that our method can detect arbitrary text effectively, and our method have higher accuracy and higher speed than other methods.