{"title":"基于改进DBNet的自然场景文本检测算法","authors":"Hui Chen, J. Liu, Weimin Zhou","doi":"10.1109/ICEICT55736.2022.9909509","DOIUrl":null,"url":null,"abstract":"There are many problems in text detection, such as large scale differences of high-resolution image features and poor multi-scale feature fusion, we propose an improved algorithm based on dbnet. On the basis of the feature fusion module, we add a atrous Convolution network with kernel-shared pooling to increase the receptive field, so that higher-level semantic information can be obtained in the feature fusion network, and through the shared kernel, the number of model parameters can be reduced, the computational cost can be reduced, and the detection accuracy can be improved. At the same time, we add the attention mechanism into the residual network to suppress the complex background noise and promote the information interaction between channels. In the loss function, we use dice loss partially to solve the imbalance of positive and negative sample data. Our experimental evaluation is on ICDAR2013 and ICDAR2015 datasets. The experimental results show that the algorithm has a certain improvement in accuracy and F value.","PeriodicalId":179327,"journal":{"name":"2022 IEEE 5th International Conference on Electronic Information and Communication Technology (ICEICT)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Natural Scene Text Detection Algorithm Based on Improved DBNet\",\"authors\":\"Hui Chen, J. Liu, Weimin Zhou\",\"doi\":\"10.1109/ICEICT55736.2022.9909509\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There are many problems in text detection, such as large scale differences of high-resolution image features and poor multi-scale feature fusion, we propose an improved algorithm based on dbnet. On the basis of the feature fusion module, we add a atrous Convolution network with kernel-shared pooling to increase the receptive field, so that higher-level semantic information can be obtained in the feature fusion network, and through the shared kernel, the number of model parameters can be reduced, the computational cost can be reduced, and the detection accuracy can be improved. At the same time, we add the attention mechanism into the residual network to suppress the complex background noise and promote the information interaction between channels. In the loss function, we use dice loss partially to solve the imbalance of positive and negative sample data. Our experimental evaluation is on ICDAR2013 and ICDAR2015 datasets. The experimental results show that the algorithm has a certain improvement in accuracy and F value.\",\"PeriodicalId\":179327,\"journal\":{\"name\":\"2022 IEEE 5th International Conference on Electronic Information and Communication Technology (ICEICT)\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 5th International Conference on Electronic Information and Communication Technology (ICEICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEICT55736.2022.9909509\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 5th International Conference on Electronic Information and Communication Technology (ICEICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEICT55736.2022.9909509","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Natural Scene Text Detection Algorithm Based on Improved DBNet
There are many problems in text detection, such as large scale differences of high-resolution image features and poor multi-scale feature fusion, we propose an improved algorithm based on dbnet. On the basis of the feature fusion module, we add a atrous Convolution network with kernel-shared pooling to increase the receptive field, so that higher-level semantic information can be obtained in the feature fusion network, and through the shared kernel, the number of model parameters can be reduced, the computational cost can be reduced, and the detection accuracy can be improved. At the same time, we add the attention mechanism into the residual network to suppress the complex background noise and promote the information interaction between channels. In the loss function, we use dice loss partially to solve the imbalance of positive and negative sample data. Our experimental evaluation is on ICDAR2013 and ICDAR2015 datasets. The experimental results show that the algorithm has a certain improvement in accuracy and F value.