{"title":"基于深度学习的场景文本检测研究综述","authors":"Yuan Li, Mayire Ibrayim, A. Hamdulla","doi":"10.1109/ICSCDE54196.2021.00079","DOIUrl":null,"url":null,"abstract":"Scene text detection is a general text detection technology, which has become a hot research direction in the field of computer vision and document analysis in recent years, and is widely used in geographic positioning, license plate recognition, unmanned driving, and other fields. Compared with traditional document text detection, scene text changes more dramatically in font, scale, arrangement, and background. Therefore, deep learning technology has become the mainstream method in this field because of its excellent performance, which is helpful to improve the ability of text detection. This paper introduces the main research techniques of natural scene text detection and summarizes the structural characteristics of some classical network models. This paper sorts out, analyzes, and summarizes the running mechanism and performance of various network models for natural scene text detection deep learning-based in recent years. The common public datasets and their application characteristics are listed. Finally, the problems and challenges in scene text detection based on deep learning are discussed and provide an outlook on the future research directions in this field.","PeriodicalId":208108,"journal":{"name":"2021 International Conference of Social Computing and Digital Economy (ICSCDE)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Summary of Scene Text Detection Based on Deep Learning\",\"authors\":\"Yuan Li, Mayire Ibrayim, A. Hamdulla\",\"doi\":\"10.1109/ICSCDE54196.2021.00079\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Scene text detection is a general text detection technology, which has become a hot research direction in the field of computer vision and document analysis in recent years, and is widely used in geographic positioning, license plate recognition, unmanned driving, and other fields. Compared with traditional document text detection, scene text changes more dramatically in font, scale, arrangement, and background. Therefore, deep learning technology has become the mainstream method in this field because of its excellent performance, which is helpful to improve the ability of text detection. This paper introduces the main research techniques of natural scene text detection and summarizes the structural characteristics of some classical network models. This paper sorts out, analyzes, and summarizes the running mechanism and performance of various network models for natural scene text detection deep learning-based in recent years. The common public datasets and their application characteristics are listed. Finally, the problems and challenges in scene text detection based on deep learning are discussed and provide an outlook on the future research directions in this field.\",\"PeriodicalId\":208108,\"journal\":{\"name\":\"2021 International Conference of Social Computing and Digital Economy (ICSCDE)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference of Social Computing and Digital Economy (ICSCDE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSCDE54196.2021.00079\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference of Social Computing and Digital Economy (ICSCDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSCDE54196.2021.00079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Summary of Scene Text Detection Based on Deep Learning
Scene text detection is a general text detection technology, which has become a hot research direction in the field of computer vision and document analysis in recent years, and is widely used in geographic positioning, license plate recognition, unmanned driving, and other fields. Compared with traditional document text detection, scene text changes more dramatically in font, scale, arrangement, and background. Therefore, deep learning technology has become the mainstream method in this field because of its excellent performance, which is helpful to improve the ability of text detection. This paper introduces the main research techniques of natural scene text detection and summarizes the structural characteristics of some classical network models. This paper sorts out, analyzes, and summarizes the running mechanism and performance of various network models for natural scene text detection deep learning-based in recent years. The common public datasets and their application characteristics are listed. Finally, the problems and challenges in scene text detection based on deep learning are discussed and provide an outlook on the future research directions in this field.