{"title":"Improvement of super resolution reconstruction method for real text images","authors":"J. Zhang, Hong Qu","doi":"10.1109/MLISE57402.2022.00082","DOIUrl":null,"url":null,"abstract":"Super resolution refers to the process of restoring a low resolution image to a high resolution image. In recent years, researchers in the field of super-resolution are not satisfied with restoring artificially defined low-resolution images, and try to restore low-resolution images in natural scenes. For this situation, a real scene text super-resolution dataset TextZoom is proposed. It contains one-to-one mapping of low-resolution and high-resolution images captured by the camera, which is more realistic and challenging than manufactured data. A super-resolution network for TextZoom, which is called TSRN is also proposed. By adding channel attention and increasing the proportion of gradient loss function, the overall network pays more attention to the restoration of text and enhances the lines, and finally improves the recognition rate of medium and difficult graded text images in the TextZoom dataset after super-resolution preprocessing.","PeriodicalId":350291,"journal":{"name":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MLISE57402.2022.00082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Super resolution refers to the process of restoring a low resolution image to a high resolution image. In recent years, researchers in the field of super-resolution are not satisfied with restoring artificially defined low-resolution images, and try to restore low-resolution images in natural scenes. For this situation, a real scene text super-resolution dataset TextZoom is proposed. It contains one-to-one mapping of low-resolution and high-resolution images captured by the camera, which is more realistic and challenging than manufactured data. A super-resolution network for TextZoom, which is called TSRN is also proposed. By adding channel attention and increasing the proportion of gradient loss function, the overall network pays more attention to the restoration of text and enhances the lines, and finally improves the recognition rate of medium and difficult graded text images in the TextZoom dataset after super-resolution preprocessing.