{"title":"TEYSuR -文本提取与YOLO和超分辨率","authors":"Srividya Subramanian, Vineet Kekatpure, Gladina Raymond, Kapil Parab, S. Dugad, Archana Shirke","doi":"10.1109/ICONAT53423.2022.9726124","DOIUrl":null,"url":null,"abstract":"In these times where a myriad of industries are becoming digitized and advanced, text recognition is being incorporated time and again to enhance customer satisfaction, improve accessibility and organize business processes. Conventional Deep Learning based text recognition, also termed as Optical Character Recognition (OCR) is based on learning the shapes of the characters in a language. In Text Extraction with YOLO and Super Resolution (TEYSuR) we propose a novel approach where YOLO, an object detection algorithm is used to localise and classify characters in an input image. The characters in the image are treated as individual objects by the algorithm and text is extracted with high accuracy and speed. In order to cater to input images of various font sizes the system proposes an additional Inference module which resizes images using Super Resolution and yields extremely high accuracy results. The word accuracy attained by the YOLO model was 88% and coupled with the Inference module TEYSuR attained a word accuracy of 96%. Hence, this system can be successfully used for extracting text from images with high efficiency.","PeriodicalId":377501,"journal":{"name":"2022 International Conference for Advancement in Technology (ICONAT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"TEYSuR - Text Extraction with YOLO and Super Resolution\",\"authors\":\"Srividya Subramanian, Vineet Kekatpure, Gladina Raymond, Kapil Parab, S. Dugad, Archana Shirke\",\"doi\":\"10.1109/ICONAT53423.2022.9726124\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In these times where a myriad of industries are becoming digitized and advanced, text recognition is being incorporated time and again to enhance customer satisfaction, improve accessibility and organize business processes. Conventional Deep Learning based text recognition, also termed as Optical Character Recognition (OCR) is based on learning the shapes of the characters in a language. In Text Extraction with YOLO and Super Resolution (TEYSuR) we propose a novel approach where YOLO, an object detection algorithm is used to localise and classify characters in an input image. The characters in the image are treated as individual objects by the algorithm and text is extracted with high accuracy and speed. In order to cater to input images of various font sizes the system proposes an additional Inference module which resizes images using Super Resolution and yields extremely high accuracy results. The word accuracy attained by the YOLO model was 88% and coupled with the Inference module TEYSuR attained a word accuracy of 96%. Hence, this system can be successfully used for extracting text from images with high efficiency.\",\"PeriodicalId\":377501,\"journal\":{\"name\":\"2022 International Conference for Advancement in Technology (ICONAT)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference for Advancement in Technology (ICONAT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICONAT53423.2022.9726124\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference for Advancement in Technology (ICONAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICONAT53423.2022.9726124","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
TEYSuR - Text Extraction with YOLO and Super Resolution
In these times where a myriad of industries are becoming digitized and advanced, text recognition is being incorporated time and again to enhance customer satisfaction, improve accessibility and organize business processes. Conventional Deep Learning based text recognition, also termed as Optical Character Recognition (OCR) is based on learning the shapes of the characters in a language. In Text Extraction with YOLO and Super Resolution (TEYSuR) we propose a novel approach where YOLO, an object detection algorithm is used to localise and classify characters in an input image. The characters in the image are treated as individual objects by the algorithm and text is extracted with high accuracy and speed. In order to cater to input images of various font sizes the system proposes an additional Inference module which resizes images using Super Resolution and yields extremely high accuracy results. The word accuracy attained by the YOLO model was 88% and coupled with the Inference module TEYSuR attained a word accuracy of 96%. Hence, this system can be successfully used for extracting text from images with high efficiency.