{"title":"使用CRNN的OCR:一种文本识别的深度学习方法","authors":"Aditya Yadav, Shauryan Singh, Muzzamil Siddique, Nileshkumar Mehta, Archana Kotangale","doi":"10.1109/INCET57972.2023.10170436","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR) is a widely used technology that converts image text or handwritten text into digital form. However, recognizing handwritten text, printed text, and image text poses a significant challenge due to variations in writing styles and the complexity of characters. This paper proposes a novel approach for OCR using Convolutional Recurrent Neural Network (CRNN) that combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs). The proposed CRNN architecture can automatically learn and extract features from raw image pixels and recognize sequential patterns of characters. This research paper presents a robust OCR system using CRNN architecture with 7 convolutional layers and 2 LSTM layers for recognizing text in images with complex backgrounds and varying fonts. The proposed system achieved state-of-the-art performance on several benchmark datasets, demonstrating the effectiveness of the proposed approach. Our experimental results demonstrate that the proposed CRNN approach is better than other methods and achieves higher accuracy with less latency in recognizing text from an image. We also analyze the impact of different parameters, such as the number of layers, filter sizes, and hidden units, on the performance of the CRNN model. This paper provides a comprehensive study on OCR using CRNN and its potential to improve the accuracy and efficiency of recognizing text.","PeriodicalId":403008,"journal":{"name":"2023 4th International Conference for Emerging Technology (INCET)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"OCR using CRNN: A Deep Learning Approach for Text Recognition\",\"authors\":\"Aditya Yadav, Shauryan Singh, Muzzamil Siddique, Nileshkumar Mehta, Archana Kotangale\",\"doi\":\"10.1109/INCET57972.2023.10170436\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Optical Character Recognition (OCR) is a widely used technology that converts image text or handwritten text into digital form. However, recognizing handwritten text, printed text, and image text poses a significant challenge due to variations in writing styles and the complexity of characters. This paper proposes a novel approach for OCR using Convolutional Recurrent Neural Network (CRNN) that combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs). The proposed CRNN architecture can automatically learn and extract features from raw image pixels and recognize sequential patterns of characters. This research paper presents a robust OCR system using CRNN architecture with 7 convolutional layers and 2 LSTM layers for recognizing text in images with complex backgrounds and varying fonts. The proposed system achieved state-of-the-art performance on several benchmark datasets, demonstrating the effectiveness of the proposed approach. Our experimental results demonstrate that the proposed CRNN approach is better than other methods and achieves higher accuracy with less latency in recognizing text from an image. We also analyze the impact of different parameters, such as the number of layers, filter sizes, and hidden units, on the performance of the CRNN model. This paper provides a comprehensive study on OCR using CRNN and its potential to improve the accuracy and efficiency of recognizing text.\",\"PeriodicalId\":403008,\"journal\":{\"name\":\"2023 4th International Conference for Emerging Technology (INCET)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 4th International Conference for Emerging Technology (INCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INCET57972.2023.10170436\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Conference for Emerging Technology (INCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INCET57972.2023.10170436","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
OCR using CRNN: A Deep Learning Approach for Text Recognition
Optical Character Recognition (OCR) is a widely used technology that converts image text or handwritten text into digital form. However, recognizing handwritten text, printed text, and image text poses a significant challenge due to variations in writing styles and the complexity of characters. This paper proposes a novel approach for OCR using Convolutional Recurrent Neural Network (CRNN) that combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs). The proposed CRNN architecture can automatically learn and extract features from raw image pixels and recognize sequential patterns of characters. This research paper presents a robust OCR system using CRNN architecture with 7 convolutional layers and 2 LSTM layers for recognizing text in images with complex backgrounds and varying fonts. The proposed system achieved state-of-the-art performance on several benchmark datasets, demonstrating the effectiveness of the proposed approach. Our experimental results demonstrate that the proposed CRNN approach is better than other methods and achieves higher accuracy with less latency in recognizing text from an image. We also analyze the impact of different parameters, such as the number of layers, filter sizes, and hidden units, on the performance of the CRNN model. This paper provides a comprehensive study on OCR using CRNN and its potential to improve the accuracy and efficiency of recognizing text.