Varshitha Vankadaru, P. Srinivasu, Singavarapu Hemanth Hari Prasad, P. Rohit, Pydipamula Rohan Babu, Matta Deva Chandra Raju
{"title":"Text Identification from Handwritten Data using Bi-LSTM and CNN with FastAI","authors":"Varshitha Vankadaru, P. Srinivasu, Singavarapu Hemanth Hari Prasad, P. Rohit, Pydipamula Rohan Babu, Matta Deva Chandra Raju","doi":"10.1109/ICIDCA56705.2023.10099715","DOIUrl":null,"url":null,"abstract":"Text extraction is critical for any analysis in a document processing system. Text extraction is the process of recognizing text data from an image. The handcrafted elements used by traditional handwriting recognition systems require a lot of prior knowledge. Convolutional approaches can be used to train optical character recognition (OCR) systems, although doing so requires a lot of training data. Deep learning approaches are the main focus of handwriting recognition research, which has recently produced ground-breaking results. But the exponential expansion of handwritten text and the accessibility of vast computational power need an improvement in predictive performance and more study. To enable the automatic extraction of distinguishing features from handwritten characters and phrases, Convolutional Neural Networks (CNNs), a subset of Deep Learning technology, are especially adept at comprehending the structure of handwritten letters and phrases. The disadvantages of this approach include increased time and resource requirements. The proposed design is based on CNN with Bi-LSTM, is used to identify the text from the handwritten images.","PeriodicalId":108272,"journal":{"name":"2023 International Conference on Innovative Data Communication Technologies and Application (ICIDCA)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Innovative Data Communication Technologies and Application (ICIDCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIDCA56705.2023.10099715","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Text extraction is critical for any analysis in a document processing system. Text extraction is the process of recognizing text data from an image. The handcrafted elements used by traditional handwriting recognition systems require a lot of prior knowledge. Convolutional approaches can be used to train optical character recognition (OCR) systems, although doing so requires a lot of training data. Deep learning approaches are the main focus of handwriting recognition research, which has recently produced ground-breaking results. But the exponential expansion of handwritten text and the accessibility of vast computational power need an improvement in predictive performance and more study. To enable the automatic extraction of distinguishing features from handwritten characters and phrases, Convolutional Neural Networks (CNNs), a subset of Deep Learning technology, are especially adept at comprehending the structure of handwritten letters and phrases. The disadvantages of this approach include increased time and resource requirements. The proposed design is based on CNN with Bi-LSTM, is used to identify the text from the handwritten images.