{"title":"Text Detection based on Deep Learning","authors":"A. Thilagavathy, Karuturi Hemanth Suresh, Katari Tejesh Chowdary, Meka Tejash, Vijayarao Lakshmi Chakradhar","doi":"10.1109/ICIDCA56705.2023.10099672","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition (OCR), a system for automatic recognition, is used in a variety of application sectors to convert text or images into editable data. With the distinct outline and size, printable or typed letters are easy to identify. However, this is never the case while dealing with handwritten text since each person's handwriting is unique. Handwritten text is difficult for OCR to read. This study presents a two-phase method for recognizing and classifying the input data. Convolutional Neural Networks (CNNs) were used as a framework for categorization. The process starts with the recognition of input text. The second stage is to determine the language used to generate the input number. Further, Python is used to analyze the handwritten characters in MNIST database. The simulation results show an error-free recognition rate and extremely high efficiency. The proposed work has achieved a 1.4% training loss, 99% testing accuracy and 99.6% training accuracy.","PeriodicalId":108272,"journal":{"name":"2023 International Conference on Innovative Data Communication Technologies and Application (ICIDCA)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Innovative Data Communication Technologies and Application (ICIDCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIDCA56705.2023.10099672","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Optical Character Recognition (OCR), a system for automatic recognition, is used in a variety of application sectors to convert text or images into editable data. With the distinct outline and size, printable or typed letters are easy to identify. However, this is never the case while dealing with handwritten text since each person's handwriting is unique. Handwritten text is difficult for OCR to read. This study presents a two-phase method for recognizing and classifying the input data. Convolutional Neural Networks (CNNs) were used as a framework for categorization. The process starts with the recognition of input text. The second stage is to determine the language used to generate the input number. Further, Python is used to analyze the handwritten characters in MNIST database. The simulation results show an error-free recognition rate and extremely high efficiency. The proposed work has achieved a 1.4% training loss, 99% testing accuracy and 99.6% training accuracy.