{"title":"古吉拉特语的可变长度数字识别","authors":"Shrey Malvi, Nirmal Patel, Pratikkumar Prajapati","doi":"10.1109/ICAITPR51569.2022.9844182","DOIUrl":null,"url":null,"abstract":"In this paper, we describe a method to perform handwritten digit recognition for Gujarati - a regional Indian language. Our method can handle variable-length inputs, meaning that there are no limitations around the digit length for the input image. To our knowledge, this is the first attempt to do variable length digit classification for the Gujarati language numerals. We outline a two-step method to classify handwritten Gujarati numerals. The first step identifies connected components of the input image and predicts the numeric length of each connected component. The second step predicts the actual number that is contained within each connected component. The final result is a concatenation of individual predictions. Our Convolutional Neural Networks (CNN) architecture for this task has a low number of output classes (e.g. 30 classes for 3 digit classifier). Our method achieves 83.8% test set accuracy for 1 to 4 digit Gujarati numerals. On the NIST19 dataset, our method achieves 96.1% test set accuracy for 2 to 6 digit English numerals.","PeriodicalId":262409,"journal":{"name":"2022 First International Conference on Artificial Intelligence Trends and Pattern Recognition (ICAITPR)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Variable Length Digit Recognition for Gujarati Language\",\"authors\":\"Shrey Malvi, Nirmal Patel, Pratikkumar Prajapati\",\"doi\":\"10.1109/ICAITPR51569.2022.9844182\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we describe a method to perform handwritten digit recognition for Gujarati - a regional Indian language. Our method can handle variable-length inputs, meaning that there are no limitations around the digit length for the input image. To our knowledge, this is the first attempt to do variable length digit classification for the Gujarati language numerals. We outline a two-step method to classify handwritten Gujarati numerals. The first step identifies connected components of the input image and predicts the numeric length of each connected component. The second step predicts the actual number that is contained within each connected component. The final result is a concatenation of individual predictions. Our Convolutional Neural Networks (CNN) architecture for this task has a low number of output classes (e.g. 30 classes for 3 digit classifier). Our method achieves 83.8% test set accuracy for 1 to 4 digit Gujarati numerals. On the NIST19 dataset, our method achieves 96.1% test set accuracy for 2 to 6 digit English numerals.\",\"PeriodicalId\":262409,\"journal\":{\"name\":\"2022 First International Conference on Artificial Intelligence Trends and Pattern Recognition (ICAITPR)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 First International Conference on Artificial Intelligence Trends and Pattern Recognition (ICAITPR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAITPR51569.2022.9844182\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 First International Conference on Artificial Intelligence Trends and Pattern Recognition (ICAITPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAITPR51569.2022.9844182","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Variable Length Digit Recognition for Gujarati Language
In this paper, we describe a method to perform handwritten digit recognition for Gujarati - a regional Indian language. Our method can handle variable-length inputs, meaning that there are no limitations around the digit length for the input image. To our knowledge, this is the first attempt to do variable length digit classification for the Gujarati language numerals. We outline a two-step method to classify handwritten Gujarati numerals. The first step identifies connected components of the input image and predicts the numeric length of each connected component. The second step predicts the actual number that is contained within each connected component. The final result is a concatenation of individual predictions. Our Convolutional Neural Networks (CNN) architecture for this task has a low number of output classes (e.g. 30 classes for 3 digit classifier). Our method achieves 83.8% test set accuracy for 1 to 4 digit Gujarati numerals. On the NIST19 dataset, our method achieves 96.1% test set accuracy for 2 to 6 digit English numerals.