{"title":"基于卷积神经网络的Kannada-MNIST分类","authors":"Emily Xiaoxuan Gu","doi":"10.1109/ICCECE51280.2021.9342474","DOIUrl":null,"url":null,"abstract":"Before machine learning emerged, tasks that involve the recognition of handwritten characters, which include postcode recognition, document digitalization, and ancient character recognition, all require extensive human effort. Nowadays, various methods have been developed so that such tasks can be handled efficiently and accurately by computers. In 2018, the School of Computer and Information Science of Southwest University and the Research Institute of Yi Nationality at Guizhou University of Engineering Science jointly took lead in the research of ancient Yi language classification by using artificial intelligence technology. A model based on Convolutional Neural Networks (CNN) was developed and achieved a rather high accuracy. This triggered my great curiosity about handwritten character classification using machine learning methods.In 2019, the Kannada-MNIST (K-MNIST) dataset, a well-processed dataset of numerals in the Kannada language, was disseminated. We decided to make use of this dataset for our research in handwritten character classification. A CNN model was primarily developed due to its special architecture that makes it well suited for image classification tasks. This paper focuses on the establishment and experimentation of this CNN model and makes a detailed analysis. We also considered other methodologies such as Logistic Regression and Support Vector Machine for making comparisons, so that we can gain an insight into the performance of our model when compared to other machine learning methods. Through our experiment, the model achieved an accuracy of 98.77% over the testing set of K-MNIST, surpassing all the baselines, and it is effective for the classification of all categories of the dataset (0-9). Thus, we eventually concluded the strong capability of CNN models when performing classification tasks of handwritten characters.","PeriodicalId":229425,"journal":{"name":"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Convolutional Neural Network Based Kannada-MNIST Classification\",\"authors\":\"Emily Xiaoxuan Gu\",\"doi\":\"10.1109/ICCECE51280.2021.9342474\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Before machine learning emerged, tasks that involve the recognition of handwritten characters, which include postcode recognition, document digitalization, and ancient character recognition, all require extensive human effort. Nowadays, various methods have been developed so that such tasks can be handled efficiently and accurately by computers. In 2018, the School of Computer and Information Science of Southwest University and the Research Institute of Yi Nationality at Guizhou University of Engineering Science jointly took lead in the research of ancient Yi language classification by using artificial intelligence technology. A model based on Convolutional Neural Networks (CNN) was developed and achieved a rather high accuracy. This triggered my great curiosity about handwritten character classification using machine learning methods.In 2019, the Kannada-MNIST (K-MNIST) dataset, a well-processed dataset of numerals in the Kannada language, was disseminated. We decided to make use of this dataset for our research in handwritten character classification. A CNN model was primarily developed due to its special architecture that makes it well suited for image classification tasks. This paper focuses on the establishment and experimentation of this CNN model and makes a detailed analysis. We also considered other methodologies such as Logistic Regression and Support Vector Machine for making comparisons, so that we can gain an insight into the performance of our model when compared to other machine learning methods. Through our experiment, the model achieved an accuracy of 98.77% over the testing set of K-MNIST, surpassing all the baselines, and it is effective for the classification of all categories of the dataset (0-9). Thus, we eventually concluded the strong capability of CNN models when performing classification tasks of handwritten characters.\",\"PeriodicalId\":229425,\"journal\":{\"name\":\"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCECE51280.2021.9342474\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCECE51280.2021.9342474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Convolutional Neural Network Based Kannada-MNIST Classification
Before machine learning emerged, tasks that involve the recognition of handwritten characters, which include postcode recognition, document digitalization, and ancient character recognition, all require extensive human effort. Nowadays, various methods have been developed so that such tasks can be handled efficiently and accurately by computers. In 2018, the School of Computer and Information Science of Southwest University and the Research Institute of Yi Nationality at Guizhou University of Engineering Science jointly took lead in the research of ancient Yi language classification by using artificial intelligence technology. A model based on Convolutional Neural Networks (CNN) was developed and achieved a rather high accuracy. This triggered my great curiosity about handwritten character classification using machine learning methods.In 2019, the Kannada-MNIST (K-MNIST) dataset, a well-processed dataset of numerals in the Kannada language, was disseminated. We decided to make use of this dataset for our research in handwritten character classification. A CNN model was primarily developed due to its special architecture that makes it well suited for image classification tasks. This paper focuses on the establishment and experimentation of this CNN model and makes a detailed analysis. We also considered other methodologies such as Logistic Regression and Support Vector Machine for making comparisons, so that we can gain an insight into the performance of our model when compared to other machine learning methods. Through our experiment, the model achieved an accuracy of 98.77% over the testing set of K-MNIST, surpassing all the baselines, and it is effective for the classification of all categories of the dataset (0-9). Thus, we eventually concluded the strong capability of CNN models when performing classification tasks of handwritten characters.