基于卷积神经网络的Kannada-MNIST分类

2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE) Pub Date : 2021-01-15 DOI:10.1109/ICCECE51280.2021.9342474

Emily Xiaoxuan Gu

{"title":"基于卷积神经网络的Kannada-MNIST分类","authors":"Emily Xiaoxuan Gu","doi":"10.1109/ICCECE51280.2021.9342474","DOIUrl":null,"url":null,"abstract":"Before machine learning emerged, tasks that involve the recognition of handwritten characters, which include postcode recognition, document digitalization, and ancient character recognition, all require extensive human effort. Nowadays, various methods have been developed so that such tasks can be handled efficiently and accurately by computers. In 2018, the School of Computer and Information Science of Southwest University and the Research Institute of Yi Nationality at Guizhou University of Engineering Science jointly took lead in the research of ancient Yi language classification by using artificial intelligence technology. A model based on Convolutional Neural Networks (CNN) was developed and achieved a rather high accuracy. This triggered my great curiosity about handwritten character classification using machine learning methods.In 2019, the Kannada-MNIST (K-MNIST) dataset, a well-processed dataset of numerals in the Kannada language, was disseminated. We decided to make use of this dataset for our research in handwritten character classification. A CNN model was primarily developed due to its special architecture that makes it well suited for image classification tasks. This paper focuses on the establishment and experimentation of this CNN model and makes a detailed analysis. We also considered other methodologies such as Logistic Regression and Support Vector Machine for making comparisons, so that we can gain an insight into the performance of our model when compared to other machine learning methods. Through our experiment, the model achieved an accuracy of 98.77% over the testing set of K-MNIST, surpassing all the baselines, and it is effective for the classification of all categories of the dataset (0-9). Thus, we eventually concluded the strong capability of CNN models when performing classification tasks of handwritten characters.","PeriodicalId":229425,"journal":{"name":"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Convolutional Neural Network Based Kannada-MNIST Classification\",\"authors\":\"Emily Xiaoxuan Gu\",\"doi\":\"10.1109/ICCECE51280.2021.9342474\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Before machine learning emerged, tasks that involve the recognition of handwritten characters, which include postcode recognition, document digitalization, and ancient character recognition, all require extensive human effort. Nowadays, various methods have been developed so that such tasks can be handled efficiently and accurately by computers. In 2018, the School of Computer and Information Science of Southwest University and the Research Institute of Yi Nationality at Guizhou University of Engineering Science jointly took lead in the research of ancient Yi language classification by using artificial intelligence technology. A model based on Convolutional Neural Networks (CNN) was developed and achieved a rather high accuracy. This triggered my great curiosity about handwritten character classification using machine learning methods.In 2019, the Kannada-MNIST (K-MNIST) dataset, a well-processed dataset of numerals in the Kannada language, was disseminated. We decided to make use of this dataset for our research in handwritten character classification. A CNN model was primarily developed due to its special architecture that makes it well suited for image classification tasks. This paper focuses on the establishment and experimentation of this CNN model and makes a detailed analysis. We also considered other methodologies such as Logistic Regression and Support Vector Machine for making comparisons, so that we can gain an insight into the performance of our model when compared to other machine learning methods. Through our experiment, the model achieved an accuracy of 98.77% over the testing set of K-MNIST, surpassing all the baselines, and it is effective for the classification of all categories of the dataset (0-9). Thus, we eventually concluded the strong capability of CNN models when performing classification tasks of handwritten characters.\",\"PeriodicalId\":229425,\"journal\":{\"name\":\"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCECE51280.2021.9342474\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCECE51280.2021.9342474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

在机器学习出现之前，涉及手写字符识别的任务，包括邮政编码识别、文档数字化和古代字符识别，都需要大量的人力。如今，各种各样的方法已经被开发出来，使得这些任务可以由计算机高效而准确地处理。2018年，西南大学计算机与信息科学学院与贵州工程科技大学彝族研究所联合牵头开展了利用人工智能技术进行古彝语分类的研究。建立了一种基于卷积神经网络(CNN)的模型，并取得了较高的准确率。这引发了我对使用机器学习方法进行手写字符分类的极大好奇。2019年，传播了卡纳达语- mnist (K-MNIST)数据集，这是一个经过精心处理的卡纳达语数字数据集。我们决定利用这个数据集来研究手写字符分类。CNN模型主要是由于其特殊的架构，使其非常适合图像分类任务而开发的。本文重点对该CNN模型的建立和实验进行了详细的分析。我们还考虑了其他方法，如逻辑回归和支持向量机进行比较，这样我们就可以深入了解我们的模型与其他机器学习方法相比的性能。通过我们的实验，该模型在K-MNIST测试集上的准确率达到了98.77%，超过了所有基线，对于数据集(0-9)的所有类别的分类都是有效的。因此，我们最终得出了CNN模型在执行手写体字符分类任务时的强大能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Convolutional Neural Network Based Kannada-MNIST Classification

Before machine learning emerged, tasks that involve the recognition of handwritten characters, which include postcode recognition, document digitalization, and ancient character recognition, all require extensive human effort. Nowadays, various methods have been developed so that such tasks can be handled efficiently and accurately by computers. In 2018, the School of Computer and Information Science of Southwest University and the Research Institute of Yi Nationality at Guizhou University of Engineering Science jointly took lead in the research of ancient Yi language classification by using artificial intelligence technology. A model based on Convolutional Neural Networks (CNN) was developed and achieved a rather high accuracy. This triggered my great curiosity about handwritten character classification using machine learning methods.In 2019, the Kannada-MNIST (K-MNIST) dataset, a well-processed dataset of numerals in the Kannada language, was disseminated. We decided to make use of this dataset for our research in handwritten character classification. A CNN model was primarily developed due to its special architecture that makes it well suited for image classification tasks. This paper focuses on the establishment and experimentation of this CNN model and makes a detailed analysis. We also considered other methodologies such as Logistic Regression and Support Vector Machine for making comparisons, so that we can gain an insight into the performance of our model when compared to other machine learning methods. Through our experiment, the model achieved an accuracy of 98.77% over the testing set of K-MNIST, surpassing all the baselines, and it is effective for the classification of all categories of the dataset (0-9). Thus, we eventually concluded the strong capability of CNN models when performing classification tasks of handwritten characters.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)

自引率

0.00%

发文量