Shiyu Zhao, Menghua Jiang, Zengwen Li, Changxue Chen, Liang Song
{"title":"图像分类自主学习平台的研究与实现","authors":"Shiyu Zhao, Menghua Jiang, Zengwen Li, Changxue Chen, Liang Song","doi":"10.1109/INSAI56792.2022.00039","DOIUrl":null,"url":null,"abstract":"The image classification task aims to automatically classify image content based on machine learning methods. This task is a basic task in the field of computer vision, which has broad application prospects and great research value. At present, in the case of large-scale corpus annotation, mainstream image classification algorithms based on deep learning have been able to obtain better classification results. In order to achieve the above goals, this paper has carried out the following work. This paper studies two mainstream pre-training models in the field of image classification: one is the CNN network based on residual learning; the other is the Vision Transformer model based on Transformer. And according to the performance comparison of each model in four data sets: MNIST, CIFAR-10, CIFAR-100 and ImageNet under different parameters, the optimal model is selected as the background training model of the system. The experimental results show that the Transformer-based model Vision Transformer has better performance and can be used as the back-end training model of the autonomous learning platform.","PeriodicalId":318264,"journal":{"name":"2022 2nd International Conference on Networking Systems of AI (INSAI)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research and Implementation of Autonomous Learning Platform for Image Classification\",\"authors\":\"Shiyu Zhao, Menghua Jiang, Zengwen Li, Changxue Chen, Liang Song\",\"doi\":\"10.1109/INSAI56792.2022.00039\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The image classification task aims to automatically classify image content based on machine learning methods. This task is a basic task in the field of computer vision, which has broad application prospects and great research value. At present, in the case of large-scale corpus annotation, mainstream image classification algorithms based on deep learning have been able to obtain better classification results. In order to achieve the above goals, this paper has carried out the following work. This paper studies two mainstream pre-training models in the field of image classification: one is the CNN network based on residual learning; the other is the Vision Transformer model based on Transformer. And according to the performance comparison of each model in four data sets: MNIST, CIFAR-10, CIFAR-100 and ImageNet under different parameters, the optimal model is selected as the background training model of the system. The experimental results show that the Transformer-based model Vision Transformer has better performance and can be used as the back-end training model of the autonomous learning platform.\",\"PeriodicalId\":318264,\"journal\":{\"name\":\"2022 2nd International Conference on Networking Systems of AI (INSAI)\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 2nd International Conference on Networking Systems of AI (INSAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INSAI56792.2022.00039\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Networking Systems of AI (INSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INSAI56792.2022.00039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research and Implementation of Autonomous Learning Platform for Image Classification
The image classification task aims to automatically classify image content based on machine learning methods. This task is a basic task in the field of computer vision, which has broad application prospects and great research value. At present, in the case of large-scale corpus annotation, mainstream image classification algorithms based on deep learning have been able to obtain better classification results. In order to achieve the above goals, this paper has carried out the following work. This paper studies two mainstream pre-training models in the field of image classification: one is the CNN network based on residual learning; the other is the Vision Transformer model based on Transformer. And according to the performance comparison of each model in four data sets: MNIST, CIFAR-10, CIFAR-100 and ImageNet under different parameters, the optimal model is selected as the background training model of the system. The experimental results show that the Transformer-based model Vision Transformer has better performance and can be used as the back-end training model of the autonomous learning platform.