Yibo Xu, Jiongming Su, Fengtao Xiang, Ce Guo, Haoran Ren, Huimin Lu
{"title":"基于视觉解释的深度卷积神经网络迁移学习优化","authors":"Yibo Xu, Jiongming Su, Fengtao Xiang, Ce Guo, Haoran Ren, Huimin Lu","doi":"10.1109/RCAR52367.2021.9517519","DOIUrl":null,"url":null,"abstract":"In image classification tasks, the training of deep convolutional neural networks generally requires a large amount of data, and due to the constraints of environment, resources and time, it is of great practical importance to use fewer training samples to obtain a higher recognition rate in the shortest possible time. A deep convolutional neural network transfer learning optimization method based on visual interpretation is proposed for a specific image classification task. Firstly, we use class activation mapping visualization as a visual interpretation, output the class activation heat map of the validation set images, and analyze the reasons for misrecognition of the images. Secondly, we introduce “feedback” by pre-recognizing and visualizing the optimized dataset with the model trained on the original dataset, selecting the images that have a greater impact on improving the recognition rate, and maximizing the impact of the optimized images on the original model. Finally, the model is retrained on the optimized training set. The experimental results show that this method can effectively improve the recognition rate of the transfer learning model for image classification.","PeriodicalId":232892,"journal":{"name":"2021 IEEE International Conference on Real-time Computing and Robotics (RCAR)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Convolutional Neural Network Transfer Learning Optimization Based on Visual Interpretation\",\"authors\":\"Yibo Xu, Jiongming Su, Fengtao Xiang, Ce Guo, Haoran Ren, Huimin Lu\",\"doi\":\"10.1109/RCAR52367.2021.9517519\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In image classification tasks, the training of deep convolutional neural networks generally requires a large amount of data, and due to the constraints of environment, resources and time, it is of great practical importance to use fewer training samples to obtain a higher recognition rate in the shortest possible time. A deep convolutional neural network transfer learning optimization method based on visual interpretation is proposed for a specific image classification task. Firstly, we use class activation mapping visualization as a visual interpretation, output the class activation heat map of the validation set images, and analyze the reasons for misrecognition of the images. Secondly, we introduce “feedback” by pre-recognizing and visualizing the optimized dataset with the model trained on the original dataset, selecting the images that have a greater impact on improving the recognition rate, and maximizing the impact of the optimized images on the original model. Finally, the model is retrained on the optimized training set. The experimental results show that this method can effectively improve the recognition rate of the transfer learning model for image classification.\",\"PeriodicalId\":232892,\"journal\":{\"name\":\"2021 IEEE International Conference on Real-time Computing and Robotics (RCAR)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Real-time Computing and Robotics (RCAR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RCAR52367.2021.9517519\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Real-time Computing and Robotics (RCAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RCAR52367.2021.9517519","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Convolutional Neural Network Transfer Learning Optimization Based on Visual Interpretation
In image classification tasks, the training of deep convolutional neural networks generally requires a large amount of data, and due to the constraints of environment, resources and time, it is of great practical importance to use fewer training samples to obtain a higher recognition rate in the shortest possible time. A deep convolutional neural network transfer learning optimization method based on visual interpretation is proposed for a specific image classification task. Firstly, we use class activation mapping visualization as a visual interpretation, output the class activation heat map of the validation set images, and analyze the reasons for misrecognition of the images. Secondly, we introduce “feedback” by pre-recognizing and visualizing the optimized dataset with the model trained on the original dataset, selecting the images that have a greater impact on improving the recognition rate, and maximizing the impact of the optimized images on the original model. Finally, the model is retrained on the optimized training set. The experimental results show that this method can effectively improve the recognition rate of the transfer learning model for image classification.