Xiaohong Cai, Ming Li, H. Cao, Jin-gang Ma, Xiaoyan Wang, Xuqiang Zhuang
DOI: 10.1117/12.2604788 (https://doi.org/10.1117/12.2604788)
Published: 2021-08-06. Venue: ... International Workshop on Pattern Recognition in NeuroImaging, volume 57 1, pages 1191307 - 1191307-5. Publication type: Journal Article.
Image classification based on self-attention convolutional neural network
Image classification is one of the most basic and important branches of computer vision, and extracting useful information from images effectively has become increasingly urgent. First, we use a self-attention module that exploits the correlations between features to compute a weighted sum of those features, from which the image category is obtained. The self-attention mechanism is comparatively simple to compute, which greatly reduces the complexity of the model. Second, we propose an optimization strategy for complex CNN (convolutional neural network) models: global average pooling replaces the fully connected layer, which reduces model complexity and produces fewer features. Finally, we verify the feasibility and effectiveness of our model on two data sets.
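The two ideas named in the abstract, correlation-weighted summation of features and global average pooling in place of a fully connected layer, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the abstract gives no architectural details, so the scaled dot-product form of the attention scores, the identity query/key/value projections, and all function names (`self_attention`, `global_average_pool`, `fc_head_params`) are assumptions made only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(features):
    """Replace each feature vector by a correlation-weighted sum of all vectors.

    features: list of n vectors, each a list of d floats.
    Assumption: scaled dot-product scores with identity projections
    (no learned query/key/value weights).
    """
    d = len(features[0])
    scale = math.sqrt(d)
    attended = []
    for query in features:
        # Correlation of this feature with every feature (dot product).
        scores = [sum(q * k for q, k in zip(query, key)) / scale
                  for key in features]
        weights = softmax(scores)
        # Weighted sum of all feature vectors.
        attended.append([sum(w * value[j] for w, value in zip(weights, features))
                         for j in range(d)])
    return attended


def global_average_pool(feature_maps):
    """Reduce C feature maps of size H x W to C scalars (one mean per channel)."""
    return [sum(sum(row) for row in fmap) / (len(fmap) * len(fmap[0]))
            for fmap in feature_maps]


def fc_head_params(channels, height, width, num_classes):
    """Weights plus biases of a fully connected layer on the flattened maps.

    Global average pooling itself has no learnable parameters.
    """
    return channels * height * width * num_classes + num_classes


attended = self_attention([[1.0, 0.0], [0.0, 1.0]])
pooled = global_average_pool([[[1, 2], [3, 4]], [[0, 0], [0, 8]]])
print(attended)
print(pooled)
print(fc_head_params(64, 8, 8, 10))
```

In this sketch each output row of `self_attention` is a convex combination of the input vectors, weighted most heavily toward the features most correlated with the query. `global_average_pool` emits one value per channel regardless of spatial size, whereas the fully connected alternative needs a weight for every (position, channel, class) triple, which is where the reduction in complexity comes from. The 64-channel, 8 x 8, 10-class figures are hypothetical and do not come from the paper.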