Xiaohong Cai, Ming Li, H. Cao, Jin-gang Ma, Xiaoyan Wang, Xuqiang Zhuang
{"title":"Image classification based on self-attention convolutional neural network","authors":"Xiaohong Cai, Ming Li, H. Cao, Jin-gang Ma, Xiaoyan Wang, Xuqiang Zhuang","doi":"10.1117/12.2604788","DOIUrl":null,"url":null,"abstract":"Image classification technology is the most basic and important technical branch of computer vision. How to effectively extract effective information from images has become more and more urgent. First, we use the self-attention module to use the correlation between the features to weight and sum the features to get the image category. The self-attention mechanism is simpler to calculate, which greatly reduces the complexity of the model. Secondly, we have also made an optimization strategy for the complex CNN (Convolutional Neural Network) model. This article uses the global average pooling method to replace the fully connected method, which reduces the complexity of the model and generates fewer features. Finally, we verified the feasibility and effectiveness of our model on two data sets.","PeriodicalId":90079,"journal":{"name":"... International Workshop on Pattern Recognition in NeuroImaging. International Workshop on Pattern Recognition in NeuroImaging","volume":"57 1","pages":"1191307 - 1191307-5"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"... International Workshop on Pattern Recognition in NeuroImaging. International Workshop on Pattern Recognition in NeuroImaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2604788","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Image classification technology is the most basic and important technical branch of computer vision. How to effectively extract effective information from images has become more and more urgent. First, we use the self-attention module to use the correlation between the features to weight and sum the features to get the image category. The self-attention mechanism is simpler to calculate, which greatly reduces the complexity of the model. Secondly, we have also made an optimization strategy for the complex CNN (Convolutional Neural Network) model. This article uses the global average pooling method to replace the fully connected method, which reduces the complexity of the model and generates fewer features. Finally, we verified the feasibility and effectiveness of our model on two data sets.