{"title":"基于注意力的精细图像分类监督对比学习","authors":"Qian Li, Weining Wu","doi":"10.1007/s10044-024-01317-5","DOIUrl":null,"url":null,"abstract":"<p>To solve the problem of fine-grained image classification performance caused by intra-class diversity and inter-class similarity in fine-grained images, we propose an Attention-based Supervised Contrastive (ASC) algorithm for fine-grained image classification. The method involves three stages: firstly, local parts are generated by a multi-attention module for constructing contrastive objectives to filter useless background information; an attention-based supervised contrastive framework is introduced to pre-train an encoder network and learn generalized features by pulling positive pairs closer while pushing negatives apart. Finally, we use cross-entropy to fine-tune the model pre-trained in the second stage to obtain classification results. Comprehensive experiments on CUB-200-2011, FGVC-Aircraft, and Stanford Cars datasets demonstrate the effectiveness of the proposed method.</p>","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"26 1","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Attention-based supervised contrastive learning on fine-grained image classification\",\"authors\":\"Qian Li, Weining Wu\",\"doi\":\"10.1007/s10044-024-01317-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>To solve the problem of fine-grained image classification performance caused by intra-class diversity and inter-class similarity in fine-grained images, we propose an Attention-based Supervised Contrastive (ASC) algorithm for fine-grained image classification. The method involves three stages: firstly, local parts are generated by a multi-attention module for constructing contrastive objectives to filter useless background information; an attention-based supervised contrastive framework is introduced to pre-train an encoder network and learn generalized features by pulling positive pairs closer while pushing negatives apart. Finally, we use cross-entropy to fine-tune the model pre-trained in the second stage to obtain classification results. Comprehensive experiments on CUB-200-2011, FGVC-Aircraft, and Stanford Cars datasets demonstrate the effectiveness of the proposed method.</p>\",\"PeriodicalId\":54639,\"journal\":{\"name\":\"Pattern Analysis and Applications\",\"volume\":\"26 1\",\"pages\":\"\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-08-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Analysis and Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s10044-024-01317-5\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Analysis and Applications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10044-024-01317-5","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Attention-based supervised contrastive learning on fine-grained image classification
To solve the problem of fine-grained image classification performance caused by intra-class diversity and inter-class similarity in fine-grained images, we propose an Attention-based Supervised Contrastive (ASC) algorithm for fine-grained image classification. The method involves three stages: firstly, local parts are generated by a multi-attention module for constructing contrastive objectives to filter useless background information; an attention-based supervised contrastive framework is introduced to pre-train an encoder network and learn generalized features by pulling positive pairs closer while pushing negatives apart. Finally, we use cross-entropy to fine-tune the model pre-trained in the second stage to obtain classification results. Comprehensive experiments on CUB-200-2011, FGVC-Aircraft, and Stanford Cars datasets demonstrate the effectiveness of the proposed method.
期刊介绍:
The journal publishes high quality articles in areas of fundamental research in intelligent pattern analysis and applications in computer science and engineering. It aims to provide a forum for original research which describes novel pattern analysis techniques and industrial applications of the current technology. In addition, the journal will also publish articles on pattern analysis applications in medical imaging. The journal solicits articles that detail new technology and methods for pattern recognition and analysis in applied domains including, but not limited to, computer vision and image processing, speech analysis, robotics, multimedia, document analysis, character recognition, knowledge engineering for pattern recognition, fractal analysis, and intelligent control. The journal publishes articles on the use of advanced pattern recognition and analysis methods including statistical techniques, neural networks, genetic algorithms, fuzzy pattern recognition, machine learning, and hardware implementations which are either relevant to the development of pattern analysis as a research area or detail novel pattern analysis applications. Papers proposing new classifier systems or their development, pattern analysis systems for real-time applications, fuzzy and temporal pattern recognition and uncertainty management in applied pattern recognition are particularly solicited.