{"title":"AW-PCNN:用于细粒度少镜头学习的自适应加权金字塔卷积神经网络","authors":"Li Hengbai","doi":"10.1109/ICCWAMTIP56608.2022.10016575","DOIUrl":null,"url":null,"abstract":"Due to the unique challenges of high intra-class variation and low inter-class variation, Fine-Grained Visual Classification (FGVC) tasks extremely need to extract multilevel semantic features of fine-grained images for classification. Moreover, labor-intensive annotations and the existence of long-tailed distribution of fine-grained images in real life make fine-grained few-shot learning an urgent problem to be solved. In this paper, we propose an Adaptive Weighting Pyramidal Convolutional Neural Network (AW-PCNN) for fine-grained few-shot learning. Our AW-PCNN consists of a PCNN module and a AW module, which are improved in two aspects. First, our PCNN module extracts the features of each layer in CNN to obtain both high-level global and low-level local subtle features of images to overcome the challenge of FGVC tasks. Second, We employ the metric learning approach for few-shot learning, and our AW module improves it by selecting decisive pairs and adaptively weighting the pairs based on their similarity scores to mitigate the challenge of FGVC tasks and learn a better embedding space. Our AW-PCNN achieves state-of-the-art performance on three benchmark fine-grained datasets, which proves the effectiveness and superiority of our model.","PeriodicalId":159508,"journal":{"name":"2022 19th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AW-PCNN: Adaptive Weighting Pyramidal Convolutional Neural Network for Fine-Grained Few-Shot Learning\",\"authors\":\"Li Hengbai\",\"doi\":\"10.1109/ICCWAMTIP56608.2022.10016575\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to the unique challenges of high intra-class variation and low inter-class variation, Fine-Grained Visual Classification (FGVC) tasks extremely need to extract multilevel semantic features of fine-grained images for classification. Moreover, labor-intensive annotations and the existence of long-tailed distribution of fine-grained images in real life make fine-grained few-shot learning an urgent problem to be solved. In this paper, we propose an Adaptive Weighting Pyramidal Convolutional Neural Network (AW-PCNN) for fine-grained few-shot learning. Our AW-PCNN consists of a PCNN module and a AW module, which are improved in two aspects. First, our PCNN module extracts the features of each layer in CNN to obtain both high-level global and low-level local subtle features of images to overcome the challenge of FGVC tasks. Second, We employ the metric learning approach for few-shot learning, and our AW module improves it by selecting decisive pairs and adaptively weighting the pairs based on their similarity scores to mitigate the challenge of FGVC tasks and learn a better embedding space. Our AW-PCNN achieves state-of-the-art performance on three benchmark fine-grained datasets, which proves the effectiveness and superiority of our model.\",\"PeriodicalId\":159508,\"journal\":{\"name\":\"2022 19th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 19th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCWAMTIP56608.2022.10016575\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 19th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCWAMTIP56608.2022.10016575","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Due to the unique challenges of high intra-class variation and low inter-class variation, Fine-Grained Visual Classification (FGVC) tasks extremely need to extract multilevel semantic features of fine-grained images for classification. Moreover, labor-intensive annotations and the existence of long-tailed distribution of fine-grained images in real life make fine-grained few-shot learning an urgent problem to be solved. In this paper, we propose an Adaptive Weighting Pyramidal Convolutional Neural Network (AW-PCNN) for fine-grained few-shot learning. Our AW-PCNN consists of a PCNN module and a AW module, which are improved in two aspects. First, our PCNN module extracts the features of each layer in CNN to obtain both high-level global and low-level local subtle features of images to overcome the challenge of FGVC tasks. Second, We employ the metric learning approach for few-shot learning, and our AW module improves it by selecting decisive pairs and adaptively weighting the pairs based on their similarity scores to mitigate the challenge of FGVC tasks and learn a better embedding space. Our AW-PCNN achieves state-of-the-art performance on three benchmark fine-grained datasets, which proves the effectiveness and superiority of our model.