{"title":"基于多分支卷积网络的少弹分类","authors":"Jie Hua, Xueliang Liu","doi":"10.1145/3476098.3485053","DOIUrl":null,"url":null,"abstract":"Few-shot learning aims to complete the classification by only a small number of samples. In many few-shot learning frameworks, relation network is an end-to-end method, which can identify new categories through a small number of label samples based on metric learning. However, a simple feature extractor is used in this method, which limits the further improvement of the classification accuracy. To solve this problem, this paper proposes a multi-branch convolution network for feature extraction. This method combines the training strategies of multi-scale feature extraction, relation network, receptive field block and meta-learning. Firstly, the multi-scale feature vectors of the input image are extracted from the multi-branch convolution network. Then the feature vectors from the support set and the prediction set are input into the relation model, while the receptive field block is employed to improve the measurement ability of the network. Finally, the classification of the testing samples are realized according to the similarity score. In this paper, the effectiveness of the proposed model is verified on Omniglot and MiniImageNet datasets. The experimental results show that the classification accuracy of the proposed model is higher than that of other classical few-shot learning models.","PeriodicalId":390904,"journal":{"name":"Multimedia Understanding with Less Labeling on Multimedia Understanding with Less Labeling","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Branch Convolution Network for Few-Shot Classification\",\"authors\":\"Jie Hua, Xueliang Liu\",\"doi\":\"10.1145/3476098.3485053\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Few-shot learning aims to complete the classification by only a small number of samples. In many few-shot learning frameworks, relation network is an end-to-end method, which can identify new categories through a small number of label samples based on metric learning. However, a simple feature extractor is used in this method, which limits the further improvement of the classification accuracy. To solve this problem, this paper proposes a multi-branch convolution network for feature extraction. This method combines the training strategies of multi-scale feature extraction, relation network, receptive field block and meta-learning. Firstly, the multi-scale feature vectors of the input image are extracted from the multi-branch convolution network. Then the feature vectors from the support set and the prediction set are input into the relation model, while the receptive field block is employed to improve the measurement ability of the network. Finally, the classification of the testing samples are realized according to the similarity score. In this paper, the effectiveness of the proposed model is verified on Omniglot and MiniImageNet datasets. The experimental results show that the classification accuracy of the proposed model is higher than that of other classical few-shot learning models.\",\"PeriodicalId\":390904,\"journal\":{\"name\":\"Multimedia Understanding with Less Labeling on Multimedia Understanding with Less Labeling\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Multimedia Understanding with Less Labeling on Multimedia Understanding with Less Labeling\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3476098.3485053\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multimedia Understanding with Less Labeling on Multimedia Understanding with Less Labeling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3476098.3485053","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-Branch Convolution Network for Few-Shot Classification
Few-shot learning aims to complete the classification by only a small number of samples. In many few-shot learning frameworks, relation network is an end-to-end method, which can identify new categories through a small number of label samples based on metric learning. However, a simple feature extractor is used in this method, which limits the further improvement of the classification accuracy. To solve this problem, this paper proposes a multi-branch convolution network for feature extraction. This method combines the training strategies of multi-scale feature extraction, relation network, receptive field block and meta-learning. Firstly, the multi-scale feature vectors of the input image are extracted from the multi-branch convolution network. Then the feature vectors from the support set and the prediction set are input into the relation model, while the receptive field block is employed to improve the measurement ability of the network. Finally, the classification of the testing samples are realized according to the similarity score. In this paper, the effectiveness of the proposed model is verified on Omniglot and MiniImageNet datasets. The experimental results show that the classification accuracy of the proposed model is higher than that of other classical few-shot learning models.