{"title":"一种用于少拍图像分类的改进型推土机距离","authors":"Zhiyu Jin, Zhuohe Tang, Jintao Yan","doi":"10.1109/PHM2022-London52454.2022.00077","DOIUrl":null,"url":null,"abstract":"The problem of few-shot image classification has become a popular research area in the field of imaging. In order to improve the accuracy of few-shot image classification, many learning methods have been proposed, among which there are four main approaches: meta-based learning, data augmentation-based, migration-based learning and metric-based learning. In this paper, we propose a modified earth mover’s distance (MEMD) based on the metric learning approach, which has received much attention due to its simple structure and accurate classification. Similarly, MEMD constructs correlations between image regions, using such correlations to characterise the class of the image. MEMD generates a stream of best matches between image regions, and this stream of matches represents the similarity of the classified images. the MEMD algorithm requires the generation of feature weights for image regions, and EMD uses a mechanism of cross-referenced citations to generate uniform consultation weights. in contrast to EMD, MEMD generates inconsistent reference weights based on the similarity of regions. In dealing with the K-Shot problem, we used a learnable class prototype to characterise the class feature vectors. We conducted comprehensive experiments to validate our improved MEMD algorithm and tested it on four popular few-shot datasets. Namely: miniImageNet, tieredImageNet, Fewshot-CIFAR100 (FC100) and the CUB dataset.","PeriodicalId":269605,"journal":{"name":"2022 Prognostics and Health Management Conference (PHM-2022 London)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Modified Earth Mover’s Distance for Few-Shot Image Classification\",\"authors\":\"Zhiyu Jin, Zhuohe Tang, Jintao Yan\",\"doi\":\"10.1109/PHM2022-London52454.2022.00077\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The problem of few-shot image classification has become a popular research area in the field of imaging. In order to improve the accuracy of few-shot image classification, many learning methods have been proposed, among which there are four main approaches: meta-based learning, data augmentation-based, migration-based learning and metric-based learning. In this paper, we propose a modified earth mover’s distance (MEMD) based on the metric learning approach, which has received much attention due to its simple structure and accurate classification. Similarly, MEMD constructs correlations between image regions, using such correlations to characterise the class of the image. MEMD generates a stream of best matches between image regions, and this stream of matches represents the similarity of the classified images. the MEMD algorithm requires the generation of feature weights for image regions, and EMD uses a mechanism of cross-referenced citations to generate uniform consultation weights. in contrast to EMD, MEMD generates inconsistent reference weights based on the similarity of regions. In dealing with the K-Shot problem, we used a learnable class prototype to characterise the class feature vectors. We conducted comprehensive experiments to validate our improved MEMD algorithm and tested it on four popular few-shot datasets. Namely: miniImageNet, tieredImageNet, Fewshot-CIFAR100 (FC100) and the CUB dataset.\",\"PeriodicalId\":269605,\"journal\":{\"name\":\"2022 Prognostics and Health Management Conference (PHM-2022 London)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 Prognostics and Health Management Conference (PHM-2022 London)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PHM2022-London52454.2022.00077\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Prognostics and Health Management Conference (PHM-2022 London)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PHM2022-London52454.2022.00077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
少镜头图像分类问题已成为成像领域的研究热点。为了提高少拍图像分类的准确率,人们提出了许多学习方法,其中主要有四种方法:基于元的学习、基于数据增强的学习、基于迁移的学习和基于度量的学习。本文提出了一种基于度量学习方法的改进型推土机距离(MEMD),该方法因其结构简单、分类准确而受到广泛关注。类似地,MEMD在图像区域之间构建相关性,使用这种相关性来表征图像的类别。MEMD生成图像区域之间的最佳匹配流,该匹配流表示分类图像的相似性。MEMD算法需要生成图像区域的特征权值,而EMD算法使用交叉引用机制生成统一的咨询权值。与EMD相比,MEMD基于区域的相似性产生不一致的参考权重。在处理K-Shot问题时,我们使用可学习的类原型来表征类特征向量。我们进行了全面的实验来验证我们改进的MEMD算法,并在四个流行的少量数据集上进行了测试。即:miniImageNet, tieredImageNet, few - shot- cifar100 (FC100)和CUB数据集。
An Modified Earth Mover’s Distance for Few-Shot Image Classification
The problem of few-shot image classification has become a popular research area in the field of imaging. In order to improve the accuracy of few-shot image classification, many learning methods have been proposed, among which there are four main approaches: meta-based learning, data augmentation-based, migration-based learning and metric-based learning. In this paper, we propose a modified earth mover’s distance (MEMD) based on the metric learning approach, which has received much attention due to its simple structure and accurate classification. Similarly, MEMD constructs correlations between image regions, using such correlations to characterise the class of the image. MEMD generates a stream of best matches between image regions, and this stream of matches represents the similarity of the classified images. the MEMD algorithm requires the generation of feature weights for image regions, and EMD uses a mechanism of cross-referenced citations to generate uniform consultation weights. in contrast to EMD, MEMD generates inconsistent reference weights based on the similarity of regions. In dealing with the K-Shot problem, we used a learnable class prototype to characterise the class feature vectors. We conducted comprehensive experiments to validate our improved MEMD algorithm and tested it on four popular few-shot datasets. Namely: miniImageNet, tieredImageNet, Fewshot-CIFAR100 (FC100) and the CUB dataset.