G2NetPL: Generic Game-Theoretic Network for Partial-Label Image Classification

BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference Pub Date : 2022-10-20 DOI:10.48550/arXiv.2210.11469

R. Abdelfattah, Xin Zhang, M. Fouda, Xiaofeng Wang, Song Wang

{"title":"G2NetPL: Generic Game-Theoretic Network for Partial-Label Image Classification","authors":"R. Abdelfattah, Xin Zhang, M. Fouda, Xiaofeng Wang, Song Wang","doi":"10.48550/arXiv.2210.11469","DOIUrl":null,"url":null,"abstract":"Multi-label image classification aims to predict all possible labels in an image. It is usually formulated as a partial-label learning problem, since it could be expensive in practice to annotate all the labels in every training image. Existing works on partial-label learning focus on the case where each training image is labeled with only a subset of its positive/negative labels. To effectively address partial-label classification, this paper proposes an end-to-end Generic Game-theoretic Network (G2NetPL) for partial-label learning, which can be applied to most partial-label settings, including a very challenging, but annotation-efficient case where only a subset of the training images are labeled, each with only one positive label, while the rest of the training images remain unlabeled. In G2NetPL, each unobserved label is associated with a soft pseudo label, which, together with the network, formulates a two-player non-zero-sum non-cooperative game. The objective of the network is to minimize the loss function with given pseudo labels, while the pseudo labels will seek convergence to 1 (positive) or 0 (negative) with a penalty of deviating from the predicted labels determined by the network. In addition, we introduce a confidence-aware scheduler into the loss of the network to adaptively perform easy-to-hard learning for different labels. Extensive experiments demonstrate that our proposed G2NetPL outperforms many state-of-the-art multi-label classification methods under various partial-label settings on three different datasets.","PeriodicalId":72437,"journal":{"name":"BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference","volume":"25 1","pages":"309"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2210.11469","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Multi-label image classification aims to predict all possible labels in an image. It is usually formulated as a partial-label learning problem, since it could be expensive in practice to annotate all the labels in every training image. Existing works on partial-label learning focus on the case where each training image is labeled with only a subset of its positive/negative labels. To effectively address partial-label classification, this paper proposes an end-to-end Generic Game-theoretic Network (G2NetPL) for partial-label learning, which can be applied to most partial-label settings, including a very challenging, but annotation-efficient case where only a subset of the training images are labeled, each with only one positive label, while the rest of the training images remain unlabeled. In G2NetPL, each unobserved label is associated with a soft pseudo label, which, together with the network, formulates a two-player non-zero-sum non-cooperative game. The objective of the network is to minimize the loss function with given pseudo labels, while the pseudo labels will seek convergence to 1 (positive) or 0 (negative) with a penalty of deviating from the predicted labels determined by the network. In addition, we introduce a confidence-aware scheduler into the loss of the network to adaptively perform easy-to-hard learning for different labels. Extensive experiments demonstrate that our proposed G2NetPL outperforms many state-of-the-art multi-label classification methods under various partial-label settings on three different datasets.

查看原文本刊更多论文

G2NetPL:部分标签图像分类的通用博弈论网络

多标签图像分类旨在预测图像中所有可能的标签。它通常被表述为一个部分标签学习问题，因为在实践中标注每个训练图像中的所有标签可能是昂贵的。部分标签学习的现有工作集中在每个训练图像只被标记为其正/负标签的子集的情况下。为了有效地解决部分标签分类问题，本文提出了一个用于部分标签学习的端到端通用博弈论网络(G2NetPL)，它可以应用于大多数部分标签设置，包括一个非常具有挑战性但注释效率高的情况，即只有一部分训练图像被标记，每个图像只有一个正标签，而其余训练图像保持未标记。在G2NetPL中，每个未观察到的标签都与一个软伪标签相关联，软伪标签与网络一起构成了一个二人非零和非合作博弈。网络的目标是最小化给定伪标签的损失函数，而伪标签将寻求收敛到1(正)或0(负)，并以偏离网络确定的预测标签为代价。此外，我们在网络的损失中引入了一个自信感知的调度程序，以自适应地对不同的标签进行易难学习。大量的实验表明，我们提出的G2NetPL在三种不同数据集的不同部分标签设置下优于许多最先进的多标签分类方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference

自引率

0.00%

发文量