Learning to Reweight Samples with Offline Loss Sequence

Yuhua Wei, Xiaoyu Li, Jishang Wei, B. Qian, Chen Li
{"title":"Learning to Reweight Samples with Offline Loss Sequence","authors":"Yuhua Wei, Xiaoyu Li, Jishang Wei, B. Qian, Chen Li","doi":"10.1109/ICDM51629.2021.00083","DOIUrl":null,"url":null,"abstract":"Deep neural networks (DNNs) provide the best of class solutions to many supervised tasks due to their powerful function fitting capabilities. However, it is challenging to handle data bias, such as label noise and class imbalance, when applying DNNs to solve real-world problems. Sample reweighting is a popular strategy to tackle data bias, which assigns higher weights to informative samples or samples with clean labels. However, conventional reweighting methods require prior knowledge of the distribution information of data bias, which is intractable in practice. In recent years, meta-learning-based methods have been proposed to learn to assign weights to training samples adaptively by using their online training loss or gradient directions. However, the latent bias distribution cannot be adequately characterized in an online fashion. The online loss distribution changes over the training procedure, making it even harder to perform the sample weight learning. In contrast to past methods, we propose a two-stage training strategy to tackle the above problems. In the first stage, the loss sequences of samples are collected. In the second stage, a subnet with convolutional layers is utilized to learn the mapping from offline sample loss sequence to sample weight adaptively. Guided by a small unbiased meta dataset, this subnet is optimized iteratively with the main classifier network in a meta-learning manner. Empirical results show that our method, called Meta Reweighting with Offline Loss Sequence (MROLS), outperforms state-of-the-art reweighting techniques on most benchmarks. Moreover, the weights of training samples learned via MROLS can be well utilized by other classifiers, which can directly enhance the standard training schema. Our source code is available at https://github.com/Neronjust2017/MROLS.","PeriodicalId":320970,"journal":{"name":"2021 IEEE International Conference on Data Mining (ICDM)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Data Mining (ICDM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDM51629.2021.00083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Deep neural networks (DNNs) provide best-in-class solutions to many supervised tasks due to their powerful function-fitting capabilities. However, handling data bias, such as label noise and class imbalance, is challenging when applying DNNs to real-world problems. Sample reweighting, which assigns higher weights to informative samples or samples with clean labels, is a popular strategy for tackling data bias. However, conventional reweighting methods require prior knowledge of the bias distribution, which is intractable to obtain in practice. In recent years, meta-learning-based methods have been proposed that learn to assign weights to training samples adaptively from their online training loss or gradient directions. However, the latent bias distribution cannot be adequately characterized in an online fashion: the online loss distribution shifts over the course of training, making sample-weight learning even harder. In contrast to past methods, we propose a two-stage training strategy to tackle these problems. In the first stage, the loss sequence of each sample is collected. In the second stage, a subnet with convolutional layers learns the mapping from a sample's offline loss sequence to its weight adaptively. Guided by a small unbiased meta-dataset, this subnet is optimized iteratively with the main classifier network in a meta-learning manner. Empirical results show that our method, called Meta Reweighting with Offline Loss Sequence (MROLS), outperforms state-of-the-art reweighting techniques on most benchmarks. Moreover, the sample weights learned via MROLS can be reused by other classifiers, directly enhancing the standard training scheme. Our source code is available at https://github.com/Neronjust2017/MROLS.
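
To make the two-stage recipe concrete, below is a minimal PyTorch sketch of how such a weighting subnet and its meta-update might look. This is an illustration under our own assumptions (a 1-D convolution over the loss sequence, a single virtual SGD step, and `torch.func.functional_call` from PyTorch >= 2.0), not the authors' released implementation (see the repository linked above for that); all identifiers such as `WeightSubnet`, `meta_step`, and `loss_history` are hypothetical.

```python
# Hypothetical sketch of the two-stage idea: stage 1 records each sample's
# loss over T epochs; stage 2 maps that offline sequence to a sample weight
# with a small convolutional subnet, trained by a meta loss on an unbiased
# meta-dataset. Not the authors' code; names and hyperparameters are assumed.
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.func import functional_call  # requires PyTorch >= 2.0

class WeightSubnet(nn.Module):
    """Maps an offline loss sequence of length T to a weight in (0, 1)."""
    def __init__(self, hidden: int = 16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv1d(1, hidden, kernel_size=3, padding=1),  # convolve over time
            nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),  # pool the T time steps to one feature
            nn.Flatten(),
            nn.Linear(hidden, 1),
            nn.Sigmoid(),             # weight in (0, 1)
        )

    def forward(self, loss_seq: torch.Tensor) -> torch.Tensor:
        # loss_seq: (batch, T) -> (batch,) sample weights
        return self.net(loss_seq.unsqueeze(1)).squeeze(1)

def meta_step(classifier, subnet, subnet_opt,
              x, y, loss_history, x_meta, y_meta, inner_lr=0.1):
    """One bilevel update: a virtual SGD step on the classifier using the
    subnet's weights, then a meta loss on the unbiased meta set that
    back-propagates into the subnet's parameters."""
    params = dict(classifier.named_parameters())
    # Weighted training loss; weights come from the offline loss sequences.
    losses = F.cross_entropy(functional_call(classifier, params, (x,)),
                             y, reduction="none")
    w = subnet(loss_history)  # (batch,) weights from stage-1 sequences
    grads = torch.autograd.grad((w * losses).mean(),
                                tuple(params.values()), create_graph=True)
    virtual = {k: p - inner_lr * g  # virtual (differentiable) classifier step
               for (k, p), g in zip(params.items(), grads)}
    # Meta loss through the virtual parameters; only the subnet is stepped.
    meta_loss = F.cross_entropy(
        functional_call(classifier, virtual, (x_meta,)), y_meta)
    subnet_opt.zero_grad()
    meta_loss.backward()  # a real loop would also zero the classifier's grads
    subnet_opt.step()     # before performing the classifier's own update
    return meta_loss.item()
```

In a full loop, `loss_history` would hold the per-sample losses recorded during the stage-1 run (shape `(batch, T)` for `T` epochs), and each iteration would alternate this subnet meta-update with an ordinary weighted update of the classifier, mirroring the iterative optimization described in the abstract.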