Data Expansion Approach with Attention Mechanism for Learning with Noisy Labels

IF 1; Zone 4 (Computer Science); Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal on Artificial Intelligence Tools; Pub Date: 2023-02-14; DOI: 10.1142/s0218213023500276

Yuichiro Nomura, Takio Kurita

Publication type: Journal Article; Pages: 2350027:1-2350027:19; Open access: no

Citations: 0

Abstract

In recent years, deep learning has advanced many areas of machine learning. However, it requires a huge amount of training data, and data collection techniques such as web crawling can easily produce incorrect labels. When a training dataset contains noisy labels, the generalization performance of deep models decreases significantly. Some recent works have successfully divided a dataset into samples with clean labels and samples with noisy labels. In light of these studies, we propose a novel data expansion framework that uses attention mechanisms to train models robustly on noisy labels. First, our method trains a deep learning model with a sample selection approach and saves the samples selected as clean at the end of training. The original noisy dataset is then extended with the selected samples, and the model is trained again on the extended dataset. To prevent over-fitting and allow the model to learn different patterns from the selected samples, we leverage the attention mechanism of deep learning to modify the representation of the selected samples. We evaluated our method with synthetic noisy labels on CIFAR-10 and CUB-200-2011 and on the real-world dataset Clothing1M. Our method obtained results comparable to baseline CNNs and state-of-the-art methods.
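The two-stage procedure described in the abstract (select clean samples, then extend the noisy dataset with modified copies of them) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say which selection criterion is used or how attention modifies the representation, so the small-loss criterion in `select_clean`, the element-wise reweighting in `attention_modify`, and all function and parameter names here are hypothetical stand-ins, not the authors' implementation.

```python
import numpy as np


def select_clean(losses, keep_ratio):
    """Return indices of the keep_ratio fraction of samples with the smallest loss.

    Assumption: a small-loss criterion stands in for the paper's
    (unspecified) sample selection approach.
    """
    n_keep = int(round(keep_ratio * len(losses)))
    return np.argsort(losses, kind="stable")[:n_keep]


def attention_modify(x, attention):
    """Reweight each sample by its attention map, scaled to a maximum of 1.

    Assumption: element-wise reweighting is a placeholder for whatever
    attention-based modification the paper applies. Attention values are
    assumed to be strictly positive.
    """
    reduce_axes = tuple(range(1, attention.ndim))
    scale = attention.max(axis=reduce_axes, keepdims=True)
    return x * (attention / scale)


def expand_dataset(x, y, clean_idx, attention):
    """Append attention-modified copies of the selected samples to the dataset."""
    x_extra = attention_modify(x[clean_idx], attention[clean_idx])
    x_expanded = np.concatenate([x, x_extra], axis=0)
    y_expanded = np.concatenate([y, y[clean_idx]], axis=0)
    return x_expanded, y_expanded


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(6, 4))            # toy features
    y = np.array([0, 1, 0, 1, 1, 0])       # toy (possibly noisy) labels
    losses = np.array([0.1, 2.0, 0.3, 1.5, 0.2, 3.0])
    attention = rng.uniform(0.1, 1.0, size=(6, 4))

    clean_idx = select_clean(losses, keep_ratio=0.5)
    x_big, y_big = expand_dataset(x, y, clean_idx, attention)
    print(clean_idx, x_big.shape, y_big.shape)
```

In this sketch the model would then be retrained on `x_big` and `y_big`; the original noisy samples are kept, and only the samples judged clean appear a second time in modified form.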


Source journal

International Journal on Artificial Intelligence Tools (Engineering and Technology - Computer Science: Interdisciplinary Applications)

CiteScore

2.10

Self-citation rate

9.10%

Articles published

Review time

8.5 months

About the journal: The International Journal on Artificial Intelligence Tools (IJAIT) provides an interdisciplinary forum in which AI scientists and professionals can share their research results and report new advances on AI tools or tools that use AI. Tools refer to architectures, languages or algorithms, which constitute the means connecting theory with applications. So, IJAIT is a medium for promoting general and/or special purpose tools, which are very important for the evolution of science and manipulation of knowledge. IJAIT can also be used as a test ground for new AI tools. Topics covered by IJAIT include but are not limited to: AI in Bioinformatics, AI for Service Engineering, AI for Software Engineering, AI for Ubiquitous Computing, AI for Web Intelligence Applications, AI Parallel Processing Tools (hardware/software), AI Programming Languages, AI Tools for CAD and VLSI Analysis/Design/Testing, AI Tools for Computer Vision and Speech Understanding, AI Tools for Multimedia, Cognitive Informatics, Data Mining and Machine Learning Tools, Heuristic and AI Planning Strategies and Tools, Image Understanding, Integrated/Hybrid AI Approaches, Intelligent System Architectures, Knowledge-Based/Expert Systems, Knowledge Management and Processing Tools, Knowledge Representation Languages, Natural Language Understanding, Neural Networks for AI, Object-Oriented Programming for AI, Reasoning and Evolution of Knowledge Bases, Self-Healing and Autonomous Systems, and Software Engineering for AI.