{"title":"基于动态类平均损失的长尾分类","authors":"Do Ryun Lee, Chang Ouk Kim","doi":"10.1016/j.eswa.2025.128292","DOIUrl":null,"url":null,"abstract":"<div><div>In real-world data distributions, class imbalance is a common issue. When training deep learning models on class-imbalanced data, the performance of classes with fewer samples tends to deteriorate. Numerous studies have addressed this problem, focusing on loss reweighting techniques based on the number of training samples per class. However, because some classes are inherently easier or harder to classify, having a larger number of samples in a particular class does not necessarily ensure lower loss or better learning for that class. Additionally, if the ratio of loss magnitudes differs substantially from the ratio of the number of training samples per class, reweighting based solely on sample size may be inappropriate. This study proposes a method to reweight losses based on dynamic class average losses rather than the number of training samples per class to address these issues. Specifically, this method evaluates the class average losses for each mini-batch, applies a nonlinear transformation to these values, and dynamically adjusts the class-wise loss weights within the loss function during training to better mitigate class imbalance. Experimental results from various types of datasets, including image and tabular data, demonstrate that the proposed method improves performance by 1%–8% across various datasets compared to existing methods.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"288 ","pages":"Article 128292"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Long-tailed classification based on dynamic class average loss\",\"authors\":\"Do Ryun Lee, Chang Ouk Kim\",\"doi\":\"10.1016/j.eswa.2025.128292\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In real-world data distributions, class imbalance is a common issue. When training deep learning models on class-imbalanced data, the performance of classes with fewer samples tends to deteriorate. Numerous studies have addressed this problem, focusing on loss reweighting techniques based on the number of training samples per class. However, because some classes are inherently easier or harder to classify, having a larger number of samples in a particular class does not necessarily ensure lower loss or better learning for that class. Additionally, if the ratio of loss magnitudes differs substantially from the ratio of the number of training samples per class, reweighting based solely on sample size may be inappropriate. This study proposes a method to reweight losses based on dynamic class average losses rather than the number of training samples per class to address these issues. Specifically, this method evaluates the class average losses for each mini-batch, applies a nonlinear transformation to these values, and dynamically adjusts the class-wise loss weights within the loss function during training to better mitigate class imbalance. Experimental results from various types of datasets, including image and tabular data, demonstrate that the proposed method improves performance by 1%–8% across various datasets compared to existing methods.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"288 \",\"pages\":\"Article 128292\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-05-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425019116\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425019116","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Long-tailed classification based on dynamic class average loss
In real-world data distributions, class imbalance is a common issue. When training deep learning models on class-imbalanced data, the performance of classes with fewer samples tends to deteriorate. Numerous studies have addressed this problem, focusing on loss reweighting techniques based on the number of training samples per class. However, because some classes are inherently easier or harder to classify, having a larger number of samples in a particular class does not necessarily ensure lower loss or better learning for that class. Additionally, if the ratio of loss magnitudes differs substantially from the ratio of the number of training samples per class, reweighting based solely on sample size may be inappropriate. This study proposes a method to reweight losses based on dynamic class average losses rather than the number of training samples per class to address these issues. Specifically, this method evaluates the class average losses for each mini-batch, applies a nonlinear transformation to these values, and dynamically adjusts the class-wise loss weights within the loss function during training to better mitigate class imbalance. Experimental results from various types of datasets, including image and tabular data, demonstrate that the proposed method improves performance by 1%–8% across various datasets compared to existing methods.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.