Faizanuddin Ansari , Abhranta Panigrahi , Swagatam Das
{"title":"Sampling-tailored two-pronged network for long-tailed class imbalance learning","authors":"Faizanuddin Ansari , Abhranta Panigrahi , Swagatam Das","doi":"10.1016/j.engappai.2025.111466","DOIUrl":null,"url":null,"abstract":"<div><div>A long-tail class imbalanced learning problem is a scenario where the rare or minority classes, representing infrequent events or categories, make up the <em>long tail</em> of the class distribution and have disproportionately few examples compared to the dominant classes. The resulting imbalance makes it challenging to train models effectively for these underrepresented classes. We introduce a comprehensive solution - <strong>STTP-Net</strong>: <strong>S</strong>ampling-<strong>T</strong>ailored <strong>T</strong>wo-<strong>P</strong>ronged <strong>Net</strong>work for long-tail class-imbalanced learning, which aims to address this issue holistically. The study thoroughly examines mixed sample data augmentation techniques in conjunction with various sampling strategies to identify the most effective approaches for handling long-tail imbalance. Based on this analysis, a hybrid mixup strategy tailored explicitly for data augmentation in long-tail imbalanced settings is proposed. The core of the proposed approach comprises a two-pronged network consisting of two classification heads designed to handle long-tail imbalanced datasets. One head specializes in learning the head and median classes in this design. In contrast, the other head becomes an expert in tail classes, striking a balance between accurate prediction of tail classes without compromising accuracy for the head classes. Additionally, we address the label distribution shifts in long-tail imbalance by introducing an Effective Balanced Softmax (EBS) function. The presented method achieves state-of-the-art performance in several benchmark categories for long-tail visual recognition datasets, surpassing the most prominent and pertinent end-to-end and dual-branch approaches.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111466"},"PeriodicalIF":8.0000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095219762501468X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
A long-tail class imbalanced learning problem is a scenario where the rare or minority classes, representing infrequent events or categories, make up the long tail of the class distribution and have disproportionately few examples compared to the dominant classes. The resulting imbalance makes it challenging to train models effectively for these underrepresented classes. We introduce a comprehensive solution - STTP-Net: Sampling-Tailored Two-Pronged Network for long-tail class-imbalanced learning, which aims to address this issue holistically. The study thoroughly examines mixed sample data augmentation techniques in conjunction with various sampling strategies to identify the most effective approaches for handling long-tail imbalance. Based on this analysis, a hybrid mixup strategy tailored explicitly for data augmentation in long-tail imbalanced settings is proposed. The core of the proposed approach comprises a two-pronged network consisting of two classification heads designed to handle long-tail imbalanced datasets. One head specializes in learning the head and median classes in this design. In contrast, the other head becomes an expert in tail classes, striking a balance between accurate prediction of tail classes without compromising accuracy for the head classes. Additionally, we address the label distribution shifts in long-tail imbalance by introducing an Effective Balanced Softmax (EBS) function. The presented method achieves state-of-the-art performance in several benchmark categories for long-tail visual recognition datasets, surpassing the most prominent and pertinent end-to-end and dual-branch approaches.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.