BAUODNET for Class Imbalance Learning in Underwater Object Detection

IF 5.3 3区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Emerging Topics in Computational Intelligence Pub Date : 2024-09-25 DOI:10.1109/TETCI.2024.3462249

Long Chen;Haohan Yu;Xirui Dong;Yaxin Li;Jialie Shen;Jiangrong Shen;Qi Xu

{"title":"BAUODNET for Class Imbalance Learning in Underwater Object Detection","authors":"Long Chen;Haohan Yu;Xirui Dong;Yaxin Li;Jialie Shen;Jiangrong Shen;Qi Xu","doi":"10.1109/TETCI.2024.3462249","DOIUrl":null,"url":null,"abstract":"Underwater object detection is of great significance for various applications in underwater scenes. However, class imbalance issue is still an unsolved bottleneck for current underwater object detection algorithms. It leads to large discrepancies in the detection precision among different classes that the dominant classes with more training data achieve higher precision while the minority classes with less training data achieve much lower precision. In this paper, we propose a balanced underwater object detection network (BAUODNET) to address the class imbalance issue by exploiting two techniques, i.e., the style augmentation technique and the example re-weighting technique. Firstly, we propose a class-wise style augmentation (CWSA) algorithm to augment the training data for the minority classes that generates different colors, textures and contrasts for the minority classes whilst preserving geometry. The augmented dataset possesses more balanced data distribution; Secondly, we exploit the the focal loss to re-weight the examples during the training of the deep detector, it down-weights the loss assigned to the well-detected examples from the dominant classes and focuses on learning undetected hard examples from the minority classes. Extensive experiments show the effectiveness of CWSA and focal loss for addressing the class imbalance problem in underwater scenes, BAUODNET obtains 49.5% mAP on URPC2017 and 66.8% mAP on URPC2018, achieving state-of-the-art or comparable performance on URPC2017 and URPC2018.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 3","pages":"2452-2461"},"PeriodicalIF":5.3000,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10693644/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Underwater object detection is of great significance for various applications in underwater scenes. However, class imbalance issue is still an unsolved bottleneck for current underwater object detection algorithms. It leads to large discrepancies in the detection precision among different classes that the dominant classes with more training data achieve higher precision while the minority classes with less training data achieve much lower precision. In this paper, we propose a balanced underwater object detection network (BAUODNET) to address the class imbalance issue by exploiting two techniques, i.e., the style augmentation technique and the example re-weighting technique. Firstly, we propose a class-wise style augmentation (CWSA) algorithm to augment the training data for the minority classes that generates different colors, textures and contrasts for the minority classes whilst preserving geometry. The augmented dataset possesses more balanced data distribution; Secondly, we exploit the the focal loss to re-weight the examples during the training of the deep detector, it down-weights the loss assigned to the well-detected examples from the dominant classes and focuses on learning undetected hard examples from the minority classes. Extensive experiments show the effectiveness of CWSA and focal loss for addressing the class imbalance problem in underwater scenes, BAUODNET obtains 49.5% mAP on URPC2017 and 66.8% mAP on URPC2018, achieving state-of-the-art or comparable performance on URPC2017 and URPC2018.

查看原文本刊更多论文

基于BAUODNET的水下目标检测中的类不平衡学习

水下目标检测对于水下场景的各种应用具有重要意义。然而，类不平衡问题仍然是当前水下目标检测算法尚未解决的瓶颈。这导致不同类别之间的检测精度差异较大，训练数据较多的优势类别的检测精度较高，而训练数据较少的少数类别的检测精度较低。本文提出了一种平衡的水下目标检测网络（BAUODNET），利用两种技术，即风格增强技术和实例重加权技术来解决类不平衡问题。首先，我们提出了一种类智能风格增强（CWSA）算法来增强少数类的训练数据，该算法在保留几何形状的同时为少数类生成不同的颜色，纹理和对比度。增强后的数据集数据分布更加均衡；其次，在深度检测器的训练过程中，我们利用焦点损失对样本进行重新加权，它降低了分配给优势类中检测良好的样本的损失的权重，并专注于学习少数类中未检测到的困难样本。大量的实验表明CWSA和焦点损失对于解决水下场景中的类不平衡问题是有效的，BAUODNET在URPC2017上获得49.5%的mAP，在URPC2018上获得66.8%的mAP，在URPC2017和URPC2018上实现了最先进或相当的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Emerging Topics in Computational Intelligence Mathematics-Control and Optimization

CiteScore

10.30

自引率

7.50%

发文量

147

期刊介绍： The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys. TETCI is an electronics only publication. TETCI publishes six issues per year. Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.