Long Chen;Haohan Yu;Xirui Dong;Yaxin Li;Jialie Shen;Jiangrong Shen;Qi Xu
{"title":"BAUODNET for Class Imbalance Learning in Underwater Object Detection","authors":"Long Chen;Haohan Yu;Xirui Dong;Yaxin Li;Jialie Shen;Jiangrong Shen;Qi Xu","doi":"10.1109/TETCI.2024.3462249","DOIUrl":null,"url":null,"abstract":"Underwater object detection is of great significance for various applications in underwater scenes. However, class imbalance issue is still an unsolved bottleneck for current underwater object detection algorithms. It leads to large discrepancies in the detection precision among different classes that the dominant classes with more training data achieve higher precision while the minority classes with less training data achieve much lower precision. In this paper, we propose a balanced underwater object detection network (BAUODNET) to address the class imbalance issue by exploiting two techniques, i.e., the style augmentation technique and the example re-weighting technique. Firstly, we propose a class-wise style augmentation (CWSA) algorithm to augment the training data for the minority classes that generates different colors, textures and contrasts for the minority classes whilst preserving geometry. The augmented dataset possesses more balanced data distribution; Secondly, we exploit the the focal loss to re-weight the examples during the training of the deep detector, it down-weights the loss assigned to the well-detected examples from the dominant classes and focuses on learning undetected hard examples from the minority classes. Extensive experiments show the effectiveness of CWSA and focal loss for addressing the class imbalance problem in underwater scenes, BAUODNET obtains 49.5% mAP on URPC2017 and 66.8% mAP on URPC2018, achieving state-of-the-art or comparable performance on URPC2017 and URPC2018.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 3","pages":"2452-2461"},"PeriodicalIF":5.3000,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10693644/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Underwater object detection is of great significance for various applications in underwater scenes. However, class imbalance issue is still an unsolved bottleneck for current underwater object detection algorithms. It leads to large discrepancies in the detection precision among different classes that the dominant classes with more training data achieve higher precision while the minority classes with less training data achieve much lower precision. In this paper, we propose a balanced underwater object detection network (BAUODNET) to address the class imbalance issue by exploiting two techniques, i.e., the style augmentation technique and the example re-weighting technique. Firstly, we propose a class-wise style augmentation (CWSA) algorithm to augment the training data for the minority classes that generates different colors, textures and contrasts for the minority classes whilst preserving geometry. The augmented dataset possesses more balanced data distribution; Secondly, we exploit the the focal loss to re-weight the examples during the training of the deep detector, it down-weights the loss assigned to the well-detected examples from the dominant classes and focuses on learning undetected hard examples from the minority classes. Extensive experiments show the effectiveness of CWSA and focal loss for addressing the class imbalance problem in underwater scenes, BAUODNET obtains 49.5% mAP on URPC2017 and 66.8% mAP on URPC2018, achieving state-of-the-art or comparable performance on URPC2017 and URPC2018.
期刊介绍:
The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys.
TETCI is an electronics only publication. TETCI publishes six issues per year.
Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.