{"title":"Exploring the Depth From EAST: Efficient Aggregated State-Space Tanh-Tuned Model for Underwater Object Detection","authors":"Yili Xu;Xuanxuan Xiao","doi":"10.1109/LSP.2025.3606841","DOIUrl":null,"url":null,"abstract":"Underwater object detection faces severe challenges due to light attenuation, color distortion, and low contrast. This letter presents EAST-YOLO, an efficient architecture achieving 79.6% average mAP@0.5 across six underwater datasets—2.1% higher than YOLO11n—while maintaining 2.6 M parameters, 6.5 GFLOPs, and 70 FPS real-time performance. Three problem-driven modules address specific underwater challenges: VSS-Enhanced Block for visibility-limited global context modeling with <inline-formula><tex-math>$\\mathcal {O}(N)$</tex-math></inline-formula> complexity, Aggregated Pathway Block for refraction-robust multi-scale detection, and Tanh-Tuned Attention Block for spatially-adaptive feature modulation. Extensive evaluation on RUOD, DUO, URPC2020, UTDAC2020, DFUI, and AUDD datasets demonstrates EAST-YOLO’s effectiveness as a practical solution for resource-constrained underwater applications, with promising robustness across diverse degraded conditions.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"3809-3813"},"PeriodicalIF":3.9000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Signal Processing Letters","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11153389/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Underwater object detection faces severe challenges due to light attenuation, color distortion, and low contrast. This letter presents EAST-YOLO, an efficient architecture achieving 79.6% average mAP@0.5 across six underwater datasets—2.1% higher than YOLO11n—while maintaining 2.6 M parameters, 6.5 GFLOPs, and 70 FPS real-time performance. Three problem-driven modules address specific underwater challenges: VSS-Enhanced Block for visibility-limited global context modeling with $\mathcal {O}(N)$ complexity, Aggregated Pathway Block for refraction-robust multi-scale detection, and Tanh-Tuned Attention Block for spatially-adaptive feature modulation. Extensive evaluation on RUOD, DUO, URPC2020, UTDAC2020, DFUI, and AUDD datasets demonstrates EAST-YOLO’s effectiveness as a practical solution for resource-constrained underwater applications, with promising robustness across diverse degraded conditions.
期刊介绍:
The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshop organized by the Signal Processing Society.