Xinran Liu , Jianmin Yang , Wenhao Xu , Qihang Chen , Haining Lu , Yu Chai , Changyu Lu , Yulong Xue
{"title":"DSM-Net:深海采矿船声纳图像的多尺度检测网络","authors":"Xinran Liu , Jianmin Yang , Wenhao Xu , Qihang Chen , Haining Lu , Yu Chai , Changyu Lu , Yulong Xue","doi":"10.1016/j.apor.2025.104551","DOIUrl":null,"url":null,"abstract":"<div><div>Deep-sea mining vehicles (DSMVs) play a crucial role in deep-sea mining operations, requiring high-precision, real-time detection of seabed rocks and terrain of varying scales to ensure safe navigation and operation. However, the complexity of multi-scale seabed terrains, along with the low resolution and high noise levels in sonar images, makes accurate real-time detection a challenge. To address these issues, DSM-Net, a multi-scale terrain detection network specifically designed for deep-sea mining, is proposed. DSM-Net integrates several innovative modules: the Tri-Scale Attention Module (TSA) extracts multi-scale features and reduces noise interference, the Partial-Dynamic Module (PDM) improves inference speed, and the ASFF* detection head incorporates an additional small-target detection layer. Furthermore, an adaptive weighting function, Adaptive Difficulty Loss (ADL), is introduced to handle the imbalance in the number of targets across different scales in actual seabed environments. DSM-Net was evaluated on the DSMSD dataset, showing a 2.16 % improvement in [email protected]:0.95 and an 18.75 % reduction in inference time compared to the YOLOv8 baseline, striking an effective balance between detection speed and accuracy. In sea trials, DSM-Net contributed to path planning and obstacle avoidance, proving its practical engineering value.</div></div>","PeriodicalId":8261,"journal":{"name":"Applied Ocean Research","volume":"158 ","pages":"Article 104551"},"PeriodicalIF":4.4000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DSM-Net: A multi-scale detection network of sonar images for deep-sea mining vehicle\",\"authors\":\"Xinran Liu , Jianmin Yang , Wenhao Xu , Qihang Chen , Haining Lu , Yu Chai , Changyu Lu , Yulong Xue\",\"doi\":\"10.1016/j.apor.2025.104551\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Deep-sea mining vehicles (DSMVs) play a crucial role in deep-sea mining operations, requiring high-precision, real-time detection of seabed rocks and terrain of varying scales to ensure safe navigation and operation. However, the complexity of multi-scale seabed terrains, along with the low resolution and high noise levels in sonar images, makes accurate real-time detection a challenge. To address these issues, DSM-Net, a multi-scale terrain detection network specifically designed for deep-sea mining, is proposed. DSM-Net integrates several innovative modules: the Tri-Scale Attention Module (TSA) extracts multi-scale features and reduces noise interference, the Partial-Dynamic Module (PDM) improves inference speed, and the ASFF* detection head incorporates an additional small-target detection layer. Furthermore, an adaptive weighting function, Adaptive Difficulty Loss (ADL), is introduced to handle the imbalance in the number of targets across different scales in actual seabed environments. DSM-Net was evaluated on the DSMSD dataset, showing a 2.16 % improvement in [email protected]:0.95 and an 18.75 % reduction in inference time compared to the YOLOv8 baseline, striking an effective balance between detection speed and accuracy. In sea trials, DSM-Net contributed to path planning and obstacle avoidance, proving its practical engineering value.</div></div>\",\"PeriodicalId\":8261,\"journal\":{\"name\":\"Applied Ocean Research\",\"volume\":\"158 \",\"pages\":\"Article 104551\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Ocean Research\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0141118725001397\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, OCEAN\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Ocean Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141118725001397","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, OCEAN","Score":null,"Total":0}
DSM-Net: A multi-scale detection network of sonar images for deep-sea mining vehicle
Deep-sea mining vehicles (DSMVs) play a crucial role in deep-sea mining operations, requiring high-precision, real-time detection of seabed rocks and terrain of varying scales to ensure safe navigation and operation. However, the complexity of multi-scale seabed terrains, along with the low resolution and high noise levels in sonar images, makes accurate real-time detection a challenge. To address these issues, DSM-Net, a multi-scale terrain detection network specifically designed for deep-sea mining, is proposed. DSM-Net integrates several innovative modules: the Tri-Scale Attention Module (TSA) extracts multi-scale features and reduces noise interference, the Partial-Dynamic Module (PDM) improves inference speed, and the ASFF* detection head incorporates an additional small-target detection layer. Furthermore, an adaptive weighting function, Adaptive Difficulty Loss (ADL), is introduced to handle the imbalance in the number of targets across different scales in actual seabed environments. DSM-Net was evaluated on the DSMSD dataset, showing a 2.16 % improvement in [email protected]:0.95 and an 18.75 % reduction in inference time compared to the YOLOv8 baseline, striking an effective balance between detection speed and accuracy. In sea trials, DSM-Net contributed to path planning and obstacle avoidance, proving its practical engineering value.
期刊介绍:
The aim of Applied Ocean Research is to encourage the submission of papers that advance the state of knowledge in a range of topics relevant to ocean engineering.