Haotian Zheng;Yushan Sun;Hao Xu;Liwen Zhang;Yatong Han;Shuguang Cui;Zhen Li
{"title":"MLMFFNet:用于实时前视声纳图像分割的多级混合特征融合网络","authors":"Haotian Zheng;Yushan Sun;Hao Xu;Liwen Zhang;Yatong Han;Shuguang Cui;Zhen Li","doi":"10.1109/JOE.2025.3529132","DOIUrl":null,"url":null,"abstract":"Forward-looking sonar is a critical tool for underwater target detection, and segmentation is an essential component of forward-looking sonar image processing. Accurate segmentation of sonar images is vital, but the complexity of the underwater environment introduces challenges, such as low resolution, significant noise, and blurred target edges. These factors make real-time, precise segmentation particularly difficult. To address these challenges, we propose a novel real-time segmentation network, the multilevel mixed feature fusion network (MLMFFNet), specifically designed for forward-looking sonar images. Our approach leverages a unique MFF module and a multiscale MFF module to extract both local and contextual information using deep convolutional networks, dilated convolutions, and partial convolution combinations for effective information integration. Additionally, we incorporate a context connection module to enhance feature fusion by utilizing high-level contextual information. To further improve accuracy, we introduce three weighted loss functions designed to address imbalanced sample distributions and blurred boundaries. Experimental evaluations on two distinct forward-looking sonar data sets demonstrate that MLMFFNet significantly outperforms many state-of-the-art general and sonar-specific semantic segmentation networks, delivering superior segmentation accuracy while maintaining real-time performance.","PeriodicalId":13191,"journal":{"name":"IEEE Journal of Oceanic Engineering","volume":"50 2","pages":"1356-1369"},"PeriodicalIF":3.8000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MLMFFNet: Multilevel Mixed Feature Fusion Network for Real-Time Forward-Looking Sonar Image Segmentation\",\"authors\":\"Haotian Zheng;Yushan Sun;Hao Xu;Liwen Zhang;Yatong Han;Shuguang Cui;Zhen Li\",\"doi\":\"10.1109/JOE.2025.3529132\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Forward-looking sonar is a critical tool for underwater target detection, and segmentation is an essential component of forward-looking sonar image processing. Accurate segmentation of sonar images is vital, but the complexity of the underwater environment introduces challenges, such as low resolution, significant noise, and blurred target edges. These factors make real-time, precise segmentation particularly difficult. To address these challenges, we propose a novel real-time segmentation network, the multilevel mixed feature fusion network (MLMFFNet), specifically designed for forward-looking sonar images. Our approach leverages a unique MFF module and a multiscale MFF module to extract both local and contextual information using deep convolutional networks, dilated convolutions, and partial convolution combinations for effective information integration. Additionally, we incorporate a context connection module to enhance feature fusion by utilizing high-level contextual information. To further improve accuracy, we introduce three weighted loss functions designed to address imbalanced sample distributions and blurred boundaries. Experimental evaluations on two distinct forward-looking sonar data sets demonstrate that MLMFFNet significantly outperforms many state-of-the-art general and sonar-specific semantic segmentation networks, delivering superior segmentation accuracy while maintaining real-time performance.\",\"PeriodicalId\":13191,\"journal\":{\"name\":\"IEEE Journal of Oceanic Engineering\",\"volume\":\"50 2\",\"pages\":\"1356-1369\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-03-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Oceanic Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10916523/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Oceanic Engineering","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10916523/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Forward-looking sonar is a critical tool for underwater target detection, and segmentation is an essential component of forward-looking sonar image processing. Accurate segmentation of sonar images is vital, but the complexity of the underwater environment introduces challenges, such as low resolution, significant noise, and blurred target edges. These factors make real-time, precise segmentation particularly difficult. To address these challenges, we propose a novel real-time segmentation network, the multilevel mixed feature fusion network (MLMFFNet), specifically designed for forward-looking sonar images. Our approach leverages a unique MFF module and a multiscale MFF module to extract both local and contextual information using deep convolutional networks, dilated convolutions, and partial convolution combinations for effective information integration. Additionally, we incorporate a context connection module to enhance feature fusion by utilizing high-level contextual information. To further improve accuracy, we introduce three weighted loss functions designed to address imbalanced sample distributions and blurred boundaries. Experimental evaluations on two distinct forward-looking sonar data sets demonstrate that MLMFFNet significantly outperforms many state-of-the-art general and sonar-specific semantic segmentation networks, delivering superior segmentation accuracy while maintaining real-time performance.
期刊介绍:
The IEEE Journal of Oceanic Engineering (ISSN 0364-9059) is the online-only quarterly publication of the IEEE Oceanic Engineering Society (IEEE OES). The scope of the Journal is the field of interest of the IEEE OES, which encompasses all aspects of science, engineering, and technology that address research, development, and operations pertaining to all bodies of water. This includes the creation of new capabilities and technologies from concept design through prototypes, testing, and operational systems to sense, explore, understand, develop, use, and responsibly manage natural resources.