Dashe Li , Siwei Zhao , Jingzhe Hu , Yufang Yang , Jinqiang Ding
{"title":"An underwater image segmentation model for complex scenes in aquaculture using vision Transformer","authors":"Dashe Li , Siwei Zhao , Jingzhe Hu , Yufang Yang , Jinqiang Ding","doi":"10.1016/j.compag.2025.110764","DOIUrl":null,"url":null,"abstract":"<div><div>Underwater image processing holds significant importance for aquaculture. Traditional segmentation models have primarily been trained and tested in terrestrial environments. However, substantial differences exist between underwater and terrestrial environments, particularly in lighting conditions and color distribution, and other underwater physical phenomena. The unique underwater conditions render traditional segmentation models unsuitable for direct application, and obtaining ample data becomes challenging. Consequently, underwater image segmentation techniques encounter issues of diminished segmentation performance in practical applications. Therefore, this paper constructs an underwater image segmentation model for complex scenes in aquaculture. This paper first proposes a data preprocessing method based on Mosaic data enhancement technology to process limited datasets and increase their richness and diversity. Second, a residual feature enhancement module is constructed to capture global context information and enhance local feature extraction so local and global dependencies are effectively combined. Finally, a multiscale feature residual fusion module is proposed, which uses residual convolution to enhance the efficient fusion of low- and high-level semantic features and solves the semantic gap between the encoder and decoder. This paper conducts extensive experiments on the three datasets FishData, the underwater images segmentation dataset, and the large-scale fish dataset to verify the effectiveness of the underwater image segmentation model proposed in this paper. Compared with the existing classical network model, the <span><math><mrow><mi>M</mi><mi>I</mi><mi>o</mi><mi>U</mi></mrow></math></span>, <span><math><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></math></span>, <span><math><mrow><mi>C</mi><mi>P</mi><mi>A</mi></mrow></math></span> and <span><math><mrow><mi>F</mi><mn>1</mn><mo>−</mo><mi>s</mi><mi>c</mi><mi>o</mi><mi>r</mi><mi>e</mi></mrow></math></span> of the proposed model reached 92.20%, 97.85%, 92.47% and 92.9%, respectively. Experimental results show that the proposed model performs better in underwater image segmentation tasks.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"238 ","pages":"Article 110764"},"PeriodicalIF":7.7000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925008701","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Underwater image processing holds significant importance for aquaculture. Traditional segmentation models have primarily been trained and tested in terrestrial environments. However, substantial differences exist between underwater and terrestrial environments, particularly in lighting conditions and color distribution, and other underwater physical phenomena. The unique underwater conditions render traditional segmentation models unsuitable for direct application, and obtaining ample data becomes challenging. Consequently, underwater image segmentation techniques encounter issues of diminished segmentation performance in practical applications. Therefore, this paper constructs an underwater image segmentation model for complex scenes in aquaculture. This paper first proposes a data preprocessing method based on Mosaic data enhancement technology to process limited datasets and increase their richness and diversity. Second, a residual feature enhancement module is constructed to capture global context information and enhance local feature extraction so local and global dependencies are effectively combined. Finally, a multiscale feature residual fusion module is proposed, which uses residual convolution to enhance the efficient fusion of low- and high-level semantic features and solves the semantic gap between the encoder and decoder. This paper conducts extensive experiments on the three datasets FishData, the underwater images segmentation dataset, and the large-scale fish dataset to verify the effectiveness of the underwater image segmentation model proposed in this paper. Compared with the existing classical network model, the , , and of the proposed model reached 92.20%, 97.85%, 92.47% and 92.9%, respectively. Experimental results show that the proposed model performs better in underwater image segmentation tasks.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.