{"title":"S2DiNet: Towards lightweight and fast high-resolution dichotomous image segmentation","authors":"Shuhan Chen , Haonan Tang , Yuan Huang , Lifeng Zhang , Xuelong Hu","doi":"10.1016/j.patcog.2025.111506","DOIUrl":null,"url":null,"abstract":"<div><div>The Dichotomous Image Segmentation task aims to achieve ultra-high precision binary segmentation for category-agnostic objects, including salient, camouflaged, structurally complex, or feature-similar entities. Traditional methods designed for low-resolution inputs produce blurred segmentation, failing to meet such critical safety and stability requirements. Although existing DIS methods achieve high accuracy, they are often parameter-heavy and slow, neglecting practical application needs. To address these challenges, this paper proposes a light-weight and fast framework, aims at improving processing efficiency while ensuring accuracy in high-resolution natural scenes. The proposed method utilizes a shared-weight ResNet-18 backbone to process inputs of different scales. A Feature Synchronization module is employed to enhance the correlation between encoded features of different resolutions. To reduce the parameter and increase the inference speed, the number of feature channels are decreased; however, this also resulted in information loss. The Star Fusion module is introduced to mitigate this issue. Furthermore, a Decoupling and Integration Decoder is adopted to progressively decode and fuse the body, detail, and mask features of the object, enhancing feature decoding accuracy. The proposed model runs at <strong>26.3 FPS</strong> with a <strong>48.7 MB</strong> size, reducing parameters by 72.4% and increasing speed by 30.8% compared to baseline method ISNet, while maintaining superior performance. Moreover, it surpasses several existing high-resolution methods in terms of accuracy.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"164 ","pages":"Article 111506"},"PeriodicalIF":7.5000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325001669","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
S2DiNet: Towards lightweight and fast high-resolution dichotomous image segmentation
The Dichotomous Image Segmentation task aims to achieve ultra-high precision binary segmentation for category-agnostic objects, including salient, camouflaged, structurally complex, or feature-similar entities. Traditional methods designed for low-resolution inputs produce blurred segmentation, failing to meet such critical safety and stability requirements. Although existing DIS methods achieve high accuracy, they are often parameter-heavy and slow, neglecting practical application needs. To address these challenges, this paper proposes a light-weight and fast framework, aims at improving processing efficiency while ensuring accuracy in high-resolution natural scenes. The proposed method utilizes a shared-weight ResNet-18 backbone to process inputs of different scales. A Feature Synchronization module is employed to enhance the correlation between encoded features of different resolutions. To reduce the parameter and increase the inference speed, the number of feature channels are decreased; however, this also resulted in information loss. The Star Fusion module is introduced to mitigate this issue. Furthermore, a Decoupling and Integration Decoder is adopted to progressively decode and fuse the body, detail, and mask features of the object, enhancing feature decoding accuracy. The proposed model runs at 26.3 FPS with a 48.7 MB size, reducing parameters by 72.4% and increasing speed by 30.8% compared to baseline method ISNet, while maintaining superior performance. Moreover, it surpasses several existing high-resolution methods in terms of accuracy.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.