Zekun Tian , Dunlu Peng , Debby D. Wang , Linna Zhang , Zheng Zou , Hejing Huang , Shiqi Zhang
{"title":"基于多方向移位窗口关注和感应偏置的Swin变压器诊断胸腔积液","authors":"Zekun Tian , Dunlu Peng , Debby D. Wang , Linna Zhang , Zheng Zou , Hejing Huang , Shiqi Zhang","doi":"10.1016/j.asoc.2025.113146","DOIUrl":null,"url":null,"abstract":"<div><div>In the field of healthcare, deep learning has shown promise in addressing diagnostic challenges. However, existing methods often struggle with generalization due to overfitting on non-discriminative features and limited datasets. To address these limitations, <em>Ultra-Multi-SWIN</em> is introduced as a novel deep learning model for pleural effusion diagnosis using ultrasound images. The model incorporates physician-inspired inductive biases into its architecture, enabling it to focus on discriminative features while avoiding overfitting to irrelevant information. Specifically, a multi-directional-shift window structure captures spatial features dependent on direction, and a MASK-based masking module suppresses redundant non-ultrasound features. A dataset comprising 50 subjects and four levels of pleural effusion severity (large, moderate, small, none) is established to evaluate the model’s performance. Experimental results demonstrate that <em>Ultra-Multi-SWIN</em> achieves state-of-the-art performance, with average accuracies of 0.988 (subject-dependent) and 0.952 (subject-independent). Visualization and ablation studies further confirm the model’s ability to generalize effectively by focusing on clinically relevant regions. The open-source code is released at <span><span>Ultra-Multi-SWIN</span><svg><path></path></svg></span>, promoting broader adoption and future research.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"177 ","pages":"Article 113146"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Swin Transformer based on multi-directional-shift window attention and inductive bias for diagnosis of pleural effusion\",\"authors\":\"Zekun Tian , Dunlu Peng , Debby D. Wang , Linna Zhang , Zheng Zou , Hejing Huang , Shiqi Zhang\",\"doi\":\"10.1016/j.asoc.2025.113146\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In the field of healthcare, deep learning has shown promise in addressing diagnostic challenges. However, existing methods often struggle with generalization due to overfitting on non-discriminative features and limited datasets. To address these limitations, <em>Ultra-Multi-SWIN</em> is introduced as a novel deep learning model for pleural effusion diagnosis using ultrasound images. The model incorporates physician-inspired inductive biases into its architecture, enabling it to focus on discriminative features while avoiding overfitting to irrelevant information. Specifically, a multi-directional-shift window structure captures spatial features dependent on direction, and a MASK-based masking module suppresses redundant non-ultrasound features. A dataset comprising 50 subjects and four levels of pleural effusion severity (large, moderate, small, none) is established to evaluate the model’s performance. Experimental results demonstrate that <em>Ultra-Multi-SWIN</em> achieves state-of-the-art performance, with average accuracies of 0.988 (subject-dependent) and 0.952 (subject-independent). Visualization and ablation studies further confirm the model’s ability to generalize effectively by focusing on clinically relevant regions. The open-source code is released at <span><span>Ultra-Multi-SWIN</span><svg><path></path></svg></span>, promoting broader adoption and future research.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"177 \",\"pages\":\"Article 113146\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625004570\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625004570","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
A Swin Transformer based on multi-directional-shift window attention and inductive bias for diagnosis of pleural effusion
In the field of healthcare, deep learning has shown promise in addressing diagnostic challenges. However, existing methods often struggle with generalization due to overfitting on non-discriminative features and limited datasets. To address these limitations, Ultra-Multi-SWIN is introduced as a novel deep learning model for pleural effusion diagnosis using ultrasound images. The model incorporates physician-inspired inductive biases into its architecture, enabling it to focus on discriminative features while avoiding overfitting to irrelevant information. Specifically, a multi-directional-shift window structure captures spatial features dependent on direction, and a MASK-based masking module suppresses redundant non-ultrasound features. A dataset comprising 50 subjects and four levels of pleural effusion severity (large, moderate, small, none) is established to evaluate the model’s performance. Experimental results demonstrate that Ultra-Multi-SWIN achieves state-of-the-art performance, with average accuracies of 0.988 (subject-dependent) and 0.952 (subject-independent). Visualization and ablation studies further confirm the model’s ability to generalize effectively by focusing on clinically relevant regions. The open-source code is released at Ultra-Multi-SWIN, promoting broader adoption and future research.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.