Jui-Chung Ni , Shih-Hsiung Lee , Yen-Cheng Shen , Chu-Sing Yang
{"title":"Improved U-Net based on ResNet and SE-Net with dual attention mechanism for glottis semantic segmentation","authors":"Jui-Chung Ni , Shih-Hsiung Lee , Yen-Cheng Shen , Chu-Sing Yang","doi":"10.1016/j.medengphy.2025.104298","DOIUrl":null,"url":null,"abstract":"<div><div>In previous tasks of glottis image segmentation, the position attention mechanism was rarely incorporated, neglecting the detailed information in glottis position detection. Aiming to improve the U-Net architecture, this study introduces the dual attention mechanism based on the squeeze and excitation (SE)-Net model. This mechanism can integrate traditional channel attention with position attention mechanisms to effectively adjust the weights of crucial features and significance of positions. Replacing the weight adjustment mechanism in SE-Net with the dual attention mechanism creates a broader perspective, enhancing the sensitivity to important features in the model. Furthermore, based on the characteristics of SE-Net, the skip-connection feature of U-Net can still be retained. The architecture proposed in this paper further replaces the convolutional layers in the U-Net encoder with the bottleneck to preserve the information on the features without significantly increasing the amount of computation. In addition, the decoder is replaced with residual blocks to reduce overfitting. The results of the experiment showed that models with retained features demonstrate better accuracy while reducing overfitting. The proposed model achieved positive results in predicting the scores on the public benchmark for automatic glottis segmentation (BAGLS) dataset.</div></div>","PeriodicalId":49836,"journal":{"name":"Medical Engineering & Physics","volume":"136 ","pages":"Article 104298"},"PeriodicalIF":1.7000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical Engineering & Physics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350453325000177","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
In previous tasks of glottis image segmentation, the position attention mechanism was rarely incorporated, neglecting the detailed information in glottis position detection. Aiming to improve the U-Net architecture, this study introduces the dual attention mechanism based on the squeeze and excitation (SE)-Net model. This mechanism can integrate traditional channel attention with position attention mechanisms to effectively adjust the weights of crucial features and significance of positions. Replacing the weight adjustment mechanism in SE-Net with the dual attention mechanism creates a broader perspective, enhancing the sensitivity to important features in the model. Furthermore, based on the characteristics of SE-Net, the skip-connection feature of U-Net can still be retained. The architecture proposed in this paper further replaces the convolutional layers in the U-Net encoder with the bottleneck to preserve the information on the features without significantly increasing the amount of computation. In addition, the decoder is replaced with residual blocks to reduce overfitting. The results of the experiment showed that models with retained features demonstrate better accuracy while reducing overfitting. The proposed model achieved positive results in predicting the scores on the public benchmark for automatic glottis segmentation (BAGLS) dataset.
期刊介绍:
Medical Engineering & Physics provides a forum for the publication of the latest developments in biomedical engineering, and reflects the essential multidisciplinary nature of the subject. The journal publishes in-depth critical reviews, scientific papers and technical notes. Our focus encompasses the application of the basic principles of physics and engineering to the development of medical devices and technology, with the ultimate aim of producing improvements in the quality of health care.Topics covered include biomechanics, biomaterials, mechanobiology, rehabilitation engineering, biomedical signal processing and medical device development. Medical Engineering & Physics aims to keep both engineers and clinicians abreast of the latest applications of technology to health care.