{"title":"Laplacian-guided hierarchical transformer: A network for medical image segmentation.","authors":"Yuxiao Chen, Diwei Su, Jianxu Luo","doi":"10.1016/j.cmpb.2024.108526","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and objective: </strong>Accurate medical image segmentation is crucial for diagnosis and treatment planning, particularly in tumor localization and organ measurement. Despite the success of Transformer models in various domains, they still struggle to capture high-frequency features, limiting their performance in medical image segmentation, especially in edge texture extraction. To overcome this limitation and improve segmentation accuracy, this study proposes a novel model architecture aimed at enhancing the Transformer's ability to capture and integrate both high-frequency and low-frequency features.</p><p><strong>Methods: </strong>Our model combines the extraction of high-frequency features using a Laplacian pyramid with the capture of low-frequency features through a Local-Global Feature Aggregation Module. A Feature Interaction Fusion module is employed to integrate these features, focusing on target areas. Additionally, a new bridging module facilitates the transfer of spatial information between the encoder and decoder via layer-wise attention mechanisms. The model's performance was evaluated using the Synapse dataset with statistical measures such as the Dice Similarity Coefficient and Hausdorff Distance. The code is available at https://github.com/chenyuxiao123/LGHF.</p><p><strong>Results: </strong>The proposed model demonstrated state-of-the-art performance in 2D medical image segmentation, achieving a Dice Similarity Coefficient of 84.10% and a Hausdorff Distance of 12.78. The evaluation metrics indicate significant improvements compared to existing methods.</p><p><strong>Conclusion: </strong>This novel model architecture, with its enhanced capability to capture and integrate both high-frequency and low-frequency features, shows significant potential for advancing medical image segmentation. The results on the Synapse dataset demonstrate its effectiveness and suggest its application could improve diagnosis and treatment planning in clinical settings.</p>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"260 ","pages":"108526"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.cmpb.2024.108526","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background and objective: Accurate medical image segmentation is crucial for diagnosis and treatment planning, particularly in tumor localization and organ measurement. Despite the success of Transformer models in various domains, they still struggle to capture high-frequency features, limiting their performance in medical image segmentation, especially in edge texture extraction. To overcome this limitation and improve segmentation accuracy, this study proposes a novel model architecture aimed at enhancing the Transformer's ability to capture and integrate both high-frequency and low-frequency features.
Methods: Our model combines the extraction of high-frequency features using a Laplacian pyramid with the capture of low-frequency features through a Local-Global Feature Aggregation Module. A Feature Interaction Fusion module is employed to integrate these features, focusing on target areas. Additionally, a new bridging module facilitates the transfer of spatial information between the encoder and decoder via layer-wise attention mechanisms. The model's performance was evaluated using the Synapse dataset with statistical measures such as the Dice Similarity Coefficient and Hausdorff Distance. The code is available at https://github.com/chenyuxiao123/LGHF.
Results: The proposed model demonstrated state-of-the-art performance in 2D medical image segmentation, achieving a Dice Similarity Coefficient of 84.10% and a Hausdorff Distance of 12.78. The evaluation metrics indicate significant improvements compared to existing methods.
Conclusion: This novel model architecture, with its enhanced capability to capture and integrate both high-frequency and low-frequency features, shows significant potential for advancing medical image segmentation. The results on the Synapse dataset demonstrate its effectiveness and suggest its application could improve diagnosis and treatment planning in clinical settings.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.