医学图像的结构张量和频率引导半监督分割

IF 3.2 2区 医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Medical physics Pub Date : 2024-09-17 DOI:10.1002/mp.17399
Xuesong Leng, Xiaxia Wang, Wenbo Yue, Jianxiu Jin, Guoping Xu
{"title":"医学图像的结构张量和频率引导半监督分割","authors":"Xuesong Leng, Xiaxia Wang, Wenbo Yue, Jianxiu Jin, Guoping Xu","doi":"10.1002/mp.17399","DOIUrl":null,"url":null,"abstract":"BackgroundThe method of semi‐supervised semantic segmentation entails training with a limited number of labeled samples alongside many unlabeled samples, aiming to reduce dependence on pixel‐level annotations. Most semi‐supervised semantic segmentation methods primarily focus on sample augmentation in spatial dimensions to reduce the shortage of labeled samples. These methods tend to ignore the structural information of objects. In addition, frequency‐domain information also supplies another perspective to evaluate information from images, which includes different properties compared to the spatial domain.PurposeIn this study, we attempt to answer these two questions: (1) is it helpful to provide structural information of objects in semi‐supervised semantic segmentation tasks for medical images? (2) is it more effective to evaluate the segmentation performance in the frequency domain compared to the spatial domain for semi‐supervised medical image segmentation? Therefore, we seek to introduce structural and frequency information to improve the performance of semi‐supervised semantic segmentation for medical images.MethodsWe present a novel structural tensor loss (STL) to guide feature learning on the spatial domain for semi‐supervised semantic segmentation. Specifically, STL utilizes the structural information encoded in the tensors to enforce the consistency of objects across spatial regions, thereby promoting more robust and accurate feature extraction. Additionally, we proposed a frequency‐domain alignment loss (FAL) to enable the neural networks to learn frequency‐domain information across different augmented samples. It leverages the inherent patterns present in frequency‐domain representations to guide the network in capturing and aligning features across diverse augmentation variations, thereby enhancing the model's robustness for the inputting variations.ResultsWe conduct our experiments on three benchmark datasets, which include MRI (ACDC) for cardiac, CT (Synapse) for abdomen organs, and ultrasound image (BUSI) for breast lesion segmentation. The experimental results demonstrate that our method outperforms state‐of‐the‐art semi‐supervised approaches regarding the Dice similarity coefficient.ConclusionsWe find the proposed approach could improve the final performance of the semi‐supervised medical image segmentation task. It will help reduce the need for medical image labels. Our code will are available at <jats:ext-link xmlns:xlink=\"http://www.w3.org/1999/xlink\" xlink:href=\"https://github.com/apple1986/STLFAL\">https://github.com/apple1986/STLFAL</jats:ext-link>.","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"65 1","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Structural tensor and frequency guided semi‐supervised segmentation for medical images\",\"authors\":\"Xuesong Leng, Xiaxia Wang, Wenbo Yue, Jianxiu Jin, Guoping Xu\",\"doi\":\"10.1002/mp.17399\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"BackgroundThe method of semi‐supervised semantic segmentation entails training with a limited number of labeled samples alongside many unlabeled samples, aiming to reduce dependence on pixel‐level annotations. Most semi‐supervised semantic segmentation methods primarily focus on sample augmentation in spatial dimensions to reduce the shortage of labeled samples. These methods tend to ignore the structural information of objects. In addition, frequency‐domain information also supplies another perspective to evaluate information from images, which includes different properties compared to the spatial domain.PurposeIn this study, we attempt to answer these two questions: (1) is it helpful to provide structural information of objects in semi‐supervised semantic segmentation tasks for medical images? (2) is it more effective to evaluate the segmentation performance in the frequency domain compared to the spatial domain for semi‐supervised medical image segmentation? Therefore, we seek to introduce structural and frequency information to improve the performance of semi‐supervised semantic segmentation for medical images.MethodsWe present a novel structural tensor loss (STL) to guide feature learning on the spatial domain for semi‐supervised semantic segmentation. Specifically, STL utilizes the structural information encoded in the tensors to enforce the consistency of objects across spatial regions, thereby promoting more robust and accurate feature extraction. Additionally, we proposed a frequency‐domain alignment loss (FAL) to enable the neural networks to learn frequency‐domain information across different augmented samples. It leverages the inherent patterns present in frequency‐domain representations to guide the network in capturing and aligning features across diverse augmentation variations, thereby enhancing the model's robustness for the inputting variations.ResultsWe conduct our experiments on three benchmark datasets, which include MRI (ACDC) for cardiac, CT (Synapse) for abdomen organs, and ultrasound image (BUSI) for breast lesion segmentation. The experimental results demonstrate that our method outperforms state‐of‐the‐art semi‐supervised approaches regarding the Dice similarity coefficient.ConclusionsWe find the proposed approach could improve the final performance of the semi‐supervised medical image segmentation task. It will help reduce the need for medical image labels. Our code will are available at <jats:ext-link xmlns:xlink=\\\"http://www.w3.org/1999/xlink\\\" xlink:href=\\\"https://github.com/apple1986/STLFAL\\\">https://github.com/apple1986/STLFAL</jats:ext-link>.\",\"PeriodicalId\":18384,\"journal\":{\"name\":\"Medical physics\",\"volume\":\"65 1\",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical physics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/mp.17399\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/mp.17399","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

摘要

背景半监督语义分割方法需要使用有限数量的标注样本和大量未标注样本进行训练,目的是减少对像素级注释的依赖。大多数半监督语义分割方法主要侧重于在空间维度上增加样本,以减少标记样本的不足。这些方法往往会忽略物体的结构信息。此外,频域信息也为评估图像信息提供了另一个视角,与空间域相比,频域信息具有不同的属性。目的在本研究中,我们试图回答以下两个问题:(1) 在医学图像的半监督语义分割任务中提供对象的结构信息是否有帮助?(2) 在半监督医学图像分割中,在频域评估分割性能是否比在空间域评估更有效?因此,我们试图引入结构和频率信息来提高医学图像半监督语义分割的性能。方法我们提出了一种新颖的结构张量损失(STL)来指导半监督语义分割的空间域特征学习。具体来说,STL 利用编码在张量中的结构信息来加强跨空间区域对象的一致性,从而提高特征提取的鲁棒性和准确性。此外,我们还提出了一种频域对齐损耗(FAL),使神经网络能够在不同的增强样本中学习频域信息。结果我们在三个基准数据集上进行了实验,这三个数据集包括针对心脏的核磁共振成像(ACDC)、针对腹部器官的 CT(Synapse)和针对乳腺病变分割的超声波图像(BUSI)。实验结果表明,在 Dice 相似性系数方面,我们的方法优于最先进的半监督方法。它将有助于减少对医学图像标签的需求。我们的代码见 https://github.com/apple1986/STLFAL。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Structural tensor and frequency guided semi‐supervised segmentation for medical images
BackgroundThe method of semi‐supervised semantic segmentation entails training with a limited number of labeled samples alongside many unlabeled samples, aiming to reduce dependence on pixel‐level annotations. Most semi‐supervised semantic segmentation methods primarily focus on sample augmentation in spatial dimensions to reduce the shortage of labeled samples. These methods tend to ignore the structural information of objects. In addition, frequency‐domain information also supplies another perspective to evaluate information from images, which includes different properties compared to the spatial domain.PurposeIn this study, we attempt to answer these two questions: (1) is it helpful to provide structural information of objects in semi‐supervised semantic segmentation tasks for medical images? (2) is it more effective to evaluate the segmentation performance in the frequency domain compared to the spatial domain for semi‐supervised medical image segmentation? Therefore, we seek to introduce structural and frequency information to improve the performance of semi‐supervised semantic segmentation for medical images.MethodsWe present a novel structural tensor loss (STL) to guide feature learning on the spatial domain for semi‐supervised semantic segmentation. Specifically, STL utilizes the structural information encoded in the tensors to enforce the consistency of objects across spatial regions, thereby promoting more robust and accurate feature extraction. Additionally, we proposed a frequency‐domain alignment loss (FAL) to enable the neural networks to learn frequency‐domain information across different augmented samples. It leverages the inherent patterns present in frequency‐domain representations to guide the network in capturing and aligning features across diverse augmentation variations, thereby enhancing the model's robustness for the inputting variations.ResultsWe conduct our experiments on three benchmark datasets, which include MRI (ACDC) for cardiac, CT (Synapse) for abdomen organs, and ultrasound image (BUSI) for breast lesion segmentation. The experimental results demonstrate that our method outperforms state‐of‐the‐art semi‐supervised approaches regarding the Dice similarity coefficient.ConclusionsWe find the proposed approach could improve the final performance of the semi‐supervised medical image segmentation task. It will help reduce the need for medical image labels. Our code will are available at https://github.com/apple1986/STLFAL.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Medical physics
Medical physics 医学-核医学
CiteScore
6.80
自引率
15.80%
发文量
660
审稿时长
1.7 months
期刊介绍: Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信