{"title":"医学图像的结构张量和频率引导半监督分割","authors":"Xuesong Leng, Xiaxia Wang, Wenbo Yue, Jianxiu Jin, Guoping Xu","doi":"10.1002/mp.17399","DOIUrl":null,"url":null,"abstract":"BackgroundThe method of semi‐supervised semantic segmentation entails training with a limited number of labeled samples alongside many unlabeled samples, aiming to reduce dependence on pixel‐level annotations. Most semi‐supervised semantic segmentation methods primarily focus on sample augmentation in spatial dimensions to reduce the shortage of labeled samples. These methods tend to ignore the structural information of objects. In addition, frequency‐domain information also supplies another perspective to evaluate information from images, which includes different properties compared to the spatial domain.PurposeIn this study, we attempt to answer these two questions: (1) is it helpful to provide structural information of objects in semi‐supervised semantic segmentation tasks for medical images? (2) is it more effective to evaluate the segmentation performance in the frequency domain compared to the spatial domain for semi‐supervised medical image segmentation? Therefore, we seek to introduce structural and frequency information to improve the performance of semi‐supervised semantic segmentation for medical images.MethodsWe present a novel structural tensor loss (STL) to guide feature learning on the spatial domain for semi‐supervised semantic segmentation. Specifically, STL utilizes the structural information encoded in the tensors to enforce the consistency of objects across spatial regions, thereby promoting more robust and accurate feature extraction. Additionally, we proposed a frequency‐domain alignment loss (FAL) to enable the neural networks to learn frequency‐domain information across different augmented samples. It leverages the inherent patterns present in frequency‐domain representations to guide the network in capturing and aligning features across diverse augmentation variations, thereby enhancing the model's robustness for the inputting variations.ResultsWe conduct our experiments on three benchmark datasets, which include MRI (ACDC) for cardiac, CT (Synapse) for abdomen organs, and ultrasound image (BUSI) for breast lesion segmentation. The experimental results demonstrate that our method outperforms state‐of‐the‐art semi‐supervised approaches regarding the Dice similarity coefficient.ConclusionsWe find the proposed approach could improve the final performance of the semi‐supervised medical image segmentation task. It will help reduce the need for medical image labels. Our code will are available at <jats:ext-link xmlns:xlink=\"http://www.w3.org/1999/xlink\" xlink:href=\"https://github.com/apple1986/STLFAL\">https://github.com/apple1986/STLFAL</jats:ext-link>.","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"65 1","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Structural tensor and frequency guided semi‐supervised segmentation for medical images\",\"authors\":\"Xuesong Leng, Xiaxia Wang, Wenbo Yue, Jianxiu Jin, Guoping Xu\",\"doi\":\"10.1002/mp.17399\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"BackgroundThe method of semi‐supervised semantic segmentation entails training with a limited number of labeled samples alongside many unlabeled samples, aiming to reduce dependence on pixel‐level annotations. Most semi‐supervised semantic segmentation methods primarily focus on sample augmentation in spatial dimensions to reduce the shortage of labeled samples. These methods tend to ignore the structural information of objects. In addition, frequency‐domain information also supplies another perspective to evaluate information from images, which includes different properties compared to the spatial domain.PurposeIn this study, we attempt to answer these two questions: (1) is it helpful to provide structural information of objects in semi‐supervised semantic segmentation tasks for medical images? (2) is it more effective to evaluate the segmentation performance in the frequency domain compared to the spatial domain for semi‐supervised medical image segmentation? Therefore, we seek to introduce structural and frequency information to improve the performance of semi‐supervised semantic segmentation for medical images.MethodsWe present a novel structural tensor loss (STL) to guide feature learning on the spatial domain for semi‐supervised semantic segmentation. Specifically, STL utilizes the structural information encoded in the tensors to enforce the consistency of objects across spatial regions, thereby promoting more robust and accurate feature extraction. Additionally, we proposed a frequency‐domain alignment loss (FAL) to enable the neural networks to learn frequency‐domain information across different augmented samples. It leverages the inherent patterns present in frequency‐domain representations to guide the network in capturing and aligning features across diverse augmentation variations, thereby enhancing the model's robustness for the inputting variations.ResultsWe conduct our experiments on three benchmark datasets, which include MRI (ACDC) for cardiac, CT (Synapse) for abdomen organs, and ultrasound image (BUSI) for breast lesion segmentation. The experimental results demonstrate that our method outperforms state‐of‐the‐art semi‐supervised approaches regarding the Dice similarity coefficient.ConclusionsWe find the proposed approach could improve the final performance of the semi‐supervised medical image segmentation task. It will help reduce the need for medical image labels. Our code will are available at <jats:ext-link xmlns:xlink=\\\"http://www.w3.org/1999/xlink\\\" xlink:href=\\\"https://github.com/apple1986/STLFAL\\\">https://github.com/apple1986/STLFAL</jats:ext-link>.\",\"PeriodicalId\":18384,\"journal\":{\"name\":\"Medical physics\",\"volume\":\"65 1\",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical physics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/mp.17399\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/mp.17399","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Structural tensor and frequency guided semi‐supervised segmentation for medical images
BackgroundThe method of semi‐supervised semantic segmentation entails training with a limited number of labeled samples alongside many unlabeled samples, aiming to reduce dependence on pixel‐level annotations. Most semi‐supervised semantic segmentation methods primarily focus on sample augmentation in spatial dimensions to reduce the shortage of labeled samples. These methods tend to ignore the structural information of objects. In addition, frequency‐domain information also supplies another perspective to evaluate information from images, which includes different properties compared to the spatial domain.PurposeIn this study, we attempt to answer these two questions: (1) is it helpful to provide structural information of objects in semi‐supervised semantic segmentation tasks for medical images? (2) is it more effective to evaluate the segmentation performance in the frequency domain compared to the spatial domain for semi‐supervised medical image segmentation? Therefore, we seek to introduce structural and frequency information to improve the performance of semi‐supervised semantic segmentation for medical images.MethodsWe present a novel structural tensor loss (STL) to guide feature learning on the spatial domain for semi‐supervised semantic segmentation. Specifically, STL utilizes the structural information encoded in the tensors to enforce the consistency of objects across spatial regions, thereby promoting more robust and accurate feature extraction. Additionally, we proposed a frequency‐domain alignment loss (FAL) to enable the neural networks to learn frequency‐domain information across different augmented samples. It leverages the inherent patterns present in frequency‐domain representations to guide the network in capturing and aligning features across diverse augmentation variations, thereby enhancing the model's robustness for the inputting variations.ResultsWe conduct our experiments on three benchmark datasets, which include MRI (ACDC) for cardiac, CT (Synapse) for abdomen organs, and ultrasound image (BUSI) for breast lesion segmentation. The experimental results demonstrate that our method outperforms state‐of‐the‐art semi‐supervised approaches regarding the Dice similarity coefficient.ConclusionsWe find the proposed approach could improve the final performance of the semi‐supervised medical image segmentation task. It will help reduce the need for medical image labels. Our code will are available at https://github.com/apple1986/STLFAL.
期刊介绍:
Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments
Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.