{"title":"评估自监督xLSTM-UNet架构在头颈部肿瘤分割的mr引导应用。","authors":"Abdul Qayyum, Moona Mazher, Steven A Niederer","doi":"10.1007/978-3-031-83274-1_12","DOIUrl":null,"url":null,"abstract":"<p><p>Radiation therapy (RT) plays a pivotal role in treating head and neck cancer (HNC), with MRI-guided approaches offering superior soft tissue contrast and daily adaptive capabilities that significantly enhance treatment precision while minimizing side effects. To optimize MRI-guided adaptive RT for HNC, we propose a novel two-stage model for Head and Neck Tumor Segmentation. In the first stage, we leverage a Self-Supervised 3D Student-Teacher Learning Framework, specifically utilizing the DINOv2 architecture, to learn effective representations from a limited unlabeled dataset. This approach effectively addresses the challenge posed by the scarcity of annotated data, enabling the model to generalize better in tumor identification and segmentation. In the second stage, we fine-tune an xLSTM-based UNet model that is specifically designed to capture both spatial and sequential features of tumor progression. This hybrid architecture improves segmentation accuracy by integrating temporal dependencies, making it particularly well-suited for MRI-guided adaptive RT planning in HNC. The model's performance is rigorously evaluated on a diverse set of HNC cases, demonstrating significant improvements over state-of-the-art deep learning models in accurately segmenting tumor structures. Our proposed solution achieved an impressive mean aggregated Dice Coefficient of 0.81 for pre-RT segments and 0.65 for mid-RT segments, underscoring its effectiveness in automated segmentation tasks. This work advances the field of HNC imaging by providing a robust, generalizable solution for automated Head and Neck Tumor Segmentation, ultimately enhancing the quality of care for patients undergoing RT. Our team name is DeepLearnAI (CEMRG). The code for this work is available at https://github.com/RespectKnowledge/SSL-based-DINOv2_Vision-LSTM_Head-and-Neck-Tumor_Segmentation.</p>","PeriodicalId":520475,"journal":{"name":"Head and Neck Tumor Segmentation for MR-Guided Applications : First MICCAI Challenge, HNTS-MRG 2024, held in conjunction with MICCAI 2024, Marrakesh, Morocco, October 17, 2024, proceedings","volume":"15273 ","pages":"166-178"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12091698/pdf/","citationCount":"0","resultStr":"{\"title\":\"Assessing Self-supervised xLSTM-UNet Architectures for Head and Neck Tumor Segmentation in MR-Guided Applications.\",\"authors\":\"Abdul Qayyum, Moona Mazher, Steven A Niederer\",\"doi\":\"10.1007/978-3-031-83274-1_12\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Radiation therapy (RT) plays a pivotal role in treating head and neck cancer (HNC), with MRI-guided approaches offering superior soft tissue contrast and daily adaptive capabilities that significantly enhance treatment precision while minimizing side effects. To optimize MRI-guided adaptive RT for HNC, we propose a novel two-stage model for Head and Neck Tumor Segmentation. In the first stage, we leverage a Self-Supervised 3D Student-Teacher Learning Framework, specifically utilizing the DINOv2 architecture, to learn effective representations from a limited unlabeled dataset. This approach effectively addresses the challenge posed by the scarcity of annotated data, enabling the model to generalize better in tumor identification and segmentation. In the second stage, we fine-tune an xLSTM-based UNet model that is specifically designed to capture both spatial and sequential features of tumor progression. This hybrid architecture improves segmentation accuracy by integrating temporal dependencies, making it particularly well-suited for MRI-guided adaptive RT planning in HNC. The model's performance is rigorously evaluated on a diverse set of HNC cases, demonstrating significant improvements over state-of-the-art deep learning models in accurately segmenting tumor structures. Our proposed solution achieved an impressive mean aggregated Dice Coefficient of 0.81 for pre-RT segments and 0.65 for mid-RT segments, underscoring its effectiveness in automated segmentation tasks. This work advances the field of HNC imaging by providing a robust, generalizable solution for automated Head and Neck Tumor Segmentation, ultimately enhancing the quality of care for patients undergoing RT. Our team name is DeepLearnAI (CEMRG). The code for this work is available at https://github.com/RespectKnowledge/SSL-based-DINOv2_Vision-LSTM_Head-and-Neck-Tumor_Segmentation.</p>\",\"PeriodicalId\":520475,\"journal\":{\"name\":\"Head and Neck Tumor Segmentation for MR-Guided Applications : First MICCAI Challenge, HNTS-MRG 2024, held in conjunction with MICCAI 2024, Marrakesh, Morocco, October 17, 2024, proceedings\",\"volume\":\"15273 \",\"pages\":\"166-178\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12091698/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Head and Neck Tumor Segmentation for MR-Guided Applications : First MICCAI Challenge, HNTS-MRG 2024, held in conjunction with MICCAI 2024, Marrakesh, Morocco, October 17, 2024, proceedings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/978-3-031-83274-1_12\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/3 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Head and Neck Tumor Segmentation for MR-Guided Applications : First MICCAI Challenge, HNTS-MRG 2024, held in conjunction with MICCAI 2024, Marrakesh, Morocco, October 17, 2024, proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-3-031-83274-1_12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/3 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
Assessing Self-supervised xLSTM-UNet Architectures for Head and Neck Tumor Segmentation in MR-Guided Applications.
Radiation therapy (RT) plays a pivotal role in treating head and neck cancer (HNC), with MRI-guided approaches offering superior soft tissue contrast and daily adaptive capabilities that significantly enhance treatment precision while minimizing side effects. To optimize MRI-guided adaptive RT for HNC, we propose a novel two-stage model for Head and Neck Tumor Segmentation. In the first stage, we leverage a Self-Supervised 3D Student-Teacher Learning Framework, specifically utilizing the DINOv2 architecture, to learn effective representations from a limited unlabeled dataset. This approach effectively addresses the challenge posed by the scarcity of annotated data, enabling the model to generalize better in tumor identification and segmentation. In the second stage, we fine-tune an xLSTM-based UNet model that is specifically designed to capture both spatial and sequential features of tumor progression. This hybrid architecture improves segmentation accuracy by integrating temporal dependencies, making it particularly well-suited for MRI-guided adaptive RT planning in HNC. The model's performance is rigorously evaluated on a diverse set of HNC cases, demonstrating significant improvements over state-of-the-art deep learning models in accurately segmenting tumor structures. Our proposed solution achieved an impressive mean aggregated Dice Coefficient of 0.81 for pre-RT segments and 0.65 for mid-RT segments, underscoring its effectiveness in automated segmentation tasks. This work advances the field of HNC imaging by providing a robust, generalizable solution for automated Head and Neck Tumor Segmentation, ultimately enhancing the quality of care for patients undergoing RT. Our team name is DeepLearnAI (CEMRG). The code for this work is available at https://github.com/RespectKnowledge/SSL-based-DINOv2_Vision-LSTM_Head-and-Neck-Tumor_Segmentation.