{"title":"MD-SA2:优化分段任何2多模式,深度感知脑肿瘤细分撒哈拉以南地区的人口。","authors":"Benjamin Li, Kai Ding, Dimah Dera","doi":"10.1117/1.JMI.12.2.024007","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Machine learning algorithms are emerging as valuable aides for radiologists in medical image segmentation due to their accuracy and speed. However, existing approaches, including both conventional machine learning and Segment Anything (SA)-based models, face challenges with the complex, multimodal, and varied quality of magnetic resonance imaging (MRI) scan images used for brain tumor segmentation. To address these challenges, we propose MD-SA2, adapting Segment Anything 2 (SA2) to medical image segmentation and introducing a lightweight U-Net \"aggregator\" model.</p><p><strong>Approach: </strong>Various modifications are incorporated to enhance segmentation accuracy and throughput. SA2 is first customized and fine-tuned for greater efficiency than the original Segment Anything. Slices from multiple image modalities are concatenated for input into the image encoder to improve the delineation of tumor subtypes. In addition, a lightweight U-Net aggregator model is integrated with SA2 to introduce depth awareness. The 2023 BraTS-Africa dataset, containing low-resolution MRI images from 60 sub-Saharan patients, is used to evaluate the algorithm's performance.</p><p><strong>Results: </strong>MD-SA2 attains notable improvements over existing approaches under challenging data circumstances. It achieves a tenfold cross-validated, statistically significant improvement over current methods with a 0.7893 Dice coefficient. It also reaches a higher Intersection over Union and lower 95% Hausdorff distance metrics. An ablation study verifies the impact of key components.</p><p><strong>Conclusions: </strong>MD-SA2 displays strong potential for supporting the diagnosis and treatment planning of brain tumors. It may contribute to narrowing health inequities, especially in medically underserved areas where data quantity and quality limitations reduce the efficacy of traditional automated approaches.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 2","pages":"024007"},"PeriodicalIF":1.7000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12014943/pdf/","citationCount":"0","resultStr":"{\"title\":\"MD-SA2: optimizing Segment Anything 2 for multimodal, depth-aware brain tumor segmentation in sub-Saharan populations.\",\"authors\":\"Benjamin Li, Kai Ding, Dimah Dera\",\"doi\":\"10.1117/1.JMI.12.2.024007\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>Machine learning algorithms are emerging as valuable aides for radiologists in medical image segmentation due to their accuracy and speed. However, existing approaches, including both conventional machine learning and Segment Anything (SA)-based models, face challenges with the complex, multimodal, and varied quality of magnetic resonance imaging (MRI) scan images used for brain tumor segmentation. To address these challenges, we propose MD-SA2, adapting Segment Anything 2 (SA2) to medical image segmentation and introducing a lightweight U-Net \\\"aggregator\\\" model.</p><p><strong>Approach: </strong>Various modifications are incorporated to enhance segmentation accuracy and throughput. SA2 is first customized and fine-tuned for greater efficiency than the original Segment Anything. Slices from multiple image modalities are concatenated for input into the image encoder to improve the delineation of tumor subtypes. In addition, a lightweight U-Net aggregator model is integrated with SA2 to introduce depth awareness. The 2023 BraTS-Africa dataset, containing low-resolution MRI images from 60 sub-Saharan patients, is used to evaluate the algorithm's performance.</p><p><strong>Results: </strong>MD-SA2 attains notable improvements over existing approaches under challenging data circumstances. It achieves a tenfold cross-validated, statistically significant improvement over current methods with a 0.7893 Dice coefficient. It also reaches a higher Intersection over Union and lower 95% Hausdorff distance metrics. An ablation study verifies the impact of key components.</p><p><strong>Conclusions: </strong>MD-SA2 displays strong potential for supporting the diagnosis and treatment planning of brain tumors. It may contribute to narrowing health inequities, especially in medically underserved areas where data quantity and quality limitations reduce the efficacy of traditional automated approaches.</p>\",\"PeriodicalId\":47707,\"journal\":{\"name\":\"Journal of Medical Imaging\",\"volume\":\"12 2\",\"pages\":\"024007\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12014943/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Medical Imaging\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1117/1.JMI.12.2.024007\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/4/22 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JMI.12.2.024007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/22 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
MD-SA2: optimizing Segment Anything 2 for multimodal, depth-aware brain tumor segmentation in sub-Saharan populations.
Purpose: Machine learning algorithms are emerging as valuable aides for radiologists in medical image segmentation due to their accuracy and speed. However, existing approaches, including both conventional machine learning and Segment Anything (SA)-based models, face challenges with the complex, multimodal, and varied quality of magnetic resonance imaging (MRI) scan images used for brain tumor segmentation. To address these challenges, we propose MD-SA2, adapting Segment Anything 2 (SA2) to medical image segmentation and introducing a lightweight U-Net "aggregator" model.
Approach: Various modifications are incorporated to enhance segmentation accuracy and throughput. SA2 is first customized and fine-tuned for greater efficiency than the original Segment Anything. Slices from multiple image modalities are concatenated for input into the image encoder to improve the delineation of tumor subtypes. In addition, a lightweight U-Net aggregator model is integrated with SA2 to introduce depth awareness. The 2023 BraTS-Africa dataset, containing low-resolution MRI images from 60 sub-Saharan patients, is used to evaluate the algorithm's performance.
Results: MD-SA2 attains notable improvements over existing approaches under challenging data circumstances. It achieves a tenfold cross-validated, statistically significant improvement over current methods with a 0.7893 Dice coefficient. It also reaches a higher Intersection over Union and lower 95% Hausdorff distance metrics. An ablation study verifies the impact of key components.
Conclusions: MD-SA2 displays strong potential for supporting the diagnosis and treatment planning of brain tumors. It may contribute to narrowing health inequities, especially in medically underserved areas where data quantity and quality limitations reduce the efficacy of traditional automated approaches.
期刊介绍:
JMI covers fundamental and translational research, as well as applications, focused on medical imaging, which continue to yield physical and biomedical advancements in the early detection, diagnostics, and therapy of disease as well as in the understanding of normal. The scope of JMI includes: Imaging physics, Tomographic reconstruction algorithms (such as those in CT and MRI), Image processing and deep learning, Computer-aided diagnosis and quantitative image analysis, Visualization and modeling, Picture archiving and communications systems (PACS), Image perception and observer performance, Technology assessment, Ultrasonic imaging, Image-guided procedures, Digital pathology, Biomedical applications of biomedical imaging. JMI allows for the peer-reviewed communication and archiving of scientific developments, translational and clinical applications, reviews, and recommendations for the field.