放射治疗中计算机断层图像自动肝脏分割的定性评价

IF 1.5 4区 医学 Q4 ONCOLOGY
Dorea Maria Khalal , Souleyman Slimani , Zine Eddine Bouraoui , Hacene Azizi
{"title":"放射治疗中计算机断层图像自动肝脏分割的定性评价","authors":"Dorea Maria Khalal ,&nbsp;Souleyman Slimani ,&nbsp;Zine Eddine Bouraoui ,&nbsp;Hacene Azizi","doi":"10.1016/j.canrad.2025.104648","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>Segmentation of target volumes and organs at risk on computed tomography (CT) images constitutes an important step in the radiotherapy workflow. Artificial intelligence-based methods have significantly improved organ segmentation in medical images. Automatic segmentations are frequently evaluated using geometric metrics. Before a clinical implementation in the radiotherapy workflow, automatic segmentations must also be evaluated by clinicians. The aim of this study was to investigate the correlation between geometric metrics used for segmentation evaluation and the assessment performed by clinicians.</div></div><div><h3>Materials and Methods</h3><div>In this study, we used the U-Net model to segment the liver in CT images from a publicly available dataset. The model's performance was evaluated using two geometric metrics: the Dice similarity coefficient and the Hausdorff distance. Additionally, a qualitative evaluation was performed by clinicians who reviewed the automatic segmentations to rate their clinical acceptability for use in the radiotherapy workflow. The correlation between the geometric metrics and the clinicians’ evaluations was studied.</div></div><div><h3>Results</h3><div>The results showed that while the Dice coefficient and Hausdorff distance are reliable indicators of segmentation accuracy, they do not always align with clinician segmentation. In some cases, segmentations with high Dice scores still required clinician corrections before clinical use in the radiotherapy workflow.</div></div><div><h3>Conclusion</h3><div>This study highlights the need for more comprehensive evaluation metrics beyond geometric measures to assess the clinical acceptability of artificial intelligence-based segmentation. Although the deep learning model provided promising segmentation results, the present study shows that standardized validation methodologies are crucial for ensuring the clinical viability of automatic segmentation systems.</div></div>","PeriodicalId":9504,"journal":{"name":"Cancer Radiotherapie","volume":"29 4","pages":"Article 104648"},"PeriodicalIF":1.5000,"publicationDate":"2025-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Qualitative evaluation of automatic liver segmentation in computed tomography images for clinical use in radiation therapy\",\"authors\":\"Dorea Maria Khalal ,&nbsp;Souleyman Slimani ,&nbsp;Zine Eddine Bouraoui ,&nbsp;Hacene Azizi\",\"doi\":\"10.1016/j.canrad.2025.104648\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Purpose</h3><div>Segmentation of target volumes and organs at risk on computed tomography (CT) images constitutes an important step in the radiotherapy workflow. Artificial intelligence-based methods have significantly improved organ segmentation in medical images. Automatic segmentations are frequently evaluated using geometric metrics. Before a clinical implementation in the radiotherapy workflow, automatic segmentations must also be evaluated by clinicians. The aim of this study was to investigate the correlation between geometric metrics used for segmentation evaluation and the assessment performed by clinicians.</div></div><div><h3>Materials and Methods</h3><div>In this study, we used the U-Net model to segment the liver in CT images from a publicly available dataset. The model's performance was evaluated using two geometric metrics: the Dice similarity coefficient and the Hausdorff distance. Additionally, a qualitative evaluation was performed by clinicians who reviewed the automatic segmentations to rate their clinical acceptability for use in the radiotherapy workflow. The correlation between the geometric metrics and the clinicians’ evaluations was studied.</div></div><div><h3>Results</h3><div>The results showed that while the Dice coefficient and Hausdorff distance are reliable indicators of segmentation accuracy, they do not always align with clinician segmentation. In some cases, segmentations with high Dice scores still required clinician corrections before clinical use in the radiotherapy workflow.</div></div><div><h3>Conclusion</h3><div>This study highlights the need for more comprehensive evaluation metrics beyond geometric measures to assess the clinical acceptability of artificial intelligence-based segmentation. Although the deep learning model provided promising segmentation results, the present study shows that standardized validation methodologies are crucial for ensuring the clinical viability of automatic segmentation systems.</div></div>\",\"PeriodicalId\":9504,\"journal\":{\"name\":\"Cancer Radiotherapie\",\"volume\":\"29 4\",\"pages\":\"Article 104648\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-06-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cancer Radiotherapie\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1278321825000642\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Radiotherapie","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1278321825000642","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目的计算机断层扫描(CT)图像中靶体积和危险器官的分割是放疗工作流程中的重要步骤。基于人工智能的方法显著改善了医学图像中的器官分割。自动分割通常使用几何度量来评估。在放疗工作流程的临床实施之前,自动分割也必须由临床医生进行评估。本研究的目的是调查用于分割评估的几何指标与临床医生进行的评估之间的相关性。材料和方法在本研究中,我们使用U-Net模型从公开可用的数据集中分割CT图像中的肝脏。模型的性能用两个几何指标来评估:Dice相似系数和Hausdorff距离。此外,临床医生对自动分割进行了定性评估,以评估其在放疗工作流程中使用的临床可接受性。研究几何指标与临床医师评价的相关性。结果Dice系数和Hausdorff距离是分割精度的可靠指标,但并不总是与临床医生的分割一致。在某些情况下,高Dice分数的分割仍然需要临床医生在临床使用放射治疗工作流程之前进行校正。结论本研究强调了在几何测量之外需要更全面的评估指标来评估基于人工智能的分割的临床可接受性。尽管深度学习模型提供了有希望的分割结果,但目前的研究表明,标准化的验证方法对于确保自动分割系统的临床可行性至关重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Qualitative evaluation of automatic liver segmentation in computed tomography images for clinical use in radiation therapy

Purpose

Segmentation of target volumes and organs at risk on computed tomography (CT) images constitutes an important step in the radiotherapy workflow. Artificial intelligence-based methods have significantly improved organ segmentation in medical images. Automatic segmentations are frequently evaluated using geometric metrics. Before a clinical implementation in the radiotherapy workflow, automatic segmentations must also be evaluated by clinicians. The aim of this study was to investigate the correlation between geometric metrics used for segmentation evaluation and the assessment performed by clinicians.

Materials and Methods

In this study, we used the U-Net model to segment the liver in CT images from a publicly available dataset. The model's performance was evaluated using two geometric metrics: the Dice similarity coefficient and the Hausdorff distance. Additionally, a qualitative evaluation was performed by clinicians who reviewed the automatic segmentations to rate their clinical acceptability for use in the radiotherapy workflow. The correlation between the geometric metrics and the clinicians’ evaluations was studied.

Results

The results showed that while the Dice coefficient and Hausdorff distance are reliable indicators of segmentation accuracy, they do not always align with clinician segmentation. In some cases, segmentations with high Dice scores still required clinician corrections before clinical use in the radiotherapy workflow.

Conclusion

This study highlights the need for more comprehensive evaluation metrics beyond geometric measures to assess the clinical acceptability of artificial intelligence-based segmentation. Although the deep learning model provided promising segmentation results, the present study shows that standardized validation methodologies are crucial for ensuring the clinical viability of automatic segmentation systems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Cancer Radiotherapie
Cancer Radiotherapie 医学-核医学
CiteScore
2.20
自引率
23.10%
发文量
129
审稿时长
63 days
期刊介绍: Cancer/radiothérapie se veut d''abord et avant tout un organe francophone de publication des travaux de recherche en radiothérapie. La revue a pour objectif de diffuser les informations majeures sur les travaux de recherche en cancérologie et tout ce qui touche de près ou de loin au traitement du cancer par les radiations : technologie, radiophysique, radiobiologie et radiothérapie clinique.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信