{"title":"Deep-learning based automated pancreas segmentation on CT scans of chronic pancreatitis patients","authors":"Surenth Nalliah , Esben Bolvig Mark , Marjolein Henrieke Liedenbaum , Mille Kristence Lillien Mosegaard , Tobias Hellström , Erlend Hodneland , Ingfrid Helene Salvesen Haldorsen , Trond Engjom , Asbjørn Mohr Drewes , Søren Schou Olesen , Jens Brøndum Frøkjær","doi":"10.1016/j.ejrad.2025.112175","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>This study aimed to develop an artificial intelligence (AI)-based segmentation model for accurate delineation of the complex pancreas in patients with chronic pancreatitis (CP) using computer tomography (CT) scans obtained during routine clinical work-up. Validation was performed with internal and external test datasets. A secondary objective was to evaluate the impact of visceral fat area (at the third lumbar level), pancreas volume, and CT parameters on model performance.</div></div><div><h3>Methods</h3><div>This multicenter study included 550 retrospectively collected CT scans from Aalborg (n = 373; 224 CP, 150 healthy subjects) and Bergen Hospitals (n = 97 CP), and an online dataset from the National Institutes of Health (NIH) (n = 80, healthy subjects). The Aalborg dataset was divided into a training cohort (n = 326) and an internal test set (n = 47), while the Bergen and NIH datasets served as external test sets. The AI model employed the nnU-Net architecture, with performance evaluated using the Sørensen-Dice index. Correlations with visceral fat, pancreas volume, and CT parameters were assessed.</div></div><div><h3>Results</h3><div>The pancreas segmentation AI model achieved a Dice score of 0.85 ± 0.08 on the Aalborg test set, 0.79 ± 0.19 on the Bergen dataset, and 0.79 ± 0.18 on the NIH dataset. Visceral fat and pancreas volume positively correlated with Dice scores (r = 0.45 and r = 0.53, both <em>p</em> < 0.0001), while CT parameters had no significant impact (all <em>p</em>-values > 0.07).</div></div><div><h3>Conclusion</h3><div>The AI model demonstrated high accuracy and robustness in pancreas segmentation of both CP patients and healthy subjects, and across diverse sites and scanners, suggesting its potential for clinical application.</div></div>","PeriodicalId":12063,"journal":{"name":"European Journal of Radiology","volume":"189 ","pages":"Article 112175"},"PeriodicalIF":3.2000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0720048X2500261X","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives
This study aimed to develop an artificial intelligence (AI)-based segmentation model for accurate delineation of the complex pancreas in patients with chronic pancreatitis (CP) using computer tomography (CT) scans obtained during routine clinical work-up. Validation was performed with internal and external test datasets. A secondary objective was to evaluate the impact of visceral fat area (at the third lumbar level), pancreas volume, and CT parameters on model performance.
Methods
This multicenter study included 550 retrospectively collected CT scans from Aalborg (n = 373; 224 CP, 150 healthy subjects) and Bergen Hospitals (n = 97 CP), and an online dataset from the National Institutes of Health (NIH) (n = 80, healthy subjects). The Aalborg dataset was divided into a training cohort (n = 326) and an internal test set (n = 47), while the Bergen and NIH datasets served as external test sets. The AI model employed the nnU-Net architecture, with performance evaluated using the Sørensen-Dice index. Correlations with visceral fat, pancreas volume, and CT parameters were assessed.
Results
The pancreas segmentation AI model achieved a Dice score of 0.85 ± 0.08 on the Aalborg test set, 0.79 ± 0.19 on the Bergen dataset, and 0.79 ± 0.18 on the NIH dataset. Visceral fat and pancreas volume positively correlated with Dice scores (r = 0.45 and r = 0.53, both p < 0.0001), while CT parameters had no significant impact (all p-values > 0.07).
Conclusion
The AI model demonstrated high accuracy and robustness in pancreas segmentation of both CP patients and healthy subjects, and across diverse sites and scanners, suggesting its potential for clinical application.
期刊介绍:
European Journal of Radiology is an international journal which aims to communicate to its readers, state-of-the-art information on imaging developments in the form of high quality original research articles and timely reviews on current developments in the field.
Its audience includes clinicians at all levels of training including radiology trainees, newly qualified imaging specialists and the experienced radiologist. Its aim is to inform efficient, appropriate and evidence-based imaging practice to the benefit of patients worldwide.