Felix L Herr, Natascha Hohmann, Christian Dascalescu, Boj Hoppe, Hannah Gildein, Verena Schäfer, Jens Ricke, Boris M Holzapfel, Lennart Schröder, Nina Hesse, Jörg Arnholdt, Paul Reidler
{"title":"3D isotropic FastView MRI localizer allows reliable torsion measurements of the lower limb.","authors":"Felix L Herr, Natascha Hohmann, Christian Dascalescu, Boj Hoppe, Hannah Gildein, Verena Schäfer, Jens Ricke, Boris M Holzapfel, Lennart Schröder, Nina Hesse, Jörg Arnholdt, Paul Reidler","doi":"10.1186/s41747-025-00631-9","DOIUrl":"10.1186/s41747-025-00631-9","url":null,"abstract":"<p><p>Computed tomography (CT) and magnetic resonance imaging (MRI) are commonly used to assess femoral and tibial torsion. While CT offers high spatial resolution, it involves ionizing radiation. MRI avoids radiation but requires multiple sequences and extended acquisition time. We retrospectively evaluated whether a three-dimensional isotropic MRI localizer (FastView) could serve as a reliable and faster alternative. In this retrospective single-center study, 60 lower limbs from 30 patients, aged 27.1 ± 11.5 years (mean ± standard deviation), 19 females and 11 males, were assessed using both FastView and a dedicated MRI protocol. FastView (5 × 5 × 5 mm<sup>3</sup> voxels) imaged the entire lower limb in 17.4 s compared to nearly 7 min for the dedicated protocol. Torsion angles were measured independently by two readers. Agreement between methods was evaluated using intraclass correlation coefficients (ICCs), Bland-Altman plots, and Pearson R². No significant differences in torsion values were found (all p > 0.305). Femoral (ICC: 0.91-0.96) and tibial (ICC: 0.91-0.94) torsion showed excellent inter-modality agreement. Inter-reader reliability was also high (ICC: 0.95-0.99). Correlation values confirmed strong agreement (R²: 0.891-0.963). FastView demonstrated accuracy comparable to the dedicated protocol, offering a fast, efficient, and radiation-free option for routine torsion assessment. RELEVANCE STATEMENT: FastView MRI localizer offers a fast and resource-efficient method for assessing lower limb torsion, potentially replacing standard multisequence protocols in routine clinical practice. KEY POINTS: FastView MRI enables lower limb torsion measurements with full-limb coverage in under 20 s. Torsion angles from FastView and dedicated MRI showed no significant differences. Femoral and tibial ICCs between 0.91 and 0.96 confirm excellent inter-protocol agreement. Inter-reader agreement was consistently high across both protocols. FastView may replace multisequence MRI protocols in routine clinical torsion assessment.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"87"},"PeriodicalIF":3.6,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12420559/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145030965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kelly Payette, Julia Geiger, Michael Zellner, Céline Steger, Christian J Kellenberger, Ruth Tuura, Raimund Kottke, Andras Jakab
{"title":"Lung volume segmentation in fetal MRI: super-resolution reconstructions improve inter-rater reliability.","authors":"Kelly Payette, Julia Geiger, Michael Zellner, Céline Steger, Christian J Kellenberger, Ruth Tuura, Raimund Kottke, Andras Jakab","doi":"10.1186/s41747-025-00628-4","DOIUrl":"10.1186/s41747-025-00628-4","url":null,"abstract":"<p><strong>Background: </strong>Fetal MRI is increasingly used to investigate fetal lung pathologies, and super-resolution (SR) algorithms could be a powerful clinical tool for this assessment. Our goal was to investigate whether SR reconstructions result in an improved agreement in lung volume measurements determined by different raters, also known as inter-rater reliability.</p><p><strong>Materials and methods: </strong>In this single-center retrospective study, fetal lung volumes calculated from both SR reconstructions and the original images were analyzed. Three radiologists manually segmented the fetal lungs and rated the image quality of all images and reconstructions. Fetal lung volumes were calculated, and the coefficient of variation (CV) was determined for each set of images. Bland-Altman plots were generated, and intraclass correlation coefficients (ICCs) were calculated. A one-sided paired Wilcoxon test was used to compare the fetal lung volume CVs, and a two-sided paired t-test was used to compare the lung volumes. The quality ratings were compared using a two-sided paired Wilcoxon test.</p><p><strong>Results: </strong>A total of 98 fetal scans with gestational ages from 19 to 37 weeks were evaluated. There was a significantly lower CV in the lung volumes segmented from the SR reconstructions (p < 0.001), and the ICCs of the reconstructions were higher than those determined from the original images. Bland-Altman plots demonstrated better agreement in the SR reconstruction lung volumes. No significant differences in quality ratings or lung volumes were found.</p><p><strong>Conclusion: </strong>SR reconstructions of the fetal lungs in MRI enabled better inter-rater reliability of fetal lung volume assessment.</p><p><strong>Relevance statement: </strong>SR reconstructions of the fetal body, obtained through fetal MRI, can be a valuable tool for improving the inter-rater reliability of fetal lung volume measurements, a crucial clinical biomarker for assessing fetal development and predicting pregnancy outcomes.</p><p><strong>Key points: </strong>Deformable slice-to-volume reconstructions of fetal body MRI could be a valuable clinical tool. Quantitative advantages of fetal body MRI reconstructions need to be proven. Fetal body MRI reconstructions improved inter-rater reliability in lung volume measurements.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"88"},"PeriodicalIF":3.6,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12420550/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145030933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Lung lobe segmentation: performance of open-source MOOSE, TotalSegmentator, and LungMask models compared to a local in-house model.","authors":"Elaheh Amini, Ran Klein","doi":"10.1186/s41747-025-00623-9","DOIUrl":"10.1186/s41747-025-00623-9","url":null,"abstract":"<p><strong>Background: </strong>Lung lobe segmentation is required to assess lobar function with nuclear imaging before surgical interventions. We evaluated the performance of open-source deep learning-based lung lobe segmentation tools, compared to a similar nnU-Net model trained on a smaller but more representative clinical dataset.</p><p><strong>Materials and methods: </strong>We collated and semi-automatically segmented an internal dataset of 164 computed tomography scans and classified them for task difficulty as easy, moderate, or hard. The performance of three open-source models-multi-organ objective segmentation (MOOSE), TotalSegmentator, and LungMask-was assessed using Dice similarity coefficient (DSC), robust Hausdorff distance (rHd95), and normalized surface distance (NSD). Additionally, we trained, validated, and tested an nnU-Net model using our local dataset and compared its performance with that of the other software on the test subset. All models were evaluated for generalizability using an external competition (LOLA11, n = 55).</p><p><strong>Results: </strong>TotalSegmentator outperformed MOOSE in DSC and NSD across all difficulty levels (p < 0.001), but not in rHd95 (p = 1.000). MOOSE and TotalSegmentator surpassed LungMask across metrics and difficulty classes (p < 0.001). Our model exceeded all other models on the internal dataset (n = 33) in all metrics, across all difficulty classes (p < 0.001), and on the external dataset. Missing lobes were correctly identified only by our model and LungMask in 3 and 1 of 7 cases, respectively.</p><p><strong>Conclusion: </strong>Open-source segmentation tools perform well in straightforward cases but struggle in unfamiliar, complex cases. Training on diverse, specialized datasets can improve generalizability, emphasizing representative data over sheer quantity.</p><p><strong>Relevance statement: </strong>Training lung lobe segmentation models on a local variety of cases improves accuracy, thus enhancing presurgical planning, ventilation-perfusion analysis, and disease localization, potentially impacting treatment decisions and patient outcomes in respiratory and thoracic care.</p><p><strong>Key points: </strong>Deep learning models trained on non-specialized datasets struggle with complex lung anomalies, yet their real-world limitations are insufficiently assessed. Training an identical model on a smaller yet clinically diverse and representative cohort improved performance in challenging cases. Data diversity outweighs the quantity in deep learning-based segmentation models. Accurate lung lobe segmentation may enhance presurgical assessment of lung lobar ventilation and perfusion function, optimizing clinical decision-making and patient outcomes.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"86"},"PeriodicalIF":3.6,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12411369/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145001574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Rethinking feature reproducibility in radiomics: the elephant in the dark.","authors":"Aydin Demircioğlu","doi":"10.1186/s41747-025-00629-3","DOIUrl":"10.1186/s41747-025-00629-3","url":null,"abstract":"<p><p>In radiomics, features are often linked to biomarkers and are generally expected to be reproducible, as reproducibility is considered a prerequisite for developing predictive models in clinical applications. However, this perspective overlooks feature interactions and may underestimate the potential value of nonreproducible features. Through experiments simulating a test-retest scenario, we demonstrate that even non-reproducible features can contribute significantly to predictive performance. Removing these features can lower model accuracy. These findings suggest that the emphasis on feature reproducibility should be reconsidered and that features should not be evaluated in isolation. Underlying information can be spread across multiple features. Focusing on individual features ignores feature interactions and may limit the model's predictive power. Ultimately, radiomics must prioritize prediction and clinical relevance. KEY POINTS: Feature reproducibility assessments often ignore feature interactions, overlooking predictive performance. Feature reproducibility depends on subjective thresholds, chosen metrics, and sample size. Nonreproducible features can be more predictive than reproducible ones. Predictive information may be distributed across multiple features rather than confined to individual ones.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"85"},"PeriodicalIF":3.6,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12411371/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145001535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Frederic Deschamps, Enzo Gautreau, Lambros Tselikas, Baptiste Bonnet, Paul Beunon, Adlane Feddal, Thierry de Baere, Amelie Gaudin, Lluis M Mir
{"title":"Electrochemotherapy of spinal metastasis using transpedicular approach: a preclinical safety animal study.","authors":"Frederic Deschamps, Enzo Gautreau, Lambros Tselikas, Baptiste Bonnet, Paul Beunon, Adlane Feddal, Thierry de Baere, Amelie Gaudin, Lluis M Mir","doi":"10.1186/s41747-025-00607-9","DOIUrl":"10.1186/s41747-025-00607-9","url":null,"abstract":"<p><strong>Background: </strong>Electrochemotherapy (ECT) of vertebral metastasis is a new treatment option for metastasis that is not accessible to thermal ablation or radiotherapy. A numerical feasibility study has investigated the transpedicular approach for electrode insertion. We conducted a preclinical study to assess its safety.</p><p><strong>Methods: </strong>Histologic examination of the spinal cord was performed in 12 consecutive pigs treated with ECT at three consecutive levels (T11, T12, and L1) to evaluate any cellular or vascular damage. Pigs of group A (n = 6) had an intraoperative neuromonitoring immediately for 1 h after ECT and then were euthanized. Pain and clinical symptoms were daily evaluated for group B (n = 3) and group C (n = 3) until day-3 and day-30, respectively.</p><p><strong>Results: </strong>At gross pathology, no apoptosis, no vascular/thrombosis or hemorrhagic focus was observed in any pig. Motor-evoked potential responses of the lower limbs were transiently lost in response in 5 of the 6 pigs, but complete recovery always occurred within 30 min. Clinical examination (groups B and C) revealed no symptoms during the follow-up. Pigs were all able to walk normally, without weakness or paralysis of the lower extremities. No urinary/fecal retention or incontinence was observed, nor any sign of pain.</p><p><strong>Conclusion: </strong>Our results confirm that the insertion of electrodes through the pedicles is safe for the ECT of vertebral metastases. Further studies are needed to evaluate the safety profile of ECT of vertebral metastases invading the cortical and epidural fat, which represents a privileged pathway for the electric field between the electrodes.</p><p><strong>Relevance statement: </strong>Electrochemotherapy of vertebral metastases should be performed using a transpedicular approach for the insertion of electrodes, without definitive sequelae at the spinal cord level.</p><p><strong>Key points: </strong>Electrochemotherapy is a new treatment for vertebral metastases not accessible to radiotherapy, but it could result in spinal cord injury related to electrical trauma. In a swine model, the transpedicular approach has demonstrated no definitive sequelae at intraoperative neuromonitoring and during clinical follow-up. Electrochemotherapy should be performed using a transpedicular approach to avoid spinal cord damage.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"84"},"PeriodicalIF":3.6,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12411335/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144993790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xin Bai, Lin Lu, Anli Tong, Jianhua Deng, Lili Xu, Xiaoxiao Zhang, Jiahui Zhang, Li Chen, Qianyu Peng, Erjia Guo, Yongfei Wu, Yun Wang, Kai Xu, Chao Zhang, Xi Zhao, Zhengyu Jin, Gumuyang Zhang, Hao Sun
{"title":"Virtual noncontrast images of adrenal lesions: a photon-counting CT prospective study.","authors":"Xin Bai, Lin Lu, Anli Tong, Jianhua Deng, Lili Xu, Xiaoxiao Zhang, Jiahui Zhang, Li Chen, Qianyu Peng, Erjia Guo, Yongfei Wu, Yun Wang, Kai Xu, Chao Zhang, Xi Zhao, Zhengyu Jin, Gumuyang Zhang, Hao Sun","doi":"10.1186/s41747-025-00621-x","DOIUrl":"https://doi.org/10.1186/s41747-025-00621-x","url":null,"abstract":"<p><strong>Background: </strong>The value of virtual noncontrast (VNC) images from photon-counting computed tomography (PCCT) for evaluating adrenal lesions and diagnosing adrenal adenomas remains to be clarified.</p><p><strong>Materials and methods: </strong>Participants with adrenal masses who underwent unenhanced and portal venous phase PCCT were prospectively included. Portal-venous phase images were reconstructed using conventional VNC (VNC<sub>Conv</sub>) and PureCalcium VNC (VNC<sub>PC</sub>). We measured two-dimensional (2D) attenuation of adrenal masses at their largest slice on true noncontrast (TNC), VNC<sub>Conv</sub>, and VNC<sub>PC</sub> images. Three-dimensional (3D) attenuation and radiomic features of adrenal masses were semiautomatically extracted. These parameters were statistically compared, and diagnostic performance for adenomas was evaluated.</p><p><strong>Results: </strong>The study included 54 participants (27 females, mean age 45.3 years) with 68 adrenal lesions. Attenuation values on VNC were higher than those on TNC. TNC, VNC<sub>Conv</sub>, and VNC<sub>PC</sub> attenuation values did not differ between 2D and 3D measurements. The intraclass correlation coefficients of first-order, shape, and texture features between TNC and VNC were 0.671, 0.822, and 0.616, respectively. The sensitivity and specificity of the proposed thresholds (VNC<sub>Conv</sub> 25 HU, VNC<sub>PC</sub> 20 HU) were higher than those of the previously established threshold of 10 HU in diagnosing adenomas. There was no significant difference between VNC<sub>Conv</sub> and VNC<sub>PC</sub> in diagnosing adenomas (area under the receiver operating characteristic curve: 0.841 versus 0.838, p = 0.873).</p><p><strong>Conclusion: </strong>VNC algorithms from PCCT overestimated CT attenuation of adrenal lesions. Higher thresholds showed better diagnostic performance for discriminating adrenal adenomas from non-adenomas than the established 10 HU.</p><p><strong>Relevance statement: </strong>We investigated the application of VNC images from PCCT in adrenal disease. On VNC images, higher thresholds, superior to the accepted 10 HU, are needed for discriminating adenomas from non-adenomas, reducing the need for secondary examinations.</p><p><strong>Key points: </strong>This study investigated the value of VNC images from PCCT in adrenal lesions. VNC reconstruction overestimated the CT attenuation of adrenal lesions. Higher thresholds on VNC images were superior to the accepted 10 HU for differentiating adenomas from non-adenomas.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"82"},"PeriodicalIF":3.6,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12396999/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144972430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Johannes de Boer, Nigar Salimova, Friederike Weidemann, Lea Behrendt, Thomas Werncke, Frank K Wacker, Lena Sonnow
{"title":"Photon-counting CT versus energy-integrating detector and flat-panel CT for cadaveric wrist arthrography with additional tin filter dose reduction.","authors":"Johannes de Boer, Nigar Salimova, Friederike Weidemann, Lea Behrendt, Thomas Werncke, Frank K Wacker, Lena Sonnow","doi":"10.1186/s41747-025-00604-y","DOIUrl":"https://doi.org/10.1186/s41747-025-00604-y","url":null,"abstract":"<p><strong>Background: </strong>This study aimed to evaluate the imaging performance and diagnostic value of a photon-counting detector (PCD) computed tomography (CT) compared to an energy-integrating detector (EID) and flat panel detector (FPD) for cadaveric wrist arthrographies.</p><p><strong>Methods: </strong>Following ethics committee approval, ten cadaveric wrists were injected with diluted iodinated contrast agent. CT arthrographies using PCD-, EID-, and FPD-CT were performed. Six dose protocols between 0.1 mGy (using a tin filter) and 6 mGy, ultrahigh-resolution-mode, and two reconstruction kernels were used for the PCD-CT and EID-CT. FPD-CT images were reconstructed using a \"normal\" and \"sharp\" kernel. Signal-to-noise ratios (SNR) and contrast-to-noise ratios (CNR) were calculated and analyzed using analysis of variance (ANOVA) and post hoc tests. Three blinded radiologists independently rated image quality concerning trabecular, cartilage, and intrinsic structures. Intraclass correlation coefficients (ICC) were calculated, followed by a Friedman and post hoc test.</p><p><strong>Results: </strong>At 1.5 mGy, 3 mGy, and 6 mGy with the Br89 kernel, the PCD-CT yielded up to 2.35 times higher SNR and up to 7 times higher CNR than dose-equivalent and higher dose EID-CT scans. Subjective ratings favored the PCD-CT over the EID-CT and occasionally the FPD-CT, with a combined ICC of 0.942. Applying sharper kernels, SNR did not differ significantly between the PCD-CT (1.5 mGy, 3 mGy, and 6 mGy) and the FPD-CT.</p><p><strong>Conclusion: </strong>Using sharp kernels, the PCD-CT provided superior image quality to the EID-CT and achieved comparable or better quality than the FPD at certain parameters. Thus, the PCD-CT could be considered a possible alternative in clinical routine for evaluating wrist injuries.</p><p><strong>Relevance statement: </strong>This study demonstrates the potential of the PCD-CT as a valuable tool in diagnosing wrist injuries. Its superior image quality compared to the EID-CT can increase confidence in diagnosing subtle bone pathologies and additionally yields the possibility of radiation exposure reduction.</p><p><strong>Key points: </strong>The technical advantages of the PCD-CT allow for dose reduction while generating high-quality images. PCD-CT showed superior image quality over EID-CT and was comparable to the FPD-CT. PDC-CT offers improved visualization of fine joint structures in wrist arthrography and should be considered in clinical routine.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"83"},"PeriodicalIF":3.6,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12397008/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144972432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"MRI-based machine-learning radiomics of the liver to predict liver-related events in hepatitis B virus-associated fibrosis.","authors":"Yuankai Luo, Qinian Luo, Yaobo Wu, Shaorui Zhang, Huan Ren, Xiaofeng Wang, Xiujuan Liu, Qin Yang, Weiguo Xu, Qingsong Wu, Yong Li","doi":"10.1186/s41747-025-00602-0","DOIUrl":"https://doi.org/10.1186/s41747-025-00602-0","url":null,"abstract":"<p><strong>Background: </strong>The onset of liver-related events (LREs) in fibrosis indicates a poor prognosis and worsens patients' quality of life, making the prediction and early detection of LREs crucial. The aim of this study was to develop a radiomics model using liver magnetic resonance imaging (MRI) to predict LRE risk in patients undergoing antiviral treatment for chronic fibrosis caused by hepatitis B virus (HBV).</p><p><strong>Methods: </strong>Patients with HBV-associated liver fibrosis and liver stiffness measurements ≥ 10 kPa were included. Feature selection and dimensionality reduction techniques identified discriminative features from three MRI sequences. Radiomics models were built using eight machine learning techniques and evaluated for performance. Shapley additive explanation and permutation importance techniques were applied to interpret the model output.</p><p><strong>Results: </strong>A total of 222 patients aged 49 ± 10 years (mean ± standard deviation), 175 males, were evaluated, with 41 experiencing LREs. The radiomics model, incorporating 58 selected features, outperformed traditional clinical tools in prediction accuracy. Developed using a support vector machine classifier, the model achieved optimal areas under the receiver operating characteristic curves of 0.94 and 0.93 in the training and test sets, respectively, demonstrating good calibration.</p><p><strong>Conclusion: </strong>Machine learning techniques effectively predicted LREs in patients with fibrosis and HBV, offering comparable accuracy across algorithms and supporting personalized care decisions for HBV-related liver disease.</p><p><strong>Relevance statement: </strong>Radiomics models based on liver multisequence MRI can improve risk prediction and management of patients with HBV-associated chronic fibrosis. In addition, it offers valuable prognostic insights and aids in making informed clinical decisions.</p><p><strong>Key points: </strong>Liver-related events (LREs) are associated with poor prognosis in chronic fibrosis. Radiomics models could predict LREs in patients with hepatitis B-associated chronic fibrosis. Radiomics contributes to personalized care choices for patients with hepatitis B-associated fibrosis.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"81"},"PeriodicalIF":3.6,"publicationDate":"2025-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12390902/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144972387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christoph Birkl, Marlene Panzer, Christian Kames, Anna Maria Birkl-Toeglhofer, Alexander Rauscher, Bernhard Glodny, Elke R Gizewski, Heinz Zoller
{"title":"MRI R2* and quantitative susceptibility mapping in brain tissue with extreme iron overload.","authors":"Christoph Birkl, Marlene Panzer, Christian Kames, Anna Maria Birkl-Toeglhofer, Alexander Rauscher, Bernhard Glodny, Elke R Gizewski, Heinz Zoller","doi":"10.1186/s41747-025-00622-w","DOIUrl":"https://doi.org/10.1186/s41747-025-00622-w","url":null,"abstract":"<p><strong>Background: </strong>R2* and quantitative susceptibility mapping (QSM) are regarded as robust techniques for assessing iron content in the brain. While these techniques are established for normal or moderate iron levels, their usability in extreme iron overload, as seen in aceruloplasminemia (ACP), is unclear. We aimed to evaluate various R2* and QSM algorithms in assessing brain iron levels in patients with ACP compared to healthy controls.</p><p><strong>Materials and methods: </strong>We acquired a three-dimensional multiecho gradient-echo sequence for R2* and QSM in three patients with ACP and three healthy subjects. Six algorithms each for R2* and QSM were compared. QSM was performed with referencing to whole brain, to cerebrospinal fluid and without referencing. R2* and QSM values were assessed in the caudate nucleus, putamen, globus pallidus, and thalamus.</p><p><strong>Results: </strong>R2* values varied significantly across algorithms, particularly in the putamen (F(5,50) = 16.51, p < 0.001). For QSM, reference region choice (F(5,150) = 264, p < 0.001) and algorithm selection (F(2,9) = 10, p < 0.001) had an impact on susceptibility values. In patients, referencing to whole brain yielded lower susceptibility values than cerebrospinal fluid (median = 0.147 ppm, range = 0.527 ppm versus median = 0.279 ppm, range = 0.593 ppm).</p><p><strong>Conclusion: </strong>Extreme iron overload amplifies variability in R2* and QSM measurements. QSM referencing is particularly challenging in diffuse whole-brain iron accumulation; thus, analysis with multiple reference regions might mitigate bias. Both algorithm selection and referencing approaches play a pivotal role in determining measurement accuracy and clinical interpretation under extreme brain iron overload.</p><p><strong>Relevance statement: </strong>As QSM transitions into clinical use, it will encounter cases of extreme iron overload. Our study in patients with aceruloplasminemia revealed that the choice of reference region significantly influences susceptibility values, with variations exceeding algorithm-dependent differences.</p><p><strong>Key points: </strong>R2* and QSM vary across algorithms in brain tissue with iron overload. Whole-brain referenced QSM leads to lower susceptibility values in aceruloplasminemia patients. QSM, if properly processed, provides reliable maps in iron overload brain regions. In brain regions with extremely high iron content, R2* mapping might fail.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"80"},"PeriodicalIF":3.6,"publicationDate":"2025-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12374935/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144972725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jakub Pristoupil, Laura Oleaga, Vanesa Junquero, Cristina Merino, Suha Sureyya Ozbek, Lukas Lambert
{"title":"Five advanced chatbots solving European Diploma in Radiology (EDiR) text-based questions: differences in performance and consistency.","authors":"Jakub Pristoupil, Laura Oleaga, Vanesa Junquero, Cristina Merino, Suha Sureyya Ozbek, Lukas Lambert","doi":"10.1186/s41747-025-00591-0","DOIUrl":"10.1186/s41747-025-00591-0","url":null,"abstract":"<p><strong>Background: </strong>We compared the performance, confidence, and response consistency of five chatbots powered by large language models in solving European Diploma in Radiology (EDiR) text-based multiple-response questions.</p><p><strong>Methods: </strong>ChatGPT-4o, ChatGPT-4o-mini, Copilot, Gemini, and Claude 3.5 Sonnet were tested using 52 text-based multiple-response questions from two previous EDiR sessions in two iterations. Chatbots were prompted to evaluate each answer as correct or incorrect and grade its confidence level on a scale of 0 (not confident at all) to 10 (most confident). Scores per question were calculated using a weighted formula that accounted for correct and incorrect answers (range 0.0-1.0).</p><p><strong>Results: </strong>Claude 3.5 Sonnet achieved the highest score per question (0.84 ± 0.26, mean ± standard deviation) compared to ChatGPT-4o (0.76 ± 0.31), ChatGPT-4o-mini (0.64 ± 0.35), Copilot (0.62 ± 0.37), and Gemini (0.54 ± 0.39) (p < 0.001). A self-reported confidence in answering the questions was 9.0 ± 0.9 for Claude 3.5 Sonnet followed by ChatGPT-4o (8.7 ± 1.1), compared to ChatGPT-4o-mini (8.2 ± 1.3), Copilot (8.2 ± 2.2), and Gemini (8.2 ± 1.6, p < 0.001). Claude 3.5 Sonnet demonstrated superior consistency, changing responses in 5.4% of cases between the two iterations, compared to ChatGPT-4o (6.5%), ChatGPT-4o-mini (8.8%), Copilot (13.8%), and Gemini (18.5%). All chatbots outperformed human candidates from previous EDiR sessions, achieving a passing grade from this part of the examination.</p><p><strong>Conclusion: </strong>Claude 3.5 Sonnet exhibited superior accuracy, confidence, and consistency, with ChatGPT-4o performing nearly as well. The variation in performance among the evaluated models was substantial.</p><p><strong>Relevance statement: </strong>Variation in performance, consistency, and confidence among chatbots in solving EDiR test-based questions highlights the need for cautious deployment, particularly in high-stakes clinical and educational settings.</p><p><strong>Key points: </strong>Claude 3.5 Sonnet outperformed other chatbots in accuracy and response consistency. ChatGPT-4o ranked second, showing strong but slightly less reliable performance. All chatbots surpassed EDiR candidates in text-based EDiR questions.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"9 1","pages":"79"},"PeriodicalIF":3.6,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12364795/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144883978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}