Radiological Reporting of Brain Atrophy in MRI: Real-Life Comparison Between Narrative Reports, Semiquantitative Scales and Automated Software-Based Volumetry.
Federico Bruno, Cristina Fagotti, Gaspare Saltarelli, Giovanni Di Cerbo, Alessandra Sabatelli, Claudia De Felici, Antonio Innocenzi, Ernesto Di Cesare, Alessandra Splendiani
{"title":"Radiological Reporting of Brain Atrophy in MRI: Real-Life Comparison Between Narrative Reports, Semiquantitative Scales and Automated Software-Based Volumetry.","authors":"Federico Bruno, Cristina Fagotti, Gaspare Saltarelli, Giovanni Di Cerbo, Alessandra Sabatelli, Claudia De Felici, Antonio Innocenzi, Ernesto Di Cesare, Alessandra Splendiani","doi":"10.3390/diagnostics15101246","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background</b>: Accurate assessment of brain atrophy is essential in the diagnosis and monitoring of brain aging and neurodegenerative disorders. Radiological methods range from narrative reporting to semi-quantitative visual rating scales (VRSs) and fully automated volumetric software. However, their integration and consistency in clinical practice remain limited. <b>Methods</b>: In this retrospective study, brain MRI images of 43 patients were evaluated. Brain atrophy was assessed by extrapolating findings from narrative radiology reports, three validated VRSs (MTA, Koedam, Pasquier), and Pixyl.Neuro.BV, a commercially available volumetric software platform. Agreement between methods was assessed using intraclass correlation coefficients (ICCs), Cohen's kappa, Spearman's correlation, and McNemar tests. <b>Results</b>: Moderate correlation was found between narrative reports and VRSs (ρ = 0.55-0.69), but categorical agreement was limited (kappa = 0.21-0.30). Visual scales underestimated atrophy relative to software (mean scores: VRSs = 0.196; software = 0.279), while reports tended to overestimate. Agreement between VRSs and software was poor (kappa = 0.14-0.33), though MTA showed a significant correlation with hippocampal volume. Agreement between reports and software was lowest for global atrophy. <b>Conclusions</b>: Narrative reports, while common in practice, show low consistency with structured scales and quantitative software, especially in subtle cases. VRSs improve standardization but remain subjective and less sensitive. Integrating structured scales and volumetric tools into clinical workflows may enhance diagnostic accuracy and consistency in dementia imaging.</p>","PeriodicalId":11225,"journal":{"name":"Diagnostics","volume":"15 10","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12110413/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diagnostics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/diagnostics15101246","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Accurate assessment of brain atrophy is essential in the diagnosis and monitoring of brain aging and neurodegenerative disorders. Radiological methods range from narrative reporting to semi-quantitative visual rating scales (VRSs) and fully automated volumetric software. However, their integration and consistency in clinical practice remain limited. Methods: In this retrospective study, brain MRI images of 43 patients were evaluated. Brain atrophy was assessed by extrapolating findings from narrative radiology reports, three validated VRSs (MTA, Koedam, Pasquier), and Pixyl.Neuro.BV, a commercially available volumetric software platform. Agreement between methods was assessed using intraclass correlation coefficients (ICCs), Cohen's kappa, Spearman's correlation, and McNemar tests. Results: Moderate correlation was found between narrative reports and VRSs (ρ = 0.55-0.69), but categorical agreement was limited (kappa = 0.21-0.30). Visual scales underestimated atrophy relative to software (mean scores: VRSs = 0.196; software = 0.279), while reports tended to overestimate. Agreement between VRSs and software was poor (kappa = 0.14-0.33), though MTA showed a significant correlation with hippocampal volume. Agreement between reports and software was lowest for global atrophy. Conclusions: Narrative reports, while common in practice, show low consistency with structured scales and quantitative software, especially in subtle cases. VRSs improve standardization but remain subjective and less sensitive. Integrating structured scales and volumetric tools into clinical workflows may enhance diagnostic accuracy and consistency in dementia imaging.
DiagnosticsBiochemistry, Genetics and Molecular Biology-Clinical Biochemistry
CiteScore
4.70
自引率
8.30%
发文量
2699
审稿时长
19.64 days
期刊介绍:
Diagnostics (ISSN 2075-4418) is an international scholarly open access journal on medical diagnostics. It publishes original research articles, reviews, communications and short notes on the research and development of medical diagnostics. There is no restriction on the length of the papers. Our aim is to encourage scientists to publish their experimental and theoretical research in as much detail as possible. Full experimental and/or methodological details must be provided for research articles.