Human Readers versus AI-Based Systems in ASPECTS Scoring for Acute Ischemic Stroke: A Systematic Review and Meta-Analysis with Region-Specific Guidance.
Ahmed Y Azzam, Ibrahim Hadadi, Leen M Al-Shahrani, Ummkulthum A Shanqeeti, Noor A Alqurqush, Mohammed A Alsehli, Rudaynah S Alali, Rahaf S Tammar, Mahmoud M Morsy, Muhammed Amir Essibayi
{"title":"Human Readers versus AI-Based Systems in ASPECTS Scoring for Acute Ischemic Stroke: A Systematic Review and Meta-Analysis with Region-Specific Guidance.","authors":"Ahmed Y Azzam, Ibrahim Hadadi, Leen M Al-Shahrani, Ummkulthum A Shanqeeti, Noor A Alqurqush, Mohammed A Alsehli, Rudaynah S Alali, Rahaf S Tammar, Mahmoud M Morsy, Muhammed Amir Essibayi","doi":"10.71079/aside.im.05172573","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>The Alberta Stroke Program Early CT Score (ASPECTS) is widely used to evaluate early ischemic changes and guide thrombectomy decisions in acute stroke patients. However, significant interobserver variability in manual ASPECTS assessment presents a challenge. Recent advances in artificial intelligence have enabled the development of automated ASPECTS scoring systems; however, their comparative performance against expert interpretation remains insufficiently studied.</p><p><strong>Methods: </strong>We conducted a systematic review and meta-analysis following PRISMA 2020 guidelines. We searched multiple scientific databases for studies comparing automated and manual ASPECTS on Non-Contrast Computed Tomography (NCCT). Interobserver reliability was assessed using pooled interclass correlation coefficients (ICCs). Subgroup analyses were made using software types, reference standards, time windows, and computed tomography-based factors.</p><p><strong>Results: </strong>Eleven studies with a total of 1,976 patients were included. Automated ASPECTS demonstrated good reliability against reference standards (ICC: 0.72), comparable to expert readings (ICC: 0.62). RAPID ASPECTS performed highest (ICC: 0.86), especially for high-stakes decision-making. AI advantages were most significant with thin-slice CT (≤2.5mm; +0.16), intermediate time windows (120-240min; +0.16), and higher NIHSS scores (p=0.026).</p><p><strong>Conclusion: </strong>AI-driven ASPECTS systems perform comparably or even better in some cases than human readers in detecting early ischemic changes, especially in specific scenarios. Strategic utilization focusing on high-impact scenarios and region-specific performance patterns offers better diagnostic accuracy, reduced interpretation times, and better and wiser treatment selection in acute stroke care.</p>","PeriodicalId":520384,"journal":{"name":"ASIDE internal medicine","volume":"1 4","pages":"1-9"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12490272/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ASIDE internal medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.71079/aside.im.05172573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/17 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: The Alberta Stroke Program Early CT Score (ASPECTS) is widely used to evaluate early ischemic changes and guide thrombectomy decisions in acute stroke patients. However, significant interobserver variability in manual ASPECTS assessment presents a challenge. Recent advances in artificial intelligence have enabled the development of automated ASPECTS scoring systems; however, their comparative performance against expert interpretation remains insufficiently studied.
Methods: We conducted a systematic review and meta-analysis following PRISMA 2020 guidelines. We searched multiple scientific databases for studies comparing automated and manual ASPECTS on Non-Contrast Computed Tomography (NCCT). Interobserver reliability was assessed using pooled interclass correlation coefficients (ICCs). Subgroup analyses were made using software types, reference standards, time windows, and computed tomography-based factors.
Results: Eleven studies with a total of 1,976 patients were included. Automated ASPECTS demonstrated good reliability against reference standards (ICC: 0.72), comparable to expert readings (ICC: 0.62). RAPID ASPECTS performed highest (ICC: 0.86), especially for high-stakes decision-making. AI advantages were most significant with thin-slice CT (≤2.5mm; +0.16), intermediate time windows (120-240min; +0.16), and higher NIHSS scores (p=0.026).
Conclusion: AI-driven ASPECTS systems perform comparably or even better in some cases than human readers in detecting early ischemic changes, especially in specific scenarios. Strategic utilization focusing on high-impact scenarios and region-specific performance patterns offers better diagnostic accuracy, reduced interpretation times, and better and wiser treatment selection in acute stroke care.