Ping Ding , Tinghong Liu , Jinshan Xu, Liu Yuan, Liwei Zhang, Zhirong Wei, Yuchen Tai, Shuli Liang
{"title":"Development and validation of a video-based deep learning model for distinguishing epileptic seizures from non-epileptic events in a pediatric cohort","authors":"Ping Ding , Tinghong Liu , Jinshan Xu, Liu Yuan, Liwei Zhang, Zhirong Wei, Yuchen Tai, Shuli Liang","doi":"10.1016/j.yebeh.2025.110785","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>This study aimed to develop and validate a video-based deep learning system for distinguishing epileptic seizures (ES) from non-epileptic events (NEE) in a pediatric cohort. Using a prospective validation cohort, we further assessed the diagnostic performance and clinical applicability of the artificial intelligence (AI) model, investigated potential factors contributing to its diagnostic errors, and benchmarked its clinical utility against clinicians grouped by different levels of expertise.</div></div><div><h3>Methods</h3><div>An enhanced multiscale vision transformer was trained on 438 retrospectively collected videos, with benchmark comparisons against MViTv2 and SlowFast architectures. Prospective validation was performed using 130 consecutive videos to assess the diagnostic performance of the AI system against tiered clinician groups (interns, attending physicians, and chief physicians). A generalized linear mixed model (GLMM) was employed to identify factors associated with AI misdiagnosis, with further comparative analysis of diagnostic performance between AI and human clinicians.</div></div><div><h3>Results</h3><div>Our model demonstrated significantly higher accuracy (p = 0.001) and sensitivity (p = 0.004) compared to the MViTv2 model. Although all performance metrics were numerically higher than those of the SlowFast model, these differences did not reach statistical significance. GLMM analysis indicated that event type (motor vs. non-motor) was a significant factor influencing model misclassification (p = 0.020). The model achieved substantially higher diagnostic accuracy for motor events compared to non-motor events (p < 0.001).</div></div><div><h3>Conclusion</h3><div>The video-based AI classifier shows promise as an assistive tool for clinicians in differentiating ES from NEE based on video evidence in a pediatric cohort. Our AI model demonstrated notably effective diagnostic performance for motor events, while its accuracy was more limited for non-motor events.</div></div>","PeriodicalId":11847,"journal":{"name":"Epilepsy & Behavior","volume":"173 ","pages":"Article 110785"},"PeriodicalIF":2.3000,"publicationDate":"2025-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epilepsy & Behavior","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1525505025005256","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
This study aimed to develop and validate a video-based deep learning system for distinguishing epileptic seizures (ES) from non-epileptic events (NEE) in a pediatric cohort. Using a prospective validation cohort, we further assessed the diagnostic performance and clinical applicability of the artificial intelligence (AI) model, investigated potential factors contributing to its diagnostic errors, and benchmarked its clinical utility against clinicians grouped by different levels of expertise.
Methods
An enhanced multiscale vision transformer was trained on 438 retrospectively collected videos, with benchmark comparisons against MViTv2 and SlowFast architectures. Prospective validation was performed using 130 consecutive videos to assess the diagnostic performance of the AI system against tiered clinician groups (interns, attending physicians, and chief physicians). A generalized linear mixed model (GLMM) was employed to identify factors associated with AI misdiagnosis, with further comparative analysis of diagnostic performance between AI and human clinicians.
Results
Our model demonstrated significantly higher accuracy (p = 0.001) and sensitivity (p = 0.004) compared to the MViTv2 model. Although all performance metrics were numerically higher than those of the SlowFast model, these differences did not reach statistical significance. GLMM analysis indicated that event type (motor vs. non-motor) was a significant factor influencing model misclassification (p = 0.020). The model achieved substantially higher diagnostic accuracy for motor events compared to non-motor events (p < 0.001).
Conclusion
The video-based AI classifier shows promise as an assistive tool for clinicians in differentiating ES from NEE based on video evidence in a pediatric cohort. Our AI model demonstrated notably effective diagnostic performance for motor events, while its accuracy was more limited for non-motor events.
期刊介绍:
Epilepsy & Behavior is the fastest-growing international journal uniquely devoted to the rapid dissemination of the most current information available on the behavioral aspects of seizures and epilepsy.
Epilepsy & Behavior presents original peer-reviewed articles based on laboratory and clinical research. Topics are drawn from a variety of fields, including clinical neurology, neurosurgery, neuropsychiatry, neuropsychology, neurophysiology, neuropharmacology, and neuroimaging.
From September 2012 Epilepsy & Behavior stopped accepting Case Reports for publication in the journal. From this date authors who submit to Epilepsy & Behavior will be offered a transfer or asked to resubmit their Case Reports to its new sister journal, Epilepsy & Behavior Case Reports.