Artificial Intelligence Diagnosis of Ocular Motility Disorders From Clinical Videos.

IF 2 4区医学 Q3 CLINICAL NEUROLOGY

Journal of Neuro-Ophthalmology Pub Date : 2025-08-28 DOI:10.1097/WNO.0000000000002393

David Mikhail, Michael Balas, Jonathan A Micieli

{"title":"Artificial Intelligence Diagnosis of Ocular Motility Disorders From Clinical Videos.","authors":"David Mikhail, Michael Balas, Jonathan A Micieli","doi":"10.1097/WNO.0000000000002393","DOIUrl":null,"url":null,"abstract":"Background: Multimodal artificial intelligence (AI) models have recently expanded into video analysis. In ophthalmology, one exploratory application is the automated detection of extraocular movement (EOM) disorders. This proof-of-concept study evaluated the feasibility of using Gemini 2.0 to recognize EOM abnormalities, identify the affected eye, and recognize specific movement limitations from publicly available, real-world clinical videos.Methods: We retrospectively collected 114 YouTube videos of EOM disorders, including cranial nerve (CN) palsies, internuclear ophthalmoplegia (INO), supranuclear disorders, nystagmus, and ocular myasthenia gravis (MG), alongside 15 control videos demonstrating normal EOMs. Videos were trimmed to include only the pertinent clinical examinations, and audio was removed to avoid diagnostic cues. Using a standardized zero-shot prompt, Gemini 2.0 analyzed each video via the Google AI Studio platform. Gemini 2.0 was evaluated based on its ability to provide the correct diagnosis, identify the affected eye, and recognize the specific movement limitation (if any). Descriptive statistics, Spearman correlations, and comparative analyses were used to assess performance.Results: Gemini 2.0 correctly identified the primary diagnosis in 43 of 114 videos, yielding an overall diagnostic accuracy of 37.7%. Diagnostic performance varied by condition, with the highest accuracies observed in third nerve palsy (81.1%), INO (80.0%), sixth nerve palsy (66.7%), and ocular MG (20.0%), whereas normal EOMs were correctly classified in 93.3% of cases. In misclassified cases, the correct diagnosis appeared in the differential diagnosis in 15.5% of instances. Laterality was correctly identified in 26.5% of eligible cases overall, 73.1% among correctly diagnosed cases vs. 9.6% in misclassified ones. Similarly, movement limitations were accurately identified in 30.3% of eligible cases overall, with a marked increase to 88.5% accuracy in correctly diagnosed cases compared to 9.6% in misclassified cases. Longer videos moderately correlated with longer processing time (ρ = 0.55, P < 0.001). Significant correlations were observed between correct diagnosis and correct laterality identification (ρ = 0.45, P < 0.001), correct diagnosis and correct movement limitation identification (ρ = 0.61, P < 0.001), and laterality and movement limitation (ρ = 0.51, P < 0.001). Processing time averaged 11.0 seconds and correlated with video length (ρ = 0.55, P < 0.001).Conclusions: This proof-of-concept study demonstrates the feasibility of using Gemini 2.0 for automated recognition of EOM abnormalities in clinical videos. Although performance was stronger in overt cases, overall diagnostic accuracy remains limited. Substantial validation on standardized, clinician-annotated datasets is needed before clinical application.","PeriodicalId":16485,"journal":{"name":"Journal of Neuro-Ophthalmology","volume":" ","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Neuro-Ophthalmology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/WNO.0000000000002393","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Multimodal artificial intelligence (AI) models have recently expanded into video analysis. In ophthalmology, one exploratory application is the automated detection of extraocular movement (EOM) disorders. This proof-of-concept study evaluated the feasibility of using Gemini 2.0 to recognize EOM abnormalities, identify the affected eye, and recognize specific movement limitations from publicly available, real-world clinical videos.

Methods: We retrospectively collected 114 YouTube videos of EOM disorders, including cranial nerve (CN) palsies, internuclear ophthalmoplegia (INO), supranuclear disorders, nystagmus, and ocular myasthenia gravis (MG), alongside 15 control videos demonstrating normal EOMs. Videos were trimmed to include only the pertinent clinical examinations, and audio was removed to avoid diagnostic cues. Using a standardized zero-shot prompt, Gemini 2.0 analyzed each video via the Google AI Studio platform. Gemini 2.0 was evaluated based on its ability to provide the correct diagnosis, identify the affected eye, and recognize the specific movement limitation (if any). Descriptive statistics, Spearman correlations, and comparative analyses were used to assess performance.

Results: Gemini 2.0 correctly identified the primary diagnosis in 43 of 114 videos, yielding an overall diagnostic accuracy of 37.7%. Diagnostic performance varied by condition, with the highest accuracies observed in third nerve palsy (81.1%), INO (80.0%), sixth nerve palsy (66.7%), and ocular MG (20.0%), whereas normal EOMs were correctly classified in 93.3% of cases. In misclassified cases, the correct diagnosis appeared in the differential diagnosis in 15.5% of instances. Laterality was correctly identified in 26.5% of eligible cases overall, 73.1% among correctly diagnosed cases vs. 9.6% in misclassified ones. Similarly, movement limitations were accurately identified in 30.3% of eligible cases overall, with a marked increase to 88.5% accuracy in correctly diagnosed cases compared to 9.6% in misclassified cases. Longer videos moderately correlated with longer processing time (ρ = 0.55, P < 0.001). Significant correlations were observed between correct diagnosis and correct laterality identification (ρ = 0.45, P < 0.001), correct diagnosis and correct movement limitation identification (ρ = 0.61, P < 0.001), and laterality and movement limitation (ρ = 0.51, P < 0.001). Processing time averaged 11.0 seconds and correlated with video length (ρ = 0.55, P < 0.001).

Conclusions: This proof-of-concept study demonstrates the feasibility of using Gemini 2.0 for automated recognition of EOM abnormalities in clinical videos. Although performance was stronger in overt cases, overall diagnostic accuracy remains limited. Substantial validation on standardized, clinician-annotated datasets is needed before clinical application.

查看原文本刊更多论文

从临床视频看眼运动障碍的人工智能诊断。

背景：多模式人工智能（AI）模型最近已经扩展到视频分析领域。在眼科，一个探索性的应用是眼外运动（EOM）疾病的自动检测。这项概念验证性研究评估了使用Gemini 2.0识别EOM异常、识别受影响的眼睛以及从公开可用的真实临床视频中识别特定运动限制的可行性。方法：我们回顾性收集了114个EOM疾病的YouTube视频，包括脑神经（CN）麻痹、核间眼麻痹（INO）、核上疾病、眼球震颤和眼重症肌无力（MG），以及15个正常EOM的对照视频。视频被修剪，只包括相关的临床检查，音频被删除，以避免诊断线索。双子座2.0使用标准化的零拍摄提示，通过谷歌AI Studio平台分析每个视频。Gemini 2.0的评估基于其提供正确诊断、识别受影响的眼睛和识别特定运动限制（如果有的话）的能力。使用描述性统计、Spearman相关和比较分析来评估绩效。结果：Gemini 2.0在114个视频中的43个中正确识别了原发性诊断，总体诊断准确率为37.7%。诊断表现因病情而异，第三神经麻痹的准确率最高（81.1%），INO(80.0%)，第六神经麻痹（66.7%）和眼部MG(20.0%)，而正常EOMs的准确率为93.3%。在误诊病例中，15.5%的病例在鉴别诊断中出现了正确的诊断。总的来说，26.5%的合格病例正确识别侧侧，正确诊断病例为73.1%，错误分类病例为9.6%。同样，30.3%的符合条件的病例准确地识别出了运动限制，正确诊断病例的准确率显著提高到88.5%，而错误分类病例的准确率为9.6%。较长的视频与较长的处理时间中度相关（ρ = 0.55, P < 0.001）。正确诊断与正确侧位识别（ρ = 0.45, P < 0.001）、正确诊断与正确运动受限识别（ρ = 0.61, P < 0.001）、侧位与运动受限识别（ρ = 0.51, P < 0.001）之间存在显著相关性。处理时间平均为11.0秒，且与视频长度相关（ρ = 0.55, P < 0.001）。结论：这项概念验证研究证明了在临床视频中使用Gemini 2.0自动识别EOM异常的可行性。尽管在显性病例中表现较好，但总体诊断准确性仍然有限。在临床应用之前，需要对标准化的、临床医生注释的数据集进行大量验证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Neuro-Ophthalmology 医学-临床神经学

CiteScore

2.80

自引率

13.80%

发文量

593

审稿时长

6-12 weeks

期刊介绍： The Journal of Neuro-Ophthalmology (JNO) is the official journal of the North American Neuro-Ophthalmology Society (NANOS). It is a quarterly, peer-reviewed journal that publishes original and commissioned articles related to neuro-ophthalmology.