Yang Ouyang, Wenwei Luo, Yinwei Zhan, Caizhen Wei, Xian Liang, Hongming Huang, Yong Cui
{"title":"实现智能头部脉冲测试:使用单目红外摄像机的无护目镜方法。","authors":"Yang Ouyang, Wenwei Luo, Yinwei Zhan, Caizhen Wei, Xian Liang, Hongming Huang, Yong Cui","doi":"10.1002/lary.31848","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>To assess vestibular function, video head impulse test (vHIT) is taken as the gold standard by evaluating the vestibulo-ocular reflex (VOR). However, vHIT requires the patient to wear a specialized head-mounted goggle equipment that needs to be calibrated before each use. For this, we proposed an intelligent head impulse test (iHIT) setting with a monocular infrared camera instead of the head-mounted goggle and contributed correspondingly a video classification approach with deep learning to vestibular function determination.</p><p><strong>Methods: </strong>Within the iHIT framework, a monocular infrared camera was set in front of the patient to capture test videos, based on which a dataset DiHIT of HIT video clips was set up. We then proposed a two-stage multi-modal video classification network, trained on the dataset DiHIT, that took as input the eye motion and head motion data extracted from the facial keypoints via HIT clips and outputted the identification of the semicircular canal (SCC) being tested (SCC identification) and determination of VOR abnormality (SCC qualitation).</p><p><strong>Results: </strong>Experiments on this dataset DiHIT showed that it achieved the accuracy of 100% in prediction of SCC identification. Furthermore, it attained predictive accuracies of 84.1% in horizontal and 79.0% in vertical SCC qualitation.</p><p><strong>Conclusions: </strong>Compared with existing video-based HIT, iHIT eliminates goggles, does not require equipment calibration, and achieves complete automation. Furthermore, iHIT will bring more benefits to users due to its low cost and ease of operation. Codes and use case pipeline are available at: https://github.com/dec1st2023/iHIT.</p><p><strong>Level of evidence: </strong>3 Laryngoscope, 2024.</p>","PeriodicalId":49921,"journal":{"name":"Laryngoscope","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Toward Intelligent Head Impulse Test: A Goggle-Free Approach Using a Monocular Infrared Camera.\",\"authors\":\"Yang Ouyang, Wenwei Luo, Yinwei Zhan, Caizhen Wei, Xian Liang, Hongming Huang, Yong Cui\",\"doi\":\"10.1002/lary.31848\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>To assess vestibular function, video head impulse test (vHIT) is taken as the gold standard by evaluating the vestibulo-ocular reflex (VOR). However, vHIT requires the patient to wear a specialized head-mounted goggle equipment that needs to be calibrated before each use. For this, we proposed an intelligent head impulse test (iHIT) setting with a monocular infrared camera instead of the head-mounted goggle and contributed correspondingly a video classification approach with deep learning to vestibular function determination.</p><p><strong>Methods: </strong>Within the iHIT framework, a monocular infrared camera was set in front of the patient to capture test videos, based on which a dataset DiHIT of HIT video clips was set up. We then proposed a two-stage multi-modal video classification network, trained on the dataset DiHIT, that took as input the eye motion and head motion data extracted from the facial keypoints via HIT clips and outputted the identification of the semicircular canal (SCC) being tested (SCC identification) and determination of VOR abnormality (SCC qualitation).</p><p><strong>Results: </strong>Experiments on this dataset DiHIT showed that it achieved the accuracy of 100% in prediction of SCC identification. Furthermore, it attained predictive accuracies of 84.1% in horizontal and 79.0% in vertical SCC qualitation.</p><p><strong>Conclusions: </strong>Compared with existing video-based HIT, iHIT eliminates goggles, does not require equipment calibration, and achieves complete automation. Furthermore, iHIT will bring more benefits to users due to its low cost and ease of operation. Codes and use case pipeline are available at: https://github.com/dec1st2023/iHIT.</p><p><strong>Level of evidence: </strong>3 Laryngoscope, 2024.</p>\",\"PeriodicalId\":49921,\"journal\":{\"name\":\"Laryngoscope\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Laryngoscope\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/lary.31848\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Laryngoscope","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/lary.31848","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
摘要
目的:要评估前庭功能,视频头脉冲试验(vHIT)是评估前庭眼反射(VOR)的金标准。然而,vHIT 需要患者佩戴专门的头戴式护目镜设备,每次使用前都需要校准。为此,我们提出了一种智能头脉冲测试(iHIT)设置,用单眼红外摄像机代替头戴式护目镜,并为前庭功能测定贡献了相应的深度学习视频分类方法:在 iHIT 框架内,我们在患者前方安装了一台单目红外摄像机以捕捉测试视频,并在此基础上建立了一个由 HIT 视频片段组成的数据集 DiHIT。然后,我们提出了一个在数据集 DiHIT 上训练的两阶段多模态视频分类网络,该网络将通过 HIT 片段从面部关键点提取的眼球运动和头部运动数据作为输入,并输出被测半规管(SCC)的识别(SCC 识别)和 VOR 异常的判断(SCC 定性):对该数据集 DiHIT 的实验表明,它对 SCC 识别的预测准确率达到了 100%。此外,它对水平 SCC 和垂直 SCC 定性的预测准确率分别为 84.1%和 79.0%:与现有的基于视频的 HIT 相比,iHIT 无需护目镜,无需校准设备,实现了完全自动化。此外,iHIT 成本低廉、操作简便,将为用户带来更多益处。代码和用例管道见:https://github.com/dec1st2023/iHIT.Level of evidence:3 喉镜,2024 年。
Toward Intelligent Head Impulse Test: A Goggle-Free Approach Using a Monocular Infrared Camera.
Objectives: To assess vestibular function, video head impulse test (vHIT) is taken as the gold standard by evaluating the vestibulo-ocular reflex (VOR). However, vHIT requires the patient to wear a specialized head-mounted goggle equipment that needs to be calibrated before each use. For this, we proposed an intelligent head impulse test (iHIT) setting with a monocular infrared camera instead of the head-mounted goggle and contributed correspondingly a video classification approach with deep learning to vestibular function determination.
Methods: Within the iHIT framework, a monocular infrared camera was set in front of the patient to capture test videos, based on which a dataset DiHIT of HIT video clips was set up. We then proposed a two-stage multi-modal video classification network, trained on the dataset DiHIT, that took as input the eye motion and head motion data extracted from the facial keypoints via HIT clips and outputted the identification of the semicircular canal (SCC) being tested (SCC identification) and determination of VOR abnormality (SCC qualitation).
Results: Experiments on this dataset DiHIT showed that it achieved the accuracy of 100% in prediction of SCC identification. Furthermore, it attained predictive accuracies of 84.1% in horizontal and 79.0% in vertical SCC qualitation.
Conclusions: Compared with existing video-based HIT, iHIT eliminates goggles, does not require equipment calibration, and achieves complete automation. Furthermore, iHIT will bring more benefits to users due to its low cost and ease of operation. Codes and use case pipeline are available at: https://github.com/dec1st2023/iHIT.
期刊介绍:
The Laryngoscope has been the leading source of information on advances in the diagnosis and treatment of head and neck disorders since 1890. The Laryngoscope is the first choice among otolaryngologists for publication of their important findings and techniques. Each monthly issue of The Laryngoscope features peer-reviewed medical, clinical, and research contributions in general otolaryngology, allergy/rhinology, otology/neurotology, laryngology/bronchoesophagology, head and neck surgery, sleep medicine, pediatric otolaryngology, facial plastics and reconstructive surgery, oncology, and communicative disorders. Contributions include papers and posters presented at the Annual and Section Meetings of the Triological Society, as well as independent papers, "How I Do It", "Triological Best Practice" articles, and contemporary reviews. Theses authored by the Triological Society’s new Fellows as well as papers presented at meetings of the American Laryngological Association are published in The Laryngoscope.
• Broncho-esophagology
• Communicative disorders
• Head and neck surgery
• Plastic and reconstructive facial surgery
• Oncology
• Speech and hearing defects