实现智能头部脉冲测试：使用单目红外摄像机的无护目镜方法。

IF 2.2 3区医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL

Laryngoscope Pub Date : 2024-10-18 DOI:10.1002/lary.31848

Yang Ouyang ME, Wenwei Luo MD, Yinwei Zhan PhD, Caizhen Wei ME, Xian Liang ME, Hongming Huang MM, Yong Cui MD

{"title":"实现智能头部脉冲测试：使用单目红外摄像机的无护目镜方法。","authors":"Yang Ouyang ME, Wenwei Luo MD, Yinwei Zhan PhD, Caizhen Wei ME, Xian Liang ME, Hongming Huang MM, Yong Cui MD","doi":"10.1002/lary.31848","DOIUrl":null,"url":null,"abstract":"<div>\n \n <section>\n \n <h3> Objectives</h3>\n \n <p>To assess vestibular function, video head impulse test (vHIT) is taken as the gold standard by evaluating the vestibulo-ocular reflex (VOR). However, vHIT requires the patient to wear a specialized head-mounted goggle equipment that needs to be calibrated before each use. For this, we proposed an intelligent head impulse test (iHIT) setting with a monocular infrared camera instead of the head-mounted goggle and contributed correspondingly a video classification approach with deep learning to vestibular function determination.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Within the iHIT framework, a monocular infrared camera was set in front of the patient to capture test videos, based on which a dataset DiHIT of HIT video clips was set up. We then proposed a two-stage multi-modal video classification network, trained on the dataset DiHIT, that took as input the eye motion and head motion data extracted from the facial keypoints via HIT clips and outputted the identification of the semicircular canal (SCC) being tested (SCC identification) and determination of VOR abnormality (SCC qualitation).</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Experiments on this dataset DiHIT showed that it achieved the accuracy of 100% in prediction of SCC identification. Furthermore, it attained predictive accuracies of 84.1% in horizontal and 79.0% in vertical SCC qualitation.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>Compared with existing video-based HIT, iHIT eliminates goggles, does not require equipment calibration, and achieves complete automation. Furthermore, iHIT will bring more benefits to users due to its low cost and ease of operation. Codes and use case pipeline are available at: https://github.com/dec1st2023/iHIT.</p>\n </section>\n \n <section>\n \n <h3> Level of Evidence</h3>\n \n <p>3 <i>Laryngoscope</i>, 135:1161–1168, 2025</p>\n </section>\n </div>","PeriodicalId":49921,"journal":{"name":"Laryngoscope","volume":"135 3","pages":"1161-1168"},"PeriodicalIF":2.2000,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Toward Intelligent Head Impulse Test: A Goggle-Free Approach Using a Monocular Infrared Camera\",\"authors\":\"Yang Ouyang ME, Wenwei Luo MD, Yinwei Zhan PhD, Caizhen Wei ME, Xian Liang ME, Hongming Huang MM, Yong Cui MD\",\"doi\":\"10.1002/lary.31848\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <section>\\n \\n <h3> Objectives</h3>\\n \\n <p>To assess vestibular function, video head impulse test (vHIT) is taken as the gold standard by evaluating the vestibulo-ocular reflex (VOR). However, vHIT requires the patient to wear a specialized head-mounted goggle equipment that needs to be calibrated before each use. For this, we proposed an intelligent head impulse test (iHIT) setting with a monocular infrared camera instead of the head-mounted goggle and contributed correspondingly a video classification approach with deep learning to vestibular function determination.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>Within the iHIT framework, a monocular infrared camera was set in front of the patient to capture test videos, based on which a dataset DiHIT of HIT video clips was set up. We then proposed a two-stage multi-modal video classification network, trained on the dataset DiHIT, that took as input the eye motion and head motion data extracted from the facial keypoints via HIT clips and outputted the identification of the semicircular canal (SCC) being tested (SCC identification) and determination of VOR abnormality (SCC qualitation).</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>Experiments on this dataset DiHIT showed that it achieved the accuracy of 100% in prediction of SCC identification. Furthermore, it attained predictive accuracies of 84.1% in horizontal and 79.0% in vertical SCC qualitation.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusions</h3>\\n \\n <p>Compared with existing video-based HIT, iHIT eliminates goggles, does not require equipment calibration, and achieves complete automation. Furthermore, iHIT will bring more benefits to users due to its low cost and ease of operation. Codes and use case pipeline are available at: https://github.com/dec1st2023/iHIT.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Level of Evidence</h3>\\n \\n <p>3 <i>Laryngoscope</i>, 135:1161–1168, 2025</p>\\n </section>\\n </div>\",\"PeriodicalId\":49921,\"journal\":{\"name\":\"Laryngoscope\",\"volume\":\"135 3\",\"pages\":\"1161-1168\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Laryngoscope\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/lary.31848\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Laryngoscope","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/lary.31848","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

摘要

目的：要评估前庭功能，视频头脉冲试验（vHIT）是评估前庭眼反射（VOR）的金标准。然而，vHIT 需要患者佩戴专门的头戴式护目镜设备，每次使用前都需要校准。为此，我们提出了一种智能头脉冲测试（iHIT）设置，用单眼红外摄像机代替头戴式护目镜，并为前庭功能测定贡献了相应的深度学习视频分类方法：在 iHIT 框架内，我们在患者前方安装了一台单目红外摄像机以捕捉测试视频，并在此基础上建立了一个由 HIT 视频片段组成的数据集 DiHIT。然后，我们提出了一个在数据集 DiHIT 上训练的两阶段多模态视频分类网络，该网络将通过 HIT 片段从面部关键点提取的眼球运动和头部运动数据作为输入，并输出被测半规管（SCC）的识别（SCC 识别）和 VOR 异常的判断（SCC 定性）：对该数据集 DiHIT 的实验表明，它对 SCC 识别的预测准确率达到了 100%。此外，它对水平 SCC 和垂直 SCC 定性的预测准确率分别为 84.1%和 79.0%：与现有的基于视频的 HIT 相比，iHIT 无需护目镜，无需校准设备，实现了完全自动化。此外，iHIT 成本低廉、操作简便，将为用户带来更多益处。代码和用例管道见：https://github.com/dec1st2023/iHIT.Level of evidence：3 喉镜，2024 年。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Toward Intelligent Head Impulse Test: A Goggle-Free Approach Using a Monocular Infrared Camera

查看原文本刊更多论文

Toward Intelligent Head Impulse Test: A Goggle-Free Approach Using a Monocular Infrared Camera

Objectives

To assess vestibular function, video head impulse test (vHIT) is taken as the gold standard by evaluating the vestibulo-ocular reflex (VOR). However, vHIT requires the patient to wear a specialized head-mounted goggle equipment that needs to be calibrated before each use. For this, we proposed an intelligent head impulse test (iHIT) setting with a monocular infrared camera instead of the head-mounted goggle and contributed correspondingly a video classification approach with deep learning to vestibular function determination.

Methods

Within the iHIT framework, a monocular infrared camera was set in front of the patient to capture test videos, based on which a dataset DiHIT of HIT video clips was set up. We then proposed a two-stage multi-modal video classification network, trained on the dataset DiHIT, that took as input the eye motion and head motion data extracted from the facial keypoints via HIT clips and outputted the identification of the semicircular canal (SCC) being tested (SCC identification) and determination of VOR abnormality (SCC qualitation).

Results

Experiments on this dataset DiHIT showed that it achieved the accuracy of 100% in prediction of SCC identification. Furthermore, it attained predictive accuracies of 84.1% in horizontal and 79.0% in vertical SCC qualitation.

Conclusions

Compared with existing video-based HIT, iHIT eliminates goggles, does not require equipment calibration, and achieves complete automation. Furthermore, iHIT will bring more benefits to users due to its low cost and ease of operation. Codes and use case pipeline are available at: https://github.com/dec1st2023/iHIT.

Level of Evidence

3 Laryngoscope, 135:1161–1168, 2025

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Laryngoscope 医学-耳鼻喉科学

CiteScore

6.50

自引率

7.70%

发文量

500

审稿时长

2-4 weeks

期刊介绍： The Laryngoscope has been the leading source of information on advances in the diagnosis and treatment of head and neck disorders since 1890. The Laryngoscope is the first choice among otolaryngologists for publication of their important findings and techniques. Each monthly issue of The Laryngoscope features peer-reviewed medical, clinical, and research contributions in general otolaryngology, allergy/rhinology, otology/neurotology, laryngology/bronchoesophagology, head and neck surgery, sleep medicine, pediatric otolaryngology, facial plastics and reconstructive surgery, oncology, and communicative disorders. Contributions include papers and posters presented at the Annual and Section Meetings of the Triological Society, as well as independent papers, "How I Do It", "Triological Best Practice" articles, and contemporary reviews. Theses authored by the Triological Society’s new Fellows as well as papers presented at meetings of the American Laryngological Association are published in The Laryngoscope. • Broncho-esophagology • Communicative disorders • Head and neck surgery • Plastic and reconstructive facial surgery • Oncology • Speech and hearing defects