实现智能头部脉冲测试:使用单目红外摄像机的无护目镜方法。

IF 2.2 3区 医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL
Laryngoscope Pub Date : 2024-10-18 DOI:10.1002/lary.31848
Yang Ouyang, Wenwei Luo, Yinwei Zhan, Caizhen Wei, Xian Liang, Hongming Huang, Yong Cui
{"title":"实现智能头部脉冲测试:使用单目红外摄像机的无护目镜方法。","authors":"Yang Ouyang, Wenwei Luo, Yinwei Zhan, Caizhen Wei, Xian Liang, Hongming Huang, Yong Cui","doi":"10.1002/lary.31848","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>To assess vestibular function, video head impulse test (vHIT) is taken as the gold standard by evaluating the vestibulo-ocular reflex (VOR). However, vHIT requires the patient to wear a specialized head-mounted goggle equipment that needs to be calibrated before each use. For this, we proposed an intelligent head impulse test (iHIT) setting with a monocular infrared camera instead of the head-mounted goggle and contributed correspondingly a video classification approach with deep learning to vestibular function determination.</p><p><strong>Methods: </strong>Within the iHIT framework, a monocular infrared camera was set in front of the patient to capture test videos, based on which a dataset DiHIT of HIT video clips was set up. We then proposed a two-stage multi-modal video classification network, trained on the dataset DiHIT, that took as input the eye motion and head motion data extracted from the facial keypoints via HIT clips and outputted the identification of the semicircular canal (SCC) being tested (SCC identification) and determination of VOR abnormality (SCC qualitation).</p><p><strong>Results: </strong>Experiments on this dataset DiHIT showed that it achieved the accuracy of 100% in prediction of SCC identification. Furthermore, it attained predictive accuracies of 84.1% in horizontal and 79.0% in vertical SCC qualitation.</p><p><strong>Conclusions: </strong>Compared with existing video-based HIT, iHIT eliminates goggles, does not require equipment calibration, and achieves complete automation. Furthermore, iHIT will bring more benefits to users due to its low cost and ease of operation. Codes and use case pipeline are available at: https://github.com/dec1st2023/iHIT.</p><p><strong>Level of evidence: </strong>3 Laryngoscope, 2024.</p>","PeriodicalId":49921,"journal":{"name":"Laryngoscope","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Toward Intelligent Head Impulse Test: A Goggle-Free Approach Using a Monocular Infrared Camera.\",\"authors\":\"Yang Ouyang, Wenwei Luo, Yinwei Zhan, Caizhen Wei, Xian Liang, Hongming Huang, Yong Cui\",\"doi\":\"10.1002/lary.31848\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>To assess vestibular function, video head impulse test (vHIT) is taken as the gold standard by evaluating the vestibulo-ocular reflex (VOR). However, vHIT requires the patient to wear a specialized head-mounted goggle equipment that needs to be calibrated before each use. For this, we proposed an intelligent head impulse test (iHIT) setting with a monocular infrared camera instead of the head-mounted goggle and contributed correspondingly a video classification approach with deep learning to vestibular function determination.</p><p><strong>Methods: </strong>Within the iHIT framework, a monocular infrared camera was set in front of the patient to capture test videos, based on which a dataset DiHIT of HIT video clips was set up. We then proposed a two-stage multi-modal video classification network, trained on the dataset DiHIT, that took as input the eye motion and head motion data extracted from the facial keypoints via HIT clips and outputted the identification of the semicircular canal (SCC) being tested (SCC identification) and determination of VOR abnormality (SCC qualitation).</p><p><strong>Results: </strong>Experiments on this dataset DiHIT showed that it achieved the accuracy of 100% in prediction of SCC identification. Furthermore, it attained predictive accuracies of 84.1% in horizontal and 79.0% in vertical SCC qualitation.</p><p><strong>Conclusions: </strong>Compared with existing video-based HIT, iHIT eliminates goggles, does not require equipment calibration, and achieves complete automation. Furthermore, iHIT will bring more benefits to users due to its low cost and ease of operation. Codes and use case pipeline are available at: https://github.com/dec1st2023/iHIT.</p><p><strong>Level of evidence: </strong>3 Laryngoscope, 2024.</p>\",\"PeriodicalId\":49921,\"journal\":{\"name\":\"Laryngoscope\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Laryngoscope\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/lary.31848\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Laryngoscope","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/lary.31848","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

目的:要评估前庭功能,视频头脉冲试验(vHIT)是评估前庭眼反射(VOR)的金标准。然而,vHIT 需要患者佩戴专门的头戴式护目镜设备,每次使用前都需要校准。为此,我们提出了一种智能头脉冲测试(iHIT)设置,用单眼红外摄像机代替头戴式护目镜,并为前庭功能测定贡献了相应的深度学习视频分类方法:在 iHIT 框架内,我们在患者前方安装了一台单目红外摄像机以捕捉测试视频,并在此基础上建立了一个由 HIT 视频片段组成的数据集 DiHIT。然后,我们提出了一个在数据集 DiHIT 上训练的两阶段多模态视频分类网络,该网络将通过 HIT 片段从面部关键点提取的眼球运动和头部运动数据作为输入,并输出被测半规管(SCC)的识别(SCC 识别)和 VOR 异常的判断(SCC 定性):对该数据集 DiHIT 的实验表明,它对 SCC 识别的预测准确率达到了 100%。此外,它对水平 SCC 和垂直 SCC 定性的预测准确率分别为 84.1%和 79.0%:与现有的基于视频的 HIT 相比,iHIT 无需护目镜,无需校准设备,实现了完全自动化。此外,iHIT 成本低廉、操作简便,将为用户带来更多益处。代码和用例管道见:https://github.com/dec1st2023/iHIT.Level of evidence:3 喉镜,2024 年。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Toward Intelligent Head Impulse Test: A Goggle-Free Approach Using a Monocular Infrared Camera.

Objectives: To assess vestibular function, video head impulse test (vHIT) is taken as the gold standard by evaluating the vestibulo-ocular reflex (VOR). However, vHIT requires the patient to wear a specialized head-mounted goggle equipment that needs to be calibrated before each use. For this, we proposed an intelligent head impulse test (iHIT) setting with a monocular infrared camera instead of the head-mounted goggle and contributed correspondingly a video classification approach with deep learning to vestibular function determination.

Methods: Within the iHIT framework, a monocular infrared camera was set in front of the patient to capture test videos, based on which a dataset DiHIT of HIT video clips was set up. We then proposed a two-stage multi-modal video classification network, trained on the dataset DiHIT, that took as input the eye motion and head motion data extracted from the facial keypoints via HIT clips and outputted the identification of the semicircular canal (SCC) being tested (SCC identification) and determination of VOR abnormality (SCC qualitation).

Results: Experiments on this dataset DiHIT showed that it achieved the accuracy of 100% in prediction of SCC identification. Furthermore, it attained predictive accuracies of 84.1% in horizontal and 79.0% in vertical SCC qualitation.

Conclusions: Compared with existing video-based HIT, iHIT eliminates goggles, does not require equipment calibration, and achieves complete automation. Furthermore, iHIT will bring more benefits to users due to its low cost and ease of operation. Codes and use case pipeline are available at: https://github.com/dec1st2023/iHIT.

Level of evidence: 3 Laryngoscope, 2024.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Laryngoscope
Laryngoscope 医学-耳鼻喉科学
CiteScore
6.50
自引率
7.70%
发文量
500
审稿时长
2-4 weeks
期刊介绍: The Laryngoscope has been the leading source of information on advances in the diagnosis and treatment of head and neck disorders since 1890. The Laryngoscope is the first choice among otolaryngologists for publication of their important findings and techniques. Each monthly issue of The Laryngoscope features peer-reviewed medical, clinical, and research contributions in general otolaryngology, allergy/rhinology, otology/neurotology, laryngology/bronchoesophagology, head and neck surgery, sleep medicine, pediatric otolaryngology, facial plastics and reconstructive surgery, oncology, and communicative disorders. Contributions include papers and posters presented at the Annual and Section Meetings of the Triological Society, as well as independent papers, "How I Do It", "Triological Best Practice" articles, and contemporary reviews. Theses authored by the Triological Society’s new Fellows as well as papers presented at meetings of the American Laryngological Association are published in The Laryngoscope. • Broncho-esophagology • Communicative disorders • Head and neck surgery • Plastic and reconstructive facial surgery • Oncology • Speech and hearing defects
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信