Deep Learning for High-Speed Laryngeal Imaging Analysis

M. Naghibolhosseini, Ahmed M. Yousef, Mohsen Zayernouri, Stephanie R. C. Zacharias, D. Deliyski
{"title":"Deep Learning for High-Speed Laryngeal Imaging Analysis","authors":"M. Naghibolhosseini, Ahmed M. Yousef, Mohsen Zayernouri, Stephanie R. C. Zacharias, D. Deliyski","doi":"10.1109/ICCIKE58312.2023.10131757","DOIUrl":null,"url":null,"abstract":"High-speed imaging of the larynx provides a valuable means for studying vocal folds function and vibratory behaviors. Using laryngeal high-speed videoendoscopy (HSV) with a flexible nasolaryngoscope, one can record the detailed vibratory movements of vocal folds during connected speech. This high-speed imaging tool enables us to study the normal function of the vocal folds and how this function can be disrupted due to the presence of voice disorders. In this work, HSV data were utilized during connected speech from subjects with normophonic voices (no voice disorders) and a neurological voice disorder. The data were obtained using a high-speed camera, coupled with a flexible endoscope, at 4,000 frames per second. Deep learning was used for the analysis of the big HSV dataset to extract the vibratory behaviors of the vocal folds. This deep-learning-based tool achieved high levels of accuracy for analysis of challenging HSV data in connected speech. This tool provides a computationally cost-effective and an accurate measurement approach that could help design more advanced voice assessment protocols in future.","PeriodicalId":164690,"journal":{"name":"2023 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIKE58312.2023.10131757","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

High-speed imaging of the larynx provides a valuable means for studying vocal folds function and vibratory behaviors. Using laryngeal high-speed videoendoscopy (HSV) with a flexible nasolaryngoscope, one can record the detailed vibratory movements of vocal folds during connected speech. This high-speed imaging tool enables us to study the normal function of the vocal folds and how this function can be disrupted due to the presence of voice disorders. In this work, HSV data were utilized during connected speech from subjects with normophonic voices (no voice disorders) and a neurological voice disorder. The data were obtained using a high-speed camera, coupled with a flexible endoscope, at 4,000 frames per second. Deep learning was used for the analysis of the big HSV dataset to extract the vibratory behaviors of the vocal folds. This deep-learning-based tool achieved high levels of accuracy for analysis of challenging HSV data in connected speech. This tool provides a computationally cost-effective and an accurate measurement approach that could help design more advanced voice assessment protocols in future.
高速喉部影像分析的深度学习
高速喉部成像为研究声带功能和振动行为提供了有价值的手段。使用喉高速视频内窥镜(HSV)与一个灵活的鼻咽喉镜,可以记录声带在连接讲话时的详细振动运动。这种高速成像工具使我们能够研究声带的正常功能,以及这种功能如何由于声音障碍的存在而被破坏。在这项工作中,HSV数据被用于正常语音(无语音障碍)和神经性语音障碍受试者的连接语音。这些数据是通过一个高速摄像机和一个灵活的内窥镜获得的,速度为每秒4000帧。利用深度学习对大HSV数据集进行分析,提取声带的振动行为。这种基于深度学习的工具在分析连接语音中具有挑战性的HSV数据方面取得了很高的准确性。该工具提供了一种计算成本效益和准确的测量方法,可以帮助设计未来更先进的语音评估协议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信