Laryngeal Cancer Screening During Flexible Video Laryngoscopy Using Large Computer Vision Models.

IF 1.3 4区 医学 Q3 OTORHINOLARYNGOLOGY
Ishwarya S Mamidi, Michael E Dunham, Lacey K Adkins, Andrew J McWhorter, Zhide Fang, Britney T Banh
{"title":"Laryngeal Cancer Screening During Flexible Video Laryngoscopy Using Large Computer Vision Models.","authors":"Ishwarya S Mamidi, Michael E Dunham, Lacey K Adkins, Andrew J McWhorter, Zhide Fang, Britney T Banh","doi":"10.1177/00034894241253376","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Develop an artificial intelligence assisted computer vision model to screen for laryngeal cancer during flexible laryngoscopy.</p><p><strong>Methods: </strong>Using laryngeal images and flexible laryngoscopy video recordings, we developed computer vision models to classify video frames for usability and cancer screening. A separate model segments any identified lesions on the frames. We used these computer vision models to construct a video stream annotation system. This system classifies findings from flexible laryngoscopy as \"potentially malignant\" or \"probably benign\" and segments any detected lesions. Additionally, the model provides a confidence level for each classification.</p><p><strong>Results: </strong>The overall accuracy of the flexible laryngoscopy cancer screening model was 92%. For cancer screening, it achieved a sensitivity of 97.7% and a specificity of 76.9%. The segmentation model attained an average precision at a 0.50 intersection-over-union of 0.595. The confidence level for positive screening results can assist clinicians in counseling patients regarding the findings.</p><p><strong>Conclusion: </strong>Our model is highly sensitive and adequately specific for laryngeal cancer screening. Segmentation helps endoscopists identify and describe potential lesions. Further optimization is required to enable the model's deployment in clinical settings for real-time annotation during flexible laryngoscopy.</p>","PeriodicalId":50975,"journal":{"name":"Annals of Otology Rhinology and Laryngology","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Otology Rhinology and Laryngology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/00034894241253376","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/5/16 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: Develop an artificial intelligence assisted computer vision model to screen for laryngeal cancer during flexible laryngoscopy.

Methods: Using laryngeal images and flexible laryngoscopy video recordings, we developed computer vision models to classify video frames for usability and cancer screening. A separate model segments any identified lesions on the frames. We used these computer vision models to construct a video stream annotation system. This system classifies findings from flexible laryngoscopy as "potentially malignant" or "probably benign" and segments any detected lesions. Additionally, the model provides a confidence level for each classification.

Results: The overall accuracy of the flexible laryngoscopy cancer screening model was 92%. For cancer screening, it achieved a sensitivity of 97.7% and a specificity of 76.9%. The segmentation model attained an average precision at a 0.50 intersection-over-union of 0.595. The confidence level for positive screening results can assist clinicians in counseling patients regarding the findings.

Conclusion: Our model is highly sensitive and adequately specific for laryngeal cancer screening. Segmentation helps endoscopists identify and describe potential lesions. Further optimization is required to enable the model's deployment in clinical settings for real-time annotation during flexible laryngoscopy.

使用大型计算机视觉模型在柔性视频喉镜检查过程中筛查喉癌。
目的:开发一种人工智能辅助计算机视觉模型,用于在柔性喉镜检查过程中筛查喉癌:开发一种人工智能辅助计算机视觉模型,用于在柔性喉镜检查过程中筛查喉癌:我们利用喉部图像和柔性喉内镜视频记录,开发了计算机视觉模型来对视频帧进行分类,以便进行可用性和癌症筛查。此外,我们还开发了一个单独的模型,用于对视频帧进行分类,以便进行可用性和癌症筛查。我们利用这些计算机视觉模型构建了一个视频流注释系统。该系统将柔性喉镜检查结果分为 "潜在恶性 "或 "可能良性",并对检测到的病变进行分割。此外,该模型还为每种分类提供了置信度:结果:软喉镜癌症筛查模型的总体准确率为 92%。在癌症筛查方面,灵敏度为 97.7%,特异度为 76.9%。分段模型在 0.50 的交集-过联合时的平均精确度为 0.595。阳性筛查结果的置信度可帮助临床医生就筛查结果向患者提供咨询:结论:我们的模型对喉癌筛查具有高度敏感性和足够的特异性。分割有助于内镜医师识别和描述潜在的病变。需要进一步优化该模型,以便在临床环境中部署,在柔性喉镜检查过程中进行实时标注。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.10
自引率
7.10%
发文量
171
审稿时长
4-8 weeks
期刊介绍: The Annals of Otology, Rhinology & Laryngology publishes original manuscripts of clinical and research importance in otolaryngology–head and neck medicine and surgery, otology, neurotology, bronchoesophagology, laryngology, rhinology, head and neck oncology and surgery, plastic and reconstructive surgery, pediatric otolaryngology, audiology, and speech pathology. In-depth studies (supplements), papers of historical interest, and reviews of computer software and applications in otolaryngology are also published, as well as imaging, pathology, and clinicopathology studies, book reviews, and letters to the editor. AOR is the official journal of the American Broncho-Esophagological Association.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信