用于喉镜识别的非局部双流融合网络。

IF 1.8 4区 医学 Q2 OTORHINOLARYNGOLOGY
Ran Wei , Yan Liang , Lei Geng , Wei Wang , Mei Wei
{"title":"用于喉镜识别的非局部双流融合网络。","authors":"Ran Wei ,&nbsp;Yan Liang ,&nbsp;Lei Geng ,&nbsp;Wei Wang ,&nbsp;Mei Wei","doi":"10.1016/j.amjoto.2024.104565","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>To use deep learning technology to design and implement a model that can automatically classify laryngoscope images and assist doctors in diagnosing laryngeal diseases.</div></div><div><h3>Materials and methods</h3><div>The experiment was based on 3057 images (normal, glottic cancer, granuloma, Reinke's Edema, vocal cord cyst, leukoplakia, nodules and polyps) from the dataset Laryngoscope8. A classification model based on deep neural networks was developed and tested. Model performance was verified by a variety of evaluation measures, including accuracy, recall, specificity, F1-Score and area under the receiver operating characteristic curve. In addition, the Grad-Cam technology was used to visualize the feature map of the model to improve the interpretation of the network.</div></div><div><h3>Results</h3><div>The model has high classification accuracy and robustness, and can accurately classify various types of laryngoscope images. In the test set of independent individuals, the overall accuracy reaches 86.51 %, and the average area under curve value is 0.954. The performance of the model is significantly better than other existing algorithms.</div></div><div><h3>Conclusion</h3><div>This paper proposes a deep learning based automatic classification model for laryngoscope images. By integrating the output features of deep neural network ResNet and Transformer, eight laryngeal diseases can be accurately classified. This indicates that the proposed method can be effectively applied to the study of laryngeal diseases.</div></div>","PeriodicalId":7591,"journal":{"name":"American Journal of Otolaryngology","volume":"46 1","pages":"Article 104565"},"PeriodicalIF":1.8000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A non-local dual-stream fusion network for laryngoscope recognition\",\"authors\":\"Ran Wei ,&nbsp;Yan Liang ,&nbsp;Lei Geng ,&nbsp;Wei Wang ,&nbsp;Mei Wei\",\"doi\":\"10.1016/j.amjoto.2024.104565\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Purpose</h3><div>To use deep learning technology to design and implement a model that can automatically classify laryngoscope images and assist doctors in diagnosing laryngeal diseases.</div></div><div><h3>Materials and methods</h3><div>The experiment was based on 3057 images (normal, glottic cancer, granuloma, Reinke's Edema, vocal cord cyst, leukoplakia, nodules and polyps) from the dataset Laryngoscope8. A classification model based on deep neural networks was developed and tested. Model performance was verified by a variety of evaluation measures, including accuracy, recall, specificity, F1-Score and area under the receiver operating characteristic curve. In addition, the Grad-Cam technology was used to visualize the feature map of the model to improve the interpretation of the network.</div></div><div><h3>Results</h3><div>The model has high classification accuracy and robustness, and can accurately classify various types of laryngoscope images. In the test set of independent individuals, the overall accuracy reaches 86.51 %, and the average area under curve value is 0.954. The performance of the model is significantly better than other existing algorithms.</div></div><div><h3>Conclusion</h3><div>This paper proposes a deep learning based automatic classification model for laryngoscope images. By integrating the output features of deep neural network ResNet and Transformer, eight laryngeal diseases can be accurately classified. This indicates that the proposed method can be effectively applied to the study of laryngeal diseases.</div></div>\",\"PeriodicalId\":7591,\"journal\":{\"name\":\"American Journal of Otolaryngology\",\"volume\":\"46 1\",\"pages\":\"Article 104565\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"American Journal of Otolaryngology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S019607092400351X\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"OTORHINOLARYNGOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Otolaryngology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S019607092400351X","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目的:利用深度学习技术设计并实现一个自动分类喉镜图像,辅助医生诊断喉部疾病的模型。材料和方法:实验基于来自喉镜数据集8的3057张图像(正常、声门癌、肉芽肿、莱因克水肿、声带囊肿、白斑病、结节和息肉)。开发并测试了基于深度神经网络的分类模型。通过准确率、召回率、特异性、F1-Score和受试者工作特征曲线下面积等多种评价指标来验证模型的性能。此外,利用Grad-Cam技术对模型的特征图进行可视化处理,提高了网络的解释能力。结果:该模型具有较高的分类精度和鲁棒性,能够对各类喉镜图像进行准确分类。在独立个体的测试集中,总体准确率达到86.51%,曲线下面积平均值为0.954。该模型的性能明显优于现有的其他算法。结论:提出了一种基于深度学习的喉镜图像自动分类模型。通过整合深度神经网络ResNet和Transformer的输出特征,可以对8种喉部疾病进行准确分类。这表明该方法可以有效地应用于喉部疾病的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A non-local dual-stream fusion network for laryngoscope recognition

Purpose

To use deep learning technology to design and implement a model that can automatically classify laryngoscope images and assist doctors in diagnosing laryngeal diseases.

Materials and methods

The experiment was based on 3057 images (normal, glottic cancer, granuloma, Reinke's Edema, vocal cord cyst, leukoplakia, nodules and polyps) from the dataset Laryngoscope8. A classification model based on deep neural networks was developed and tested. Model performance was verified by a variety of evaluation measures, including accuracy, recall, specificity, F1-Score and area under the receiver operating characteristic curve. In addition, the Grad-Cam technology was used to visualize the feature map of the model to improve the interpretation of the network.

Results

The model has high classification accuracy and robustness, and can accurately classify various types of laryngoscope images. In the test set of independent individuals, the overall accuracy reaches 86.51 %, and the average area under curve value is 0.954. The performance of the model is significantly better than other existing algorithms.

Conclusion

This paper proposes a deep learning based automatic classification model for laryngoscope images. By integrating the output features of deep neural network ResNet and Transformer, eight laryngeal diseases can be accurately classified. This indicates that the proposed method can be effectively applied to the study of laryngeal diseases.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
American Journal of Otolaryngology
American Journal of Otolaryngology 医学-耳鼻喉科学
CiteScore
4.40
自引率
4.00%
发文量
378
审稿时长
41 days
期刊介绍: Be fully informed about developments in otology, neurotology, audiology, rhinology, allergy, laryngology, speech science, bronchoesophagology, facial plastic surgery, and head and neck surgery. Featured sections include original contributions, grand rounds, current reviews, case reports and socioeconomics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信