用NBI喉镜图像进行喉白斑分类的单视图对比学习。

IF 2.3 3区 医学 Q1 OTORHINOLARYNGOLOGY
Zhenzhen You, Botao Han, Zhenghao Shi, Shuangli Du, Minghua Zhao, Zhiyong Lv, Xinhong Hei, Haiqin Liu, Xiaoyong Ren, Yan Yan
{"title":"用NBI喉镜图像进行喉白斑分类的单视图对比学习。","authors":"Zhenzhen You, Botao Han, Zhenghao Shi, Shuangli Du, Minghua Zhao, Zhiyong Lv, Xinhong Hei, Haiqin Liu, Xiaoyong Ren, Yan Yan","doi":"10.1002/hed.28157","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Laryngeal cancer is the second most common upper respiratory tract cancer. Early and accurate diagnosis can improve the cure rate of patients. Laryngoscopy with NBI is a commonly used tool that can help endoscopists diagnose laryngeal diseases. However, the fine classification of laryngeal leukoplakia using NBI images is challenging for computer-aided diagnosis.</p><p><strong>Methods: </strong>In this article, we propose a single-view contrastive learning network to locate lesion regions, construct sample pairs for contrastive learning, and provide pseudo-labels to unlabeled data in order to achieve fine classification under small samples. Firstly, we pretrain the backbone network using the original NBI images. Secondly, in order to augment the number of samples for contrastive learning, we design different patch generation methods based on an attention-guided network. The original NBI images are cropped into small patches for the purpose of generating lesion-related regions and complementary samples. The pseudo-labels of these small patches are obtained by applying the pre-trained backbone network. Finally, we combine the contrastive loss function and the cross-entropy loss function for jointly training the backbone network and contrastive learning network. Our NBI dataset is classified into six categories: normal tissue, inflammatory keratosis, mild dysplasia, moderate dysplasia, severe dysplasia, and squamous cell carcinoma.</p><p><strong>Results and conclusion: </strong>Experimental results demonstrate that our model achieves an accuracy of 96.12%, which is higher than the current mainstream models. Our model also achieves high specificity and sensitivity. The code is available at https://github.com/hans-bbt/single-view-contrastive-learning.</p>","PeriodicalId":55072,"journal":{"name":"Head and Neck-Journal for the Sciences and Specialties of the Head and Neck","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Single-View Contrastive Learning for Laryngeal Leukoplakia Classification With NBI Laryngoscopy Images.\",\"authors\":\"Zhenzhen You, Botao Han, Zhenghao Shi, Shuangli Du, Minghua Zhao, Zhiyong Lv, Xinhong Hei, Haiqin Liu, Xiaoyong Ren, Yan Yan\",\"doi\":\"10.1002/hed.28157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Laryngeal cancer is the second most common upper respiratory tract cancer. Early and accurate diagnosis can improve the cure rate of patients. Laryngoscopy with NBI is a commonly used tool that can help endoscopists diagnose laryngeal diseases. However, the fine classification of laryngeal leukoplakia using NBI images is challenging for computer-aided diagnosis.</p><p><strong>Methods: </strong>In this article, we propose a single-view contrastive learning network to locate lesion regions, construct sample pairs for contrastive learning, and provide pseudo-labels to unlabeled data in order to achieve fine classification under small samples. Firstly, we pretrain the backbone network using the original NBI images. Secondly, in order to augment the number of samples for contrastive learning, we design different patch generation methods based on an attention-guided network. The original NBI images are cropped into small patches for the purpose of generating lesion-related regions and complementary samples. The pseudo-labels of these small patches are obtained by applying the pre-trained backbone network. Finally, we combine the contrastive loss function and the cross-entropy loss function for jointly training the backbone network and contrastive learning network. Our NBI dataset is classified into six categories: normal tissue, inflammatory keratosis, mild dysplasia, moderate dysplasia, severe dysplasia, and squamous cell carcinoma.</p><p><strong>Results and conclusion: </strong>Experimental results demonstrate that our model achieves an accuracy of 96.12%, which is higher than the current mainstream models. Our model also achieves high specificity and sensitivity. The code is available at https://github.com/hans-bbt/single-view-contrastive-learning.</p>\",\"PeriodicalId\":55072,\"journal\":{\"name\":\"Head and Neck-Journal for the Sciences and Specialties of the Head and Neck\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-04-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Head and Neck-Journal for the Sciences and Specialties of the Head and Neck\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/hed.28157\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OTORHINOLARYNGOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Head and Neck-Journal for the Sciences and Specialties of the Head and Neck","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/hed.28157","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:喉癌是第二常见的上呼吸道肿瘤。早期准确的诊断可以提高患者的治愈率。NBI喉镜检查是一种常用的工具,可以帮助内窥镜医生诊断喉部疾病。然而,使用NBI图像对喉白斑进行精细分类对计算机辅助诊断具有挑战性。方法:本文提出一种单视图对比学习网络,定位病变区域,构建样本对进行对比学习,并对未标记数据提供伪标签,实现小样本下的精细分类。首先,利用原始NBI图像对骨干网进行预训练。其次,为了增加对比学习的样本数量,我们设计了基于注意引导网络的不同patch生成方法。原始NBI图像被裁剪成小块,用于生成病变相关区域和互补样本。利用预训练好的骨干网得到这些小块的伪标签。最后,结合对比损失函数和交叉熵损失函数,对骨干网络和对比学习网络进行联合训练。我们的NBI数据集分为六类:正常组织、炎症性角化病、轻度不典型增生、中度不典型增生、重度不典型增生和鳞状细胞癌。结果与结论:实验结果表明,我们的模型达到了96.12%的准确率,高于目前的主流模型。该模型具有较高的特异性和敏感性。代码可在https://github.com/hans-bbt/single-view-contrastive-learning上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Single-View Contrastive Learning for Laryngeal Leukoplakia Classification With NBI Laryngoscopy Images.

Background: Laryngeal cancer is the second most common upper respiratory tract cancer. Early and accurate diagnosis can improve the cure rate of patients. Laryngoscopy with NBI is a commonly used tool that can help endoscopists diagnose laryngeal diseases. However, the fine classification of laryngeal leukoplakia using NBI images is challenging for computer-aided diagnosis.

Methods: In this article, we propose a single-view contrastive learning network to locate lesion regions, construct sample pairs for contrastive learning, and provide pseudo-labels to unlabeled data in order to achieve fine classification under small samples. Firstly, we pretrain the backbone network using the original NBI images. Secondly, in order to augment the number of samples for contrastive learning, we design different patch generation methods based on an attention-guided network. The original NBI images are cropped into small patches for the purpose of generating lesion-related regions and complementary samples. The pseudo-labels of these small patches are obtained by applying the pre-trained backbone network. Finally, we combine the contrastive loss function and the cross-entropy loss function for jointly training the backbone network and contrastive learning network. Our NBI dataset is classified into six categories: normal tissue, inflammatory keratosis, mild dysplasia, moderate dysplasia, severe dysplasia, and squamous cell carcinoma.

Results and conclusion: Experimental results demonstrate that our model achieves an accuracy of 96.12%, which is higher than the current mainstream models. Our model also achieves high specificity and sensitivity. The code is available at https://github.com/hans-bbt/single-view-contrastive-learning.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.00
自引率
6.90%
发文量
278
审稿时长
1.6 months
期刊介绍: Head & Neck is an international multidisciplinary publication of original contributions concerning the diagnosis and management of diseases of the head and neck. This area involves the overlapping interests and expertise of several surgical and medical specialties, including general surgery, neurosurgery, otolaryngology, plastic surgery, oral surgery, dermatology, ophthalmology, pathology, radiotherapy, medical oncology, and the corresponding basic sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信