Yuxuan Shi, Zhen Li, Li Wang, Hong Wang, Xiaofeng Liu, Dantong Gu, Xiao Chen, Xueli Liu, Wentao Gong, Xiaowen Jiang, Wenquan Li, Yongdong Lin, Ke Liu, Deyan Luo, Tao Peng, Xuemei Peng, Meimei Tong, Huizhen Zheng, Xuanchen Zhou, Jianrong Wu, Georges El Fakhri, Mingzhang Chang, Jun Liao, Jie'en Li, Desheng Wang, Jing Ye, Shenhong Qu, Weihong Jiang, Quan Liu, Xicai Sun, Yefeng Zheng, Hongmeng Yu
{"title":"人工智能辅助鼻咽癌内镜图像检测:一项全国性、多中心、模型开发和验证研究。","authors":"Yuxuan Shi, Zhen Li, Li Wang, Hong Wang, Xiaofeng Liu, Dantong Gu, Xiao Chen, Xueli Liu, Wentao Gong, Xiaowen Jiang, Wenquan Li, Yongdong Lin, Ke Liu, Deyan Luo, Tao Peng, Xuemei Peng, Meimei Tong, Huizhen Zheng, Xuanchen Zhou, Jianrong Wu, Georges El Fakhri, Mingzhang Chang, Jun Liao, Jie'en Li, Desheng Wang, Jing Ye, Shenhong Qu, Weihong Jiang, Quan Liu, Xicai Sun, Yefeng Zheng, Hongmeng Yu","doi":"10.1016/j.landig.2025.03.001","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Nasopharyngeal carcinoma is highly curable when diagnosed early. However, the nasopharynx's obscure anatomical position and the similarity of local imaging manifestations with those of other nasopharyngeal diseases often lead to diagnostic challenges, resulting in delayed or missed diagnoses. Our aim was to develop a deep learning algorithm to enhance an otolaryngologist's diagnostic capabilities by differentiating between nasopharyngeal carcinoma, benign hyperplasia, and normal nasopharynx during endoscopic examination.</p><p><strong>Methods: </strong>In this national, multicentre, model development and validation study, we developed a Swin Transformer-based Nasopharyngeal Diagnostic (STND) system to identify nasopharyngeal carcinoma, benign hyperplasia, and normal nasopharynx. STND was developed with 27 362 nasopharyngeal endoscopic images (10 693 biopsy-proven nasopharyngeal carcinoma, 7073 biopsy-proven benign hyperplasia, and 9596 normal nasopharynx) sourced from eight prominent nasopharyngeal carcinoma centres (stage 1), and externally validated with 1885 prospectively acquired images from ten comprehensive hospitals with a high incidence of nasopharyngeal carcinoma (stage 2). Furthermore, we did a fully crossed, multireader, multicase study involving four expert otolaryngologists from four regional leading nasopharyngeal carcinoma centres, and 24 general otolaryngologists from 24 geographically diverse primary hospitals. This study included 400 images to evaluate the diagnostic capabilities of the experts and general otolaryngologists both with and without the aid of the STND system in a real-world environment.</p><p><strong>Findings: </strong>Endoscopic images used in the internal study (Jan 1, 2017, to Jan 31, 2023) were from 15 521 individuals (9033 [58·2%] men and 6488 [41·8%] women; mean age 47·6 years [IQR 38·4-56·8]). Images from 945 participants (538 [56·9%] men and 407 [43·1%] women; mean age 45·2 years [IQR 35·2- 55·2]) were used in the external validation. STND in the internal dataset discriminated normal nasopharynx images from abnormalities (benign hyperplasia and nasopharyngeal carcinoma) with an area under the curve (AUC) of 0·99 (95% CI 0·99-0·99) and malignant images (ie, nasopharyngeal carcinoma) from non-malignant images (ie, benign hyperplasia and normal nasopharynx) with an AUC of 0·99 (95% CI 0·98-0·99). In the external validation, the system had an AUC for the detection of nasopharyngeal carcinoma of 0·95 (95% CI 0·94-0·96), a sensitivity of 91·6% (95% CI 89·3-93·5), and a specificity of 86·1% (95% CI 84·1-87·9). In the multireader, multicase study, the artificial intelligence (AI)-assisted strategy enhanced otolaryngologists' diagnostic accuracy by 7·9%, increasing from 83·4% (95% CI 80·1-86·7, without AI assistance) to 91·2% (95% CI 88·6-93·9, with AI assistance; p<0·0001) for primary care otolaryngologists. Reading time per image decreased with the aid of the AI model (mean 5·0 s [SD 2·5] vs 6·7 s [6·0] without the model; p=0·034).</p><p><strong>Interpretation: </strong>Our deep learning system has shown significant clinical potential for the practical application of nasopharyngeal carcinoma diagnosis through endoscopic images in real-world settings. The system offers substantial benefits for adoption in primary hospitals, aiming to enhance specificity, avoid additional biopsies, and reduce missed diagnoses.</p><p><strong>Funding: </strong>New Technologies of Endoscopic Surgery in Skull Base Tumor: CAMS Innovation Fund for Medical Science; Shanghai Science and Technology Committee Foundation; Shanghai Municipal Key Clinical Specialty; National Key Clinical Specialty Program; and the Young Elite Scientists Sponsorship Program.</p>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":" ","pages":"100869"},"PeriodicalIF":23.8000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Artificial intelligence-assisted detection of nasopharyngeal carcinoma on endoscopic images: a national, multicentre, model development and validation study.\",\"authors\":\"Yuxuan Shi, Zhen Li, Li Wang, Hong Wang, Xiaofeng Liu, Dantong Gu, Xiao Chen, Xueli Liu, Wentao Gong, Xiaowen Jiang, Wenquan Li, Yongdong Lin, Ke Liu, Deyan Luo, Tao Peng, Xuemei Peng, Meimei Tong, Huizhen Zheng, Xuanchen Zhou, Jianrong Wu, Georges El Fakhri, Mingzhang Chang, Jun Liao, Jie'en Li, Desheng Wang, Jing Ye, Shenhong Qu, Weihong Jiang, Quan Liu, Xicai Sun, Yefeng Zheng, Hongmeng Yu\",\"doi\":\"10.1016/j.landig.2025.03.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Nasopharyngeal carcinoma is highly curable when diagnosed early. However, the nasopharynx's obscure anatomical position and the similarity of local imaging manifestations with those of other nasopharyngeal diseases often lead to diagnostic challenges, resulting in delayed or missed diagnoses. Our aim was to develop a deep learning algorithm to enhance an otolaryngologist's diagnostic capabilities by differentiating between nasopharyngeal carcinoma, benign hyperplasia, and normal nasopharynx during endoscopic examination.</p><p><strong>Methods: </strong>In this national, multicentre, model development and validation study, we developed a Swin Transformer-based Nasopharyngeal Diagnostic (STND) system to identify nasopharyngeal carcinoma, benign hyperplasia, and normal nasopharynx. STND was developed with 27 362 nasopharyngeal endoscopic images (10 693 biopsy-proven nasopharyngeal carcinoma, 7073 biopsy-proven benign hyperplasia, and 9596 normal nasopharynx) sourced from eight prominent nasopharyngeal carcinoma centres (stage 1), and externally validated with 1885 prospectively acquired images from ten comprehensive hospitals with a high incidence of nasopharyngeal carcinoma (stage 2). Furthermore, we did a fully crossed, multireader, multicase study involving four expert otolaryngologists from four regional leading nasopharyngeal carcinoma centres, and 24 general otolaryngologists from 24 geographically diverse primary hospitals. This study included 400 images to evaluate the diagnostic capabilities of the experts and general otolaryngologists both with and without the aid of the STND system in a real-world environment.</p><p><strong>Findings: </strong>Endoscopic images used in the internal study (Jan 1, 2017, to Jan 31, 2023) were from 15 521 individuals (9033 [58·2%] men and 6488 [41·8%] women; mean age 47·6 years [IQR 38·4-56·8]). Images from 945 participants (538 [56·9%] men and 407 [43·1%] women; mean age 45·2 years [IQR 35·2- 55·2]) were used in the external validation. STND in the internal dataset discriminated normal nasopharynx images from abnormalities (benign hyperplasia and nasopharyngeal carcinoma) with an area under the curve (AUC) of 0·99 (95% CI 0·99-0·99) and malignant images (ie, nasopharyngeal carcinoma) from non-malignant images (ie, benign hyperplasia and normal nasopharynx) with an AUC of 0·99 (95% CI 0·98-0·99). In the external validation, the system had an AUC for the detection of nasopharyngeal carcinoma of 0·95 (95% CI 0·94-0·96), a sensitivity of 91·6% (95% CI 89·3-93·5), and a specificity of 86·1% (95% CI 84·1-87·9). In the multireader, multicase study, the artificial intelligence (AI)-assisted strategy enhanced otolaryngologists' diagnostic accuracy by 7·9%, increasing from 83·4% (95% CI 80·1-86·7, without AI assistance) to 91·2% (95% CI 88·6-93·9, with AI assistance; p<0·0001) for primary care otolaryngologists. Reading time per image decreased with the aid of the AI model (mean 5·0 s [SD 2·5] vs 6·7 s [6·0] without the model; p=0·034).</p><p><strong>Interpretation: </strong>Our deep learning system has shown significant clinical potential for the practical application of nasopharyngeal carcinoma diagnosis through endoscopic images in real-world settings. The system offers substantial benefits for adoption in primary hospitals, aiming to enhance specificity, avoid additional biopsies, and reduce missed diagnoses.</p><p><strong>Funding: </strong>New Technologies of Endoscopic Surgery in Skull Base Tumor: CAMS Innovation Fund for Medical Science; Shanghai Science and Technology Committee Foundation; Shanghai Municipal Key Clinical Specialty; National Key Clinical Specialty Program; and the Young Elite Scientists Sponsorship Program.</p>\",\"PeriodicalId\":48534,\"journal\":{\"name\":\"Lancet Digital Health\",\"volume\":\" \",\"pages\":\"100869\"},\"PeriodicalIF\":23.8000,\"publicationDate\":\"2025-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Lancet Digital Health\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.landig.2025.03.001\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lancet Digital Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.landig.2025.03.001","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
Artificial intelligence-assisted detection of nasopharyngeal carcinoma on endoscopic images: a national, multicentre, model development and validation study.
Background: Nasopharyngeal carcinoma is highly curable when diagnosed early. However, the nasopharynx's obscure anatomical position and the similarity of local imaging manifestations with those of other nasopharyngeal diseases often lead to diagnostic challenges, resulting in delayed or missed diagnoses. Our aim was to develop a deep learning algorithm to enhance an otolaryngologist's diagnostic capabilities by differentiating between nasopharyngeal carcinoma, benign hyperplasia, and normal nasopharynx during endoscopic examination.
Methods: In this national, multicentre, model development and validation study, we developed a Swin Transformer-based Nasopharyngeal Diagnostic (STND) system to identify nasopharyngeal carcinoma, benign hyperplasia, and normal nasopharynx. STND was developed with 27 362 nasopharyngeal endoscopic images (10 693 biopsy-proven nasopharyngeal carcinoma, 7073 biopsy-proven benign hyperplasia, and 9596 normal nasopharynx) sourced from eight prominent nasopharyngeal carcinoma centres (stage 1), and externally validated with 1885 prospectively acquired images from ten comprehensive hospitals with a high incidence of nasopharyngeal carcinoma (stage 2). Furthermore, we did a fully crossed, multireader, multicase study involving four expert otolaryngologists from four regional leading nasopharyngeal carcinoma centres, and 24 general otolaryngologists from 24 geographically diverse primary hospitals. This study included 400 images to evaluate the diagnostic capabilities of the experts and general otolaryngologists both with and without the aid of the STND system in a real-world environment.
Findings: Endoscopic images used in the internal study (Jan 1, 2017, to Jan 31, 2023) were from 15 521 individuals (9033 [58·2%] men and 6488 [41·8%] women; mean age 47·6 years [IQR 38·4-56·8]). Images from 945 participants (538 [56·9%] men and 407 [43·1%] women; mean age 45·2 years [IQR 35·2- 55·2]) were used in the external validation. STND in the internal dataset discriminated normal nasopharynx images from abnormalities (benign hyperplasia and nasopharyngeal carcinoma) with an area under the curve (AUC) of 0·99 (95% CI 0·99-0·99) and malignant images (ie, nasopharyngeal carcinoma) from non-malignant images (ie, benign hyperplasia and normal nasopharynx) with an AUC of 0·99 (95% CI 0·98-0·99). In the external validation, the system had an AUC for the detection of nasopharyngeal carcinoma of 0·95 (95% CI 0·94-0·96), a sensitivity of 91·6% (95% CI 89·3-93·5), and a specificity of 86·1% (95% CI 84·1-87·9). In the multireader, multicase study, the artificial intelligence (AI)-assisted strategy enhanced otolaryngologists' diagnostic accuracy by 7·9%, increasing from 83·4% (95% CI 80·1-86·7, without AI assistance) to 91·2% (95% CI 88·6-93·9, with AI assistance; p<0·0001) for primary care otolaryngologists. Reading time per image decreased with the aid of the AI model (mean 5·0 s [SD 2·5] vs 6·7 s [6·0] without the model; p=0·034).
Interpretation: Our deep learning system has shown significant clinical potential for the practical application of nasopharyngeal carcinoma diagnosis through endoscopic images in real-world settings. The system offers substantial benefits for adoption in primary hospitals, aiming to enhance specificity, avoid additional biopsies, and reduce missed diagnoses.
Funding: New Technologies of Endoscopic Surgery in Skull Base Tumor: CAMS Innovation Fund for Medical Science; Shanghai Science and Technology Committee Foundation; Shanghai Municipal Key Clinical Specialty; National Key Clinical Specialty Program; and the Young Elite Scientists Sponsorship Program.
期刊介绍:
The Lancet Digital Health publishes important, innovative, and practice-changing research on any topic connected with digital technology in clinical medicine, public health, and global health.
The journal’s open access content crosses subject boundaries, building bridges between health professionals and researchers.By bringing together the most important advances in this multidisciplinary field,The Lancet Digital Health is the most prominent publishing venue in digital health.
We publish a range of content types including Articles,Review, Comment, and Correspondence, contributing to promoting digital technologies in health practice worldwide.