Mickey Kondo , Carlo Russo , Matthew Fadhil , Ben Hunter , Leon Chen , Rhea Darbari Kaul , Priyanka Rana , Yang Song , Nicholas Jufas , Antonio Di Ieva , Nirmal Patel
{"title":"Developing a ground truth for a convolutional neural network-based segmentation of anatomical structures in endoscopic ear surgery videos","authors":"Mickey Kondo , Carlo Russo , Matthew Fadhil , Ben Hunter , Leon Chen , Rhea Darbari Kaul , Priyanka Rana , Yang Song , Nicholas Jufas , Antonio Di Ieva , Nirmal Patel","doi":"10.1016/j.anl.2025.04.013","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>This study aimed to develop a ground truth for a convolutional neural network-based segmentation of the stapes, chorda tympani and facial nerve in endoscopic ear surgery videos, and to evaluate the accuracy of artificial intelligence (AI) predictions on test videos.</div></div><div><h3>Methods</h3><div>Forty prospectively-gathered endoscopic ear tympanotomy videos were segmented to establish a ground truth. This ground truth was used to prime a convolutional neural network (CNN) to predict the stapes, chorda tympani and facial nerve. The CNN was then tested on an unlabeled series of videos, and the accuracy of predictions compared to ground truth labeling by calculating Dice scores, sensitivity and specificity.</div></div><div><h3>Results</h3><div>An overall Dice score of 77.94 % across all three elements was obtained, with a sensitivity of 78.42 % and specificity of 99.79 %.</div></div><div><h3>Conclusion</h3><div>Convolutional neural network analysis is effective at identifying key anatomical structures in endoscopic ear surgery videos. Further validation of the CNN on additional video datasets is necessary to optimize model performance.</div></div>","PeriodicalId":55627,"journal":{"name":"Auris Nasus Larynx","volume":"52 4","pages":"Pages 342-345"},"PeriodicalIF":1.6000,"publicationDate":"2025-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Auris Nasus Larynx","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0385814625000744","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
This study aimed to develop a ground truth for a convolutional neural network-based segmentation of the stapes, chorda tympani and facial nerve in endoscopic ear surgery videos, and to evaluate the accuracy of artificial intelligence (AI) predictions on test videos.
Methods
Forty prospectively-gathered endoscopic ear tympanotomy videos were segmented to establish a ground truth. This ground truth was used to prime a convolutional neural network (CNN) to predict the stapes, chorda tympani and facial nerve. The CNN was then tested on an unlabeled series of videos, and the accuracy of predictions compared to ground truth labeling by calculating Dice scores, sensitivity and specificity.
Results
An overall Dice score of 77.94 % across all three elements was obtained, with a sensitivity of 78.42 % and specificity of 99.79 %.
Conclusion
Convolutional neural network analysis is effective at identifying key anatomical structures in endoscopic ear surgery videos. Further validation of the CNN on additional video datasets is necessary to optimize model performance.
期刊介绍:
The international journal Auris Nasus Larynx provides the opportunity for rapid, carefully reviewed publications concerning the fundamental and clinical aspects of otorhinolaryngology and related fields. This includes otology, neurotology, bronchoesophagology, laryngology, rhinology, allergology, head and neck medicine and oncologic surgery, maxillofacial and plastic surgery, audiology, speech science.
Original papers, short communications and original case reports can be submitted. Reviews on recent developments are invited regularly and Letters to the Editor commenting on papers or any aspect of Auris Nasus Larynx are welcomed.
Founded in 1973 and previously published by the Society for Promotion of International Otorhinolaryngology, the journal is now the official English-language journal of the Oto-Rhino-Laryngological Society of Japan, Inc. The aim of its new international Editorial Board is to make Auris Nasus Larynx an international forum for high quality research and clinical sciences.