Yong Soon Park, Jun Ho Jeon, Tae Hoon Kong, Tae Yun Chung, Young Joon Seo
{"title":"Deep Learning Techniques for Ear Diseases Based on Segmentation of the Normal Tympanic Membrane.","authors":"Yong Soon Park, Jun Ho Jeon, Tae Hoon Kong, Tae Yun Chung, Young Joon Seo","doi":"10.21053/ceo.2022.00675","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Otitis media is a common infection worldwide. Owing to the limited number of ear specialists and rapid development of telemedicine, several trials have been conducted to develop novel diagnostic strategies to improve the diagnostic accuracy and screening of patients with otologic diseases based on abnormal otoscopic findings. Although these strategies have demonstrated high diagnostic accuracy for the tympanic membrane (TM), the insufficient explainability of these techniques limits their deployment in clinical practice.</p><p><strong>Methods: </strong>We used a deep convolutional neural network (CNN) model based on the segmentation of a normal TM into five substructures (malleus, umbo, cone of light, pars flaccida, and annulus) to identify abnormalities in otoscopic ear images. The mask R-CNN algorithm learned the labeled images. Subsequently, we evaluated the diagnostic performance of combinations of the five substructures using a three-layer fully connected neural network to determine whether ear disease was present.</p><p><strong>Results: </strong>We obtained the receiver operating characteristic (ROC) curve of the optimal conditions for the presence or absence of eardrum diseases according to each substructure separately or combinations of substructures. The highest area under the curve (0.911) was found for a combination of the malleus, cone of light, and umbo, compared with the corresponding areas under the curve of 0.737-0.873 for each substructure. Thus, an algorithm using these five important normal anatomical structures could prove to be explainable and effective in screening abnormal TMs.</p><p><strong>Conclusion: </strong>This automated algorithm can improve diagnostic accuracy by discriminating between normal and abnormal TMs and can facilitate appropriate and timely referral consultations to improve patients' quality of life in the context of primary care.</p>","PeriodicalId":10318,"journal":{"name":"Clinical and Experimental Otorhinolaryngology","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/c6/ef/ceo-2022-00675.PMC9985991.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Experimental Otorhinolaryngology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21053/ceo.2022.00675","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}
引用次数: 1
Abstract
Objectives: Otitis media is a common infection worldwide. Owing to the limited number of ear specialists and rapid development of telemedicine, several trials have been conducted to develop novel diagnostic strategies to improve the diagnostic accuracy and screening of patients with otologic diseases based on abnormal otoscopic findings. Although these strategies have demonstrated high diagnostic accuracy for the tympanic membrane (TM), the insufficient explainability of these techniques limits their deployment in clinical practice.
Methods: We used a deep convolutional neural network (CNN) model based on the segmentation of a normal TM into five substructures (malleus, umbo, cone of light, pars flaccida, and annulus) to identify abnormalities in otoscopic ear images. The mask R-CNN algorithm learned the labeled images. Subsequently, we evaluated the diagnostic performance of combinations of the five substructures using a three-layer fully connected neural network to determine whether ear disease was present.
Results: We obtained the receiver operating characteristic (ROC) curve of the optimal conditions for the presence or absence of eardrum diseases according to each substructure separately or combinations of substructures. The highest area under the curve (0.911) was found for a combination of the malleus, cone of light, and umbo, compared with the corresponding areas under the curve of 0.737-0.873 for each substructure. Thus, an algorithm using these five important normal anatomical structures could prove to be explainable and effective in screening abnormal TMs.
Conclusion: This automated algorithm can improve diagnostic accuracy by discriminating between normal and abnormal TMs and can facilitate appropriate and timely referral consultations to improve patients' quality of life in the context of primary care.
期刊介绍:
Clinical and Experimental Otorhinolaryngology (Clin Exp Otorhinolaryngol, CEO) is an international peer-reviewed journal on recent developments in diagnosis and treatment of otorhinolaryngology-head and neck surgery and dedicated to the advancement of patient care in ear, nose, throat, head, and neck disorders. This journal publishes original articles relating to both clinical and basic researches, reviews, and clinical trials, encompassing the whole topics of otorhinolaryngology-head and neck surgery.
CEO was first issued in 2008 and this journal is published in English four times (the last day of February, May, August, and November) per year by the Korean Society of Otorhinolaryngology-Head and Neck Surgery. The Journal aims at publishing evidence-based, scientifically written articles from different disciplines of otorhinolaryngology field.
The readership contains clinical/basic research into current practice in otorhinolaryngology, audiology, speech pathology, head and neck oncology, plastic and reconstructive surgery. The readers are otolaryngologists, head and neck surgeons and oncologists, audiologists, and speech pathologists.