Developing a diagnostic support system for audiogram interpretation using deep learning-based object detection.

Journal of otology Pub Date : 2025-03-20 eCollection Date: 2025-01-01 DOI:10.26599/JOTO.2025.9540005

Titipat Achakulvisut, Suchanon Phanthong, Thanawut Timpitak, Kanpat Vesessook, Sirinan Junthong, Withita Utainrat, Kanokrat Bunnag

{"title":"Developing a diagnostic support system for audiogram interpretation using deep learning-based object detection.","authors":"Titipat Achakulvisut, Suchanon Phanthong, Thanawut Timpitak, Kanpat Vesessook, Sirinan Junthong, Withita Utainrat, Kanokrat Bunnag","doi":"10.26599/JOTO.2025.9540005","DOIUrl":null,"url":null,"abstract":"Objective: To develop and evaluate an automated system for digitizing audiograms, classifying hearing loss levels, and comparing their performance with traditional methods and otolaryngologists' interpretations.Designed and methods: We conducted a retrospective diagnostic study using 1,959 audiogram images from patients aged 7 years and older at the Faculty of Medicine, Vajira Hospital, Navamindradhiraj University. We employed an object detection approach to digitize audiograms and developed multiple machine learning models to classify six hearing loss levels. The dataset was split into 70% training (1,407 images) and 30% testing (352 images) sets. We compared our model's performance with classifications based on manually extracted audiogram values and otolaryngologists' interpretations.Result: Our object detection-based model achieved an F1-score of 94.72% in classifying hearing loss levels, comparable to the 96.43% F1-score obtained using manually extracted values. The Light Gradient Boosting Machine (LGBM) model is used as the classifier for the manually extracted data, which achieved top performance with 94.72% accuracy, 94.72% f1-score, 94.72 recall, and 94.72 precision. In object detection based model, The Random Forest Classifier (RFC) model showed the highest 96.43% accuracy in predicting hearing loss level, with a F1-score of 96.43%, recall of 96.43%, and precision of 96.45%.Conclusion: Our proposed automated approach for audiogram digitization and hearing loss classification performs comparably to traditional methods and otolaryngologists' interpretations. This system can potentially assist otolaryngologists in providing more timely and effective treatment by quickly and accurately classifying hearing loss.","PeriodicalId":94336,"journal":{"name":"Journal of otology","volume":"20 1","pages":"26-32"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12510341/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of otology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26599/JOTO.2025.9540005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: To develop and evaluate an automated system for digitizing audiograms, classifying hearing loss levels, and comparing their performance with traditional methods and otolaryngologists' interpretations.

Designed and methods: We conducted a retrospective diagnostic study using 1,959 audiogram images from patients aged 7 years and older at the Faculty of Medicine, Vajira Hospital, Navamindradhiraj University. We employed an object detection approach to digitize audiograms and developed multiple machine learning models to classify six hearing loss levels. The dataset was split into 70% training (1,407 images) and 30% testing (352 images) sets. We compared our model's performance with classifications based on manually extracted audiogram values and otolaryngologists' interpretations.

Result: Our object detection-based model achieved an F1-score of 94.72% in classifying hearing loss levels, comparable to the 96.43% F1-score obtained using manually extracted values. The Light Gradient Boosting Machine (LGBM) model is used as the classifier for the manually extracted data, which achieved top performance with 94.72% accuracy, 94.72% f1-score, 94.72 recall, and 94.72 precision. In object detection based model, The Random Forest Classifier (RFC) model showed the highest 96.43% accuracy in predicting hearing loss level, with a F1-score of 96.43%, recall of 96.43%, and precision of 96.45%.

Conclusion: Our proposed automated approach for audiogram digitization and hearing loss classification performs comparably to traditional methods and otolaryngologists' interpretations. This system can potentially assist otolaryngologists in providing more timely and effective treatment by quickly and accurately classifying hearing loss.

查看原文本刊更多论文

使用基于深度学习的对象检测开发一种听力图解释诊断支持系统。

目的：开发和评估一种听音图数字化、听力损失分级的自动化系统，并将其与传统方法和耳鼻喉科医生的解释进行比较。设计和方法：我们对Navamindradhiraj大学Vajira医院医学院7岁及以上患者的1959张听图图像进行了回顾性诊断研究。我们采用目标检测方法对听力图进行数字化处理，并开发了多个机器学习模型来对六种听力损失级别进行分类。数据集分为70%的训练集（1407张图像）和30%的测试集（352张图像）。我们将模型的性能与基于人工提取的听音值和耳鼻喉科医生解释的分类进行了比较。结果：基于目标检测的模型对听力损失等级的分类准确率为94.72%，与人工提取值的准确率为96.43%相当。采用LGBM （Light Gradient Boosting Machine）模型对人工提取的数据进行分类，准确率为94.72%，f1-score为94.72%，召回率为94.72，精密度为94.72。在基于目标检测的模型中，随机森林分类器（RFC）模型预测听力损失水平的准确率最高，达到96.43%，f1评分为96.43%，召回率为96.43%，准确率为96.45%。结论：我们提出的听图数字化和听力损失分类的自动化方法与传统方法和耳鼻喉科医生的解释相当。该系统可以通过快速准确地分类听力损失，帮助耳鼻喉科医生提供更及时有效的治疗。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of otology

自引率

0.00%

发文量