Deep learning models for predicting hearing thresholds based on joint stimulus-frequency otoacoustic emissions and distortion-product otoacoustic emissions
IF 2.5 2区 医学Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY
{"title":"Deep learning models for predicting hearing thresholds based on joint stimulus-frequency otoacoustic emissions and distortion-product otoacoustic emissions","authors":"Runyi Xu , Qin Gong","doi":"10.1016/j.heares.2025.109349","DOIUrl":null,"url":null,"abstract":"<div><div>According to the dual-source generation hypothesis, stimulus-frequency otoacoustic emissions (SFOAEs) and distortion-product OAEs (DPOAEs) arise from different cochlear mechanisms, and both are capable of characterizing hearing loss. However, their joint application for hearing threshold prediction remains unexplored. This study developed an efficient deep learning (DL) model integrating SFOAEs and DPOAEs to quantitatively predict hearing thresholds. Training data for the model were collected from 94 ears with normal hearing and 401 ears with sensorineural hearing loss. Frequency-specific DL models were constructed across five octave frequencies (0.5–8 kHz), with inputs including amplitude spectra and corresponding signal-to-noise ratio spectra of both SFOAEs and DPOAEs. Self-extractors of the model were constructed using convolutional neural network (CNN) and recurrent neural network (RNN), respectively. Cross-validation demonstrated that the dual-OAE model achieved mean absolute errors (MAEs) of 5.17, 3.83, 3.96, 4.71, and 4.90 dB at 0.5–8 kHz, significantly outperforming single-OAE DL models (except DPOAE-based models at 0.5 and 2 kHz) and baseline machine learning models. By reducing the number of OAE stimulus levels, the efficiency-optimized model reduced testing time per individual to approximately 15 min while preserving accuracy. The proposed dual-source OAE (SFOAE and DPOAE)-integrated DL model achieves state-of-the-art accuracy in hearing threshold prediction, with its optimized efficiency establishing a foundation for further developing a practical clinical tool for objective hearing loss diagnosis.</div></div>","PeriodicalId":12881,"journal":{"name":"Hearing Research","volume":"465 ","pages":"Article 109349"},"PeriodicalIF":2.5000,"publicationDate":"2025-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hearing Research","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378595525001674","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
According to the dual-source generation hypothesis, stimulus-frequency otoacoustic emissions (SFOAEs) and distortion-product OAEs (DPOAEs) arise from different cochlear mechanisms, and both are capable of characterizing hearing loss. However, their joint application for hearing threshold prediction remains unexplored. This study developed an efficient deep learning (DL) model integrating SFOAEs and DPOAEs to quantitatively predict hearing thresholds. Training data for the model were collected from 94 ears with normal hearing and 401 ears with sensorineural hearing loss. Frequency-specific DL models were constructed across five octave frequencies (0.5–8 kHz), with inputs including amplitude spectra and corresponding signal-to-noise ratio spectra of both SFOAEs and DPOAEs. Self-extractors of the model were constructed using convolutional neural network (CNN) and recurrent neural network (RNN), respectively. Cross-validation demonstrated that the dual-OAE model achieved mean absolute errors (MAEs) of 5.17, 3.83, 3.96, 4.71, and 4.90 dB at 0.5–8 kHz, significantly outperforming single-OAE DL models (except DPOAE-based models at 0.5 and 2 kHz) and baseline machine learning models. By reducing the number of OAE stimulus levels, the efficiency-optimized model reduced testing time per individual to approximately 15 min while preserving accuracy. The proposed dual-source OAE (SFOAE and DPOAE)-integrated DL model achieves state-of-the-art accuracy in hearing threshold prediction, with its optimized efficiency establishing a foundation for further developing a practical clinical tool for objective hearing loss diagnosis.
期刊介绍:
The aim of the journal is to provide a forum for papers concerned with basic peripheral and central auditory mechanisms. Emphasis is on experimental and clinical studies, but theoretical and methodological papers will also be considered. The journal publishes original research papers, review and mini- review articles, rapid communications, method/protocol and perspective articles.
Papers submitted should deal with auditory anatomy, physiology, psychophysics, imaging, modeling and behavioural studies in animals and humans, as well as hearing aids and cochlear implants. Papers dealing with the vestibular system are also considered for publication. Papers on comparative aspects of hearing and on effects of drugs and environmental contaminants on hearing function will also be considered. Clinical papers will be accepted when they contribute to the understanding of normal and pathological hearing functions.