Paul G Mayo, Kenneth I Vaden, Lois J Matthews, Judy R Dubno
{"title":"Feature-Based Audiogram Value Estimator (FAVE): Estimating Numerical Thresholds from Scanned Images of Handwritten Audiograms.","authors":"Paul G Mayo, Kenneth I Vaden, Lois J Matthews, Judy R Dubno","doi":"10.1007/s10916-025-02146-7","DOIUrl":null,"url":null,"abstract":"<p><p>Hearing loss is a public health concern that affects millions of people globally. Clinically, a person's hearing sensitivity is often measured using pure-tone audiometry and visualized as a pure-tone audiogram, a plot of hearing sensitivity as a function of frequency. Digital test equipment allows clinicians to store audiograms as numerical values, though some practices write audiograms by hand and store them as digital images in electronic health records systems. This leaves the numerical values inaccessible to public-health researchers unless manually interpreted. Therefore, this study developed machine-learning models for estimating numerical threshold values from scanned images of handwritten audiograms. Training data were a novel set of 556 handwritten audiograms from a longitudinal cohort study of age-related hearing loss. The models were sliding-window, single-class object detectors based on Aggregate Channel Features, altogether called Feature-based Audiogram Value Estimator or \"FAVE\". Model accuracy was determined using symbol location accuracy and comparing estimated numerical threshold values to known values from an electronic database. FAVE resulted in an average of 87.0% recall and 96.2% precision for symbol locations. The numerical threshold values were less accurate, with 78.3% of estimations having no error, though threshold estimates were not significantly different from true thresholds. Threshold estimation was more accurate than pre-trained deep learning approaches for the current test set. Future work should consider implementing detectors with similar image channels and identify limitations on symbol and axis tick label detection.</p>","PeriodicalId":16338,"journal":{"name":"Journal of Medical Systems","volume":"49 1","pages":"32"},"PeriodicalIF":3.5000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Systems","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10916-025-02146-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Hearing loss is a public health concern that affects millions of people globally. Clinically, a person's hearing sensitivity is often measured using pure-tone audiometry and visualized as a pure-tone audiogram, a plot of hearing sensitivity as a function of frequency. Digital test equipment allows clinicians to store audiograms as numerical values, though some practices write audiograms by hand and store them as digital images in electronic health records systems. This leaves the numerical values inaccessible to public-health researchers unless manually interpreted. Therefore, this study developed machine-learning models for estimating numerical threshold values from scanned images of handwritten audiograms. Training data were a novel set of 556 handwritten audiograms from a longitudinal cohort study of age-related hearing loss. The models were sliding-window, single-class object detectors based on Aggregate Channel Features, altogether called Feature-based Audiogram Value Estimator or "FAVE". Model accuracy was determined using symbol location accuracy and comparing estimated numerical threshold values to known values from an electronic database. FAVE resulted in an average of 87.0% recall and 96.2% precision for symbol locations. The numerical threshold values were less accurate, with 78.3% of estimations having no error, though threshold estimates were not significantly different from true thresholds. Threshold estimation was more accurate than pre-trained deep learning approaches for the current test set. Future work should consider implementing detectors with similar image channels and identify limitations on symbol and axis tick label detection.
期刊介绍:
Journal of Medical Systems provides a forum for the presentation and discussion of the increasingly extensive applications of new systems techniques and methods in hospital clinic and physician''s office administration; pathology radiology and pharmaceutical delivery systems; medical records storage and retrieval; and ancillary patient-support systems. The journal publishes informative articles essays and studies across the entire scale of medical systems from large hospital programs to novel small-scale medical services. Education is an integral part of this amalgamation of sciences and selected articles are published in this area. Since existing medical systems are constantly being modified to fit particular circumstances and to solve specific problems the journal includes a special section devoted to status reports on current installations.