Feature-Based Audiogram Value Estimator (FAVE): Estimating Numerical Thresholds from Scanned Images of Handwritten Audiograms.

IF 5.7 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

Journal of Medical Systems Pub Date : 2025-02-27 DOI:10.1007/s10916-025-02146-7

Paul G Mayo, Kenneth I Vaden, Lois J Matthews, Judy R Dubno

{"title":"Feature-Based Audiogram Value Estimator (FAVE): Estimating Numerical Thresholds from Scanned Images of Handwritten Audiograms.","authors":"Paul G Mayo, Kenneth I Vaden, Lois J Matthews, Judy R Dubno","doi":"10.1007/s10916-025-02146-7","DOIUrl":null,"url":null,"abstract":"<p><p>Hearing loss is a public health concern that affects millions of people globally. Clinically, a person's hearing sensitivity is often measured using pure-tone audiometry and visualized as a pure-tone audiogram, a plot of hearing sensitivity as a function of frequency. Digital test equipment allows clinicians to store audiograms as numerical values, though some practices write audiograms by hand and store them as digital images in electronic health records systems. This leaves the numerical values inaccessible to public-health researchers unless manually interpreted. Therefore, this study developed machine-learning models for estimating numerical threshold values from scanned images of handwritten audiograms. Training data were a novel set of 556 handwritten audiograms from a longitudinal cohort study of age-related hearing loss. The models were sliding-window, single-class object detectors based on Aggregate Channel Features, altogether called Feature-based Audiogram Value Estimator or \"FAVE\". Model accuracy was determined using symbol location accuracy and comparing estimated numerical threshold values to known values from an electronic database. FAVE resulted in an average of 87.0% recall and 96.2% precision for symbol locations. The numerical threshold values were less accurate, with 78.3% of estimations having no error, though threshold estimates were not significantly different from true thresholds. Threshold estimation was more accurate than pre-trained deep learning approaches for the current test set. Future work should consider implementing detectors with similar image channels and identify limitations on symbol and axis tick label detection.</p>","PeriodicalId":16338,"journal":{"name":"Journal of Medical Systems","volume":"49 1","pages":"32"},"PeriodicalIF":5.7000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12034326/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Systems","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10916-025-02146-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Hearing loss is a public health concern that affects millions of people globally. Clinically, a person's hearing sensitivity is often measured using pure-tone audiometry and visualized as a pure-tone audiogram, a plot of hearing sensitivity as a function of frequency. Digital test equipment allows clinicians to store audiograms as numerical values, though some practices write audiograms by hand and store them as digital images in electronic health records systems. This leaves the numerical values inaccessible to public-health researchers unless manually interpreted. Therefore, this study developed machine-learning models for estimating numerical threshold values from scanned images of handwritten audiograms. Training data were a novel set of 556 handwritten audiograms from a longitudinal cohort study of age-related hearing loss. The models were sliding-window, single-class object detectors based on Aggregate Channel Features, altogether called Feature-based Audiogram Value Estimator or "FAVE". Model accuracy was determined using symbol location accuracy and comparing estimated numerical threshold values to known values from an electronic database. FAVE resulted in an average of 87.0% recall and 96.2% precision for symbol locations. The numerical threshold values were less accurate, with 78.3% of estimations having no error, though threshold estimates were not significantly different from true thresholds. Threshold estimation was more accurate than pre-trained deep learning approaches for the current test set. Future work should consider implementing detectors with similar image channels and identify limitations on symbol and axis tick label detection.

查看原文本刊更多论文

基于特征的听力图值估计（FAVE）：从手写听力图扫描图像估计数值阈值。

听力损失是影响全球数百万人的公共卫生问题。临床上，一个人的听力灵敏度通常是用纯音测听法测量的，并以纯音听力图可视化，这是一个听力灵敏度作为频率函数的图。数字测试设备允许临床医生将听力图存储为数值，尽管有些做法是手写听力图并将其作为数字图像存储在电子健康记录系统中。这使得公共卫生研究人员除非手工解释，否则无法获得数值。因此，本研究开发了机器学习模型，用于估计手写听力图扫描图像的数值阈值。训练数据是一组新颖的556个手写听音图，来自一项与年龄相关的听力损失的纵向队列研究。这些模型是基于聚合通道特征的滑动窗口单类目标检测器，统称为基于特征的听力图值估计器或“FAVE”。模型精度是通过符号定位精度确定的，并将估计的数值阈值与电子数据库中的已知值进行比较。平均查全率为87.0%，查准率为96.2%。数值阈值的准确性较低，尽管阈值估计值与真实阈值没有显著差异，但78.3%的估计值没有误差。对于当前的测试集，阈值估计比预训练的深度学习方法更准确。未来的工作应该考虑实现具有类似图像通道的检测器，并确定符号和轴刻度标签检测的局限性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Medical Systems 医学-卫生保健

CiteScore

11.60

自引率

1.90%

发文量

审稿时长

4.8 months

期刊介绍： Journal of Medical Systems provides a forum for the presentation and discussion of the increasingly extensive applications of new systems techniques and methods in hospital clinic and physician''s office administration; pathology radiology and pharmaceutical delivery systems; medical records storage and retrieval; and ancillary patient-support systems. The journal publishes informative articles essays and studies across the entire scale of medical systems from large hospital programs to novel small-scale medical services. Education is an integral part of this amalgamation of sciences and selected articles are published in this area. Since existing medical systems are constantly being modified to fit particular circumstances and to solve specific problems the journal includes a special section devoted to status reports on current installations.