用于实时助听器应用的轻量级因果音分离模型

IF 2.2 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC
Harsh Mishra;Mahendra K. Shukla;Priyanshu;Som Dengre;Yashveer Singh;Om Jee Pandey
{"title":"用于实时助听器应用的轻量级因果音分离模型","authors":"Harsh Mishra;Mahendra K. Shukla;Priyanshu;Som Dengre;Yashveer Singh;Om Jee Pandey","doi":"10.1109/LSENS.2025.3546132","DOIUrl":null,"url":null,"abstract":"Real-time audio processing is crucial for hearing aid IoT applications, where low latency and efficiency are paramount. State-of-the-art models like Demucs achieve high signal-to-distortion ratio (SDR) but are unsuitable for real-time use due to their noncausal nature and high latency. This letter introduces a lightweight causal model tailored for real-time hearing aid applications, designed to minimize latency while maintaining acceptable SDR. The model was trained and evaluated on the MUSDB-18 dataset using established protocols. Performance metrics, including SDR and latency, were used to compare it against Demucs. Results show that while Demucs achieves higher SDR, the proposed model significantly reduces latency (9.42 ms compared to 52.25 ms), making it suitable for real-time IoT systems. This research demonstrates the potential of causal architectures in addressing the challenges of real-time audio processing for hearing aids and sets the stage for future improvements in SDR without compromising latency.","PeriodicalId":13014,"journal":{"name":"IEEE Sensors Letters","volume":"9 4","pages":"1-4"},"PeriodicalIF":2.2000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Lightweight Causal Sound Separation Model for Real-Time Hearing Aid Applications\",\"authors\":\"Harsh Mishra;Mahendra K. Shukla;Priyanshu;Som Dengre;Yashveer Singh;Om Jee Pandey\",\"doi\":\"10.1109/LSENS.2025.3546132\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Real-time audio processing is crucial for hearing aid IoT applications, where low latency and efficiency are paramount. State-of-the-art models like Demucs achieve high signal-to-distortion ratio (SDR) but are unsuitable for real-time use due to their noncausal nature and high latency. This letter introduces a lightweight causal model tailored for real-time hearing aid applications, designed to minimize latency while maintaining acceptable SDR. The model was trained and evaluated on the MUSDB-18 dataset using established protocols. Performance metrics, including SDR and latency, were used to compare it against Demucs. Results show that while Demucs achieves higher SDR, the proposed model significantly reduces latency (9.42 ms compared to 52.25 ms), making it suitable for real-time IoT systems. This research demonstrates the potential of causal architectures in addressing the challenges of real-time audio processing for hearing aids and sets the stage for future improvements in SDR without compromising latency.\",\"PeriodicalId\":13014,\"journal\":{\"name\":\"IEEE Sensors Letters\",\"volume\":\"9 4\",\"pages\":\"1-4\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-02-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10904326/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10904326/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

实时音频处理对于助听器物联网应用至关重要,因为低延迟和效率至关重要。最先进的模型,如Demucs,实现了高信号失真比(SDR),但由于其非因果性和高延迟,不适合实时使用。这封信介绍了为实时助听器应用量身定制的轻量级因果模型,旨在最大限度地减少延迟,同时保持可接受的SDR。该模型使用已建立的协议在MUSDB-18数据集上进行训练和评估。使用性能指标(包括SDR和延迟)将其与Demucs进行比较。结果表明,虽然Demucs实现了更高的SDR,但所提出的模型显著降低了延迟(9.42 ms,而不是52.25 ms),使其适用于实时物联网系统。本研究展示了因果架构在解决助听器实时音频处理挑战方面的潜力,并为未来在不影响延迟的情况下改进SDR奠定了基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Lightweight Causal Sound Separation Model for Real-Time Hearing Aid Applications
Real-time audio processing is crucial for hearing aid IoT applications, where low latency and efficiency are paramount. State-of-the-art models like Demucs achieve high signal-to-distortion ratio (SDR) but are unsuitable for real-time use due to their noncausal nature and high latency. This letter introduces a lightweight causal model tailored for real-time hearing aid applications, designed to minimize latency while maintaining acceptable SDR. The model was trained and evaluated on the MUSDB-18 dataset using established protocols. Performance metrics, including SDR and latency, were used to compare it against Demucs. Results show that while Demucs achieves higher SDR, the proposed model significantly reduces latency (9.42 ms compared to 52.25 ms), making it suitable for real-time IoT systems. This research demonstrates the potential of causal architectures in addressing the challenges of real-time audio processing for hearing aids and sets the stage for future improvements in SDR without compromising latency.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Sensors Letters
IEEE Sensors Letters Engineering-Electrical and Electronic Engineering
CiteScore
3.50
自引率
7.10%
发文量
194
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信