用于实时助听器应用的轻量级因果音分离模型

IF 2.2 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Sensors Letters Pub Date : 2025-02-26 DOI:10.1109/LSENS.2025.3546132

Harsh Mishra;Mahendra K. Shukla;Priyanshu;Som Dengre;Yashveer Singh;Om Jee Pandey

{"title":"用于实时助听器应用的轻量级因果音分离模型","authors":"Harsh Mishra;Mahendra K. Shukla;Priyanshu;Som Dengre;Yashveer Singh;Om Jee Pandey","doi":"10.1109/LSENS.2025.3546132","DOIUrl":null,"url":null,"abstract":"Real-time audio processing is crucial for hearing aid IoT applications, where low latency and efficiency are paramount. State-of-the-art models like Demucs achieve high signal-to-distortion ratio (SDR) but are unsuitable for real-time use due to their noncausal nature and high latency. This letter introduces a lightweight causal model tailored for real-time hearing aid applications, designed to minimize latency while maintaining acceptable SDR. The model was trained and evaluated on the MUSDB-18 dataset using established protocols. Performance metrics, including SDR and latency, were used to compare it against Demucs. Results show that while Demucs achieves higher SDR, the proposed model significantly reduces latency (9.42 ms compared to 52.25 ms), making it suitable for real-time IoT systems. This research demonstrates the potential of causal architectures in addressing the challenges of real-time audio processing for hearing aids and sets the stage for future improvements in SDR without compromising latency.","PeriodicalId":13014,"journal":{"name":"IEEE Sensors Letters","volume":"9 4","pages":"1-4"},"PeriodicalIF":2.2000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Lightweight Causal Sound Separation Model for Real-Time Hearing Aid Applications\",\"authors\":\"Harsh Mishra;Mahendra K. Shukla;Priyanshu;Som Dengre;Yashveer Singh;Om Jee Pandey\",\"doi\":\"10.1109/LSENS.2025.3546132\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Real-time audio processing is crucial for hearing aid IoT applications, where low latency and efficiency are paramount. State-of-the-art models like Demucs achieve high signal-to-distortion ratio (SDR) but are unsuitable for real-time use due to their noncausal nature and high latency. This letter introduces a lightweight causal model tailored for real-time hearing aid applications, designed to minimize latency while maintaining acceptable SDR. The model was trained and evaluated on the MUSDB-18 dataset using established protocols. Performance metrics, including SDR and latency, were used to compare it against Demucs. Results show that while Demucs achieves higher SDR, the proposed model significantly reduces latency (9.42 ms compared to 52.25 ms), making it suitable for real-time IoT systems. This research demonstrates the potential of causal architectures in addressing the challenges of real-time audio processing for hearing aids and sets the stage for future improvements in SDR without compromising latency.\",\"PeriodicalId\":13014,\"journal\":{\"name\":\"IEEE Sensors Letters\",\"volume\":\"9 4\",\"pages\":\"1-4\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-02-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10904326/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10904326/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

实时音频处理对于助听器物联网应用至关重要，因为低延迟和效率至关重要。最先进的模型，如Demucs，实现了高信号失真比（SDR），但由于其非因果性和高延迟，不适合实时使用。这封信介绍了为实时助听器应用量身定制的轻量级因果模型，旨在最大限度地减少延迟，同时保持可接受的SDR。该模型使用已建立的协议在MUSDB-18数据集上进行训练和评估。使用性能指标（包括SDR和延迟）将其与Demucs进行比较。结果表明，虽然Demucs实现了更高的SDR，但所提出的模型显著降低了延迟（9.42 ms，而不是52.25 ms），使其适用于实时物联网系统。本研究展示了因果架构在解决助听器实时音频处理挑战方面的潜力，并为未来在不影响延迟的情况下改进SDR奠定了基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Lightweight Causal Sound Separation Model for Real-Time Hearing Aid Applications

Real-time audio processing is crucial for hearing aid IoT applications, where low latency and efficiency are paramount. State-of-the-art models like Demucs achieve high signal-to-distortion ratio (SDR) but are unsuitable for real-time use due to their noncausal nature and high latency. This letter introduces a lightweight causal model tailored for real-time hearing aid applications, designed to minimize latency while maintaining acceptable SDR. The model was trained and evaluated on the MUSDB-18 dataset using established protocols. Performance metrics, including SDR and latency, were used to compare it against Demucs. Results show that while Demucs achieves higher SDR, the proposed model significantly reduces latency (9.42 ms compared to 52.25 ms), making it suitable for real-time IoT systems. This research demonstrates the potential of causal architectures in addressing the challenges of real-time audio processing for hearing aids and sets the stage for future improvements in SDR without compromising latency.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Sensors Letters Engineering-Electrical and Electronic Engineering

CiteScore

3.50

自引率

7.10%

发文量

194