mmFusion: mmWave-Assisted Mono Speech Enhancement for Multisource Aliasing and Addressing

IF 4.3 2区 综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Xiangyi Tang;Yuyong Xiong;Haibin Meng;Wendi Tian;Qingbo He;Zhike Peng
{"title":"mmFusion: mmWave-Assisted Mono Speech Enhancement for Multisource Aliasing and Addressing","authors":"Xiangyi Tang;Yuyong Xiong;Haibin Meng;Wendi Tian;Qingbo He;Zhike Peng","doi":"10.1109/JSEN.2025.3588468","DOIUrl":null,"url":null,"abstract":"Separately extracting multiple overlapping target speech signals is a challenge. Microphone arrays can perform areal sound capture but require a large space to achieve good resolution. The emerging millimeter-wave (mmWave) vibration-sensing technique can reconstruct speech by measuring the vibrations of target sound sources. However, because of the weak vibrations of the high-frequency speech component, the key semantic content is likely to be lost. We aim to design a compact speech-sensing method termed as mmFusion to achieve antialiasing high-quality speech perception, which integrates an mmWave radar and a mono microphone. In mmFusion, the antialiasing property of mmWave radio and the high fidelity of the microphone are retained. Also, mmFusion does not rely on any voiceprint information of the speaker as a prior. By generating a fused signal using the microphone signal and the vibration signal of the target, and then enhancing it through a diffusion model, mmFusion can achieve selective high-quality perception of multiple speech sources. The experimental results demonstrate that mmFusion performs well across ten different metrics. In scenarios with overlapping sources and strong noise, mmFusion can independently locate and enhance speech signals from different sources, thereby achieving high-quality speech perception.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 18","pages":"35223-35236"},"PeriodicalIF":4.3000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/11107271/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Separately extracting multiple overlapping target speech signals is a challenge. Microphone arrays can perform areal sound capture but require a large space to achieve good resolution. The emerging millimeter-wave (mmWave) vibration-sensing technique can reconstruct speech by measuring the vibrations of target sound sources. However, because of the weak vibrations of the high-frequency speech component, the key semantic content is likely to be lost. We aim to design a compact speech-sensing method termed as mmFusion to achieve antialiasing high-quality speech perception, which integrates an mmWave radar and a mono microphone. In mmFusion, the antialiasing property of mmWave radio and the high fidelity of the microphone are retained. Also, mmFusion does not rely on any voiceprint information of the speaker as a prior. By generating a fused signal using the microphone signal and the vibration signal of the target, and then enhancing it through a diffusion model, mmFusion can achieve selective high-quality perception of multiple speech sources. The experimental results demonstrate that mmFusion performs well across ten different metrics. In scenarios with overlapping sources and strong noise, mmFusion can independently locate and enhance speech signals from different sources, thereby achieving high-quality speech perception.
mmFusion:用于多源混叠和寻址的毫米波辅助单声道语音增强
单独提取多个重叠的目标语音信号是一个难题。麦克风阵列可以进行区域声音捕获,但需要较大的空间才能获得良好的分辨率。新兴的毫米波振动传感技术可以通过测量目标声源的振动来重建语音。然而,由于高频语音分量的微弱振动,关键的语义内容很可能丢失。我们的目标是设计一种紧凑的语音传感方法,称为mmFusion,以实现抗混叠的高质量语音感知,该方法集成了毫米波雷达和单声道麦克风。mmFusion既保留了毫米波无线电的抗混叠特性,又保留了麦克风的高保真度。此外,mmFusion不依赖于说话者的任何声纹信息作为先验。利用传声器信号和目标振动信号产生融合信号,然后通过扩散模型对其进行增强,mmFusion可以实现对多个语音源的选择性高质量感知。实验结果表明,mmFusion在10个不同的指标上都表现良好。在声源重叠、噪声强的场景下,mmFusion能够独立定位和增强不同声源的语音信号,从而实现高质量的语音感知。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Sensors Journal
IEEE Sensors Journal 工程技术-工程:电子与电气
CiteScore
7.70
自引率
14.00%
发文量
2058
审稿时长
5.2 months
期刊介绍: The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信