{"title":"mmFusion: mmWave-Assisted Mono Speech Enhancement for Multisource Aliasing and Addressing","authors":"Xiangyi Tang;Yuyong Xiong;Haibin Meng;Wendi Tian;Qingbo He;Zhike Peng","doi":"10.1109/JSEN.2025.3588468","DOIUrl":null,"url":null,"abstract":"Separately extracting multiple overlapping target speech signals is a challenge. Microphone arrays can perform areal sound capture but require a large space to achieve good resolution. The emerging millimeter-wave (mmWave) vibration-sensing technique can reconstruct speech by measuring the vibrations of target sound sources. However, because of the weak vibrations of the high-frequency speech component, the key semantic content is likely to be lost. We aim to design a compact speech-sensing method termed as mmFusion to achieve antialiasing high-quality speech perception, which integrates an mmWave radar and a mono microphone. In mmFusion, the antialiasing property of mmWave radio and the high fidelity of the microphone are retained. Also, mmFusion does not rely on any voiceprint information of the speaker as a prior. By generating a fused signal using the microphone signal and the vibration signal of the target, and then enhancing it through a diffusion model, mmFusion can achieve selective high-quality perception of multiple speech sources. The experimental results demonstrate that mmFusion performs well across ten different metrics. In scenarios with overlapping sources and strong noise, mmFusion can independently locate and enhance speech signals from different sources, thereby achieving high-quality speech perception.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 18","pages":"35223-35236"},"PeriodicalIF":4.3000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/11107271/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Separately extracting multiple overlapping target speech signals is a challenge. Microphone arrays can perform areal sound capture but require a large space to achieve good resolution. The emerging millimeter-wave (mmWave) vibration-sensing technique can reconstruct speech by measuring the vibrations of target sound sources. However, because of the weak vibrations of the high-frequency speech component, the key semantic content is likely to be lost. We aim to design a compact speech-sensing method termed as mmFusion to achieve antialiasing high-quality speech perception, which integrates an mmWave radar and a mono microphone. In mmFusion, the antialiasing property of mmWave radio and the high fidelity of the microphone are retained. Also, mmFusion does not rely on any voiceprint information of the speaker as a prior. By generating a fused signal using the microphone signal and the vibration signal of the target, and then enhancing it through a diffusion model, mmFusion can achieve selective high-quality perception of multiple speech sources. The experimental results demonstrate that mmFusion performs well across ten different metrics. In scenarios with overlapping sources and strong noise, mmFusion can independently locate and enhance speech signals from different sources, thereby achieving high-quality speech perception.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice