{"title":"Vib2audio:基于毫米波雷达的微振动传感的稳健声音恢复","authors":"Yingxiao Wu;Changlin Mao;Zhihua Jian;Jianping Han","doi":"10.1109/JSEN.2025.3578648","DOIUrl":null,"url":null,"abstract":"The popularity of smartphones presents a potential risk to personal privacy, particularly through covert eavesdropping by external nonacoustic sensors (e.g., high-speed camera and radio frequency (RF) sensor), where RF-based voice perception technology shows great potential due to its insensitivity to the environment and its ability to traverse occlusions. In this article, we introduce Vib2audio, a high-quality audio recovery system that uses RF signals to recover audio from the tiny vibration of sound source. We discover that the extracted phase spectrum lacks components in the high-frequency band. Therefore, we investigate the frequency detectable range of mmWave radar used in our scenario and develop an improved spectrum reconstruction network based on the pix2pix structure to accurately recover the time-frequency spectrum of audio from the limited frequency band of the vibration signal. In addition, we synthesize the audio waveform of voice from the time-frequency spectrum using a Griffin-Lim vocoder. Our extensive experiments indicate that Vib2audio can robustly recover tiny sound source vibration to recognizable high-quality human voice audio from tiny vibration. In our overall objective and subjective performance evaluation, the reconstructed audio by Vib2audio had a mean absolute error (MAE) of 3.6% in the non-intrusive speech quality assessment (NISQA) and achieved a subjective identifiability assessment score of 3.48. Furthermore, the quantitative evaluation results show that Vib2audio is effective and robust to background noise and multisource interference.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 15","pages":"28845-28860"},"PeriodicalIF":4.3000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Vib2audio: Robust Sound Recovery With Microvibration Sensing via mmWave Radar\",\"authors\":\"Yingxiao Wu;Changlin Mao;Zhihua Jian;Jianping Han\",\"doi\":\"10.1109/JSEN.2025.3578648\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The popularity of smartphones presents a potential risk to personal privacy, particularly through covert eavesdropping by external nonacoustic sensors (e.g., high-speed camera and radio frequency (RF) sensor), where RF-based voice perception technology shows great potential due to its insensitivity to the environment and its ability to traverse occlusions. In this article, we introduce Vib2audio, a high-quality audio recovery system that uses RF signals to recover audio from the tiny vibration of sound source. We discover that the extracted phase spectrum lacks components in the high-frequency band. Therefore, we investigate the frequency detectable range of mmWave radar used in our scenario and develop an improved spectrum reconstruction network based on the pix2pix structure to accurately recover the time-frequency spectrum of audio from the limited frequency band of the vibration signal. In addition, we synthesize the audio waveform of voice from the time-frequency spectrum using a Griffin-Lim vocoder. Our extensive experiments indicate that Vib2audio can robustly recover tiny sound source vibration to recognizable high-quality human voice audio from tiny vibration. In our overall objective and subjective performance evaluation, the reconstructed audio by Vib2audio had a mean absolute error (MAE) of 3.6% in the non-intrusive speech quality assessment (NISQA) and achieved a subjective identifiability assessment score of 3.48. Furthermore, the quantitative evaluation results show that Vib2audio is effective and robust to background noise and multisource interference.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 15\",\"pages\":\"28845-28860\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11039160/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/11039160/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Vib2audio: Robust Sound Recovery With Microvibration Sensing via mmWave Radar
The popularity of smartphones presents a potential risk to personal privacy, particularly through covert eavesdropping by external nonacoustic sensors (e.g., high-speed camera and radio frequency (RF) sensor), where RF-based voice perception technology shows great potential due to its insensitivity to the environment and its ability to traverse occlusions. In this article, we introduce Vib2audio, a high-quality audio recovery system that uses RF signals to recover audio from the tiny vibration of sound source. We discover that the extracted phase spectrum lacks components in the high-frequency band. Therefore, we investigate the frequency detectable range of mmWave radar used in our scenario and develop an improved spectrum reconstruction network based on the pix2pix structure to accurately recover the time-frequency spectrum of audio from the limited frequency band of the vibration signal. In addition, we synthesize the audio waveform of voice from the time-frequency spectrum using a Griffin-Lim vocoder. Our extensive experiments indicate that Vib2audio can robustly recover tiny sound source vibration to recognizable high-quality human voice audio from tiny vibration. In our overall objective and subjective performance evaluation, the reconstructed audio by Vib2audio had a mean absolute error (MAE) of 3.6% in the non-intrusive speech quality assessment (NISQA) and achieved a subjective identifiability assessment score of 3.48. Furthermore, the quantitative evaluation results show that Vib2audio is effective and robust to background noise and multisource interference.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice