{"title":"DeWinder:利用超声波传感降低单通道风噪","authors":"Kuang Yuan, Shuo Han, Swarun Kumar, Bhiksha Raj","doi":"arxiv-2409.06137","DOIUrl":null,"url":null,"abstract":"The quality of audio recordings in outdoor environments is often degraded by\nthe presence of wind. Mitigating the impact of wind noise on the perceptual\nquality of single-channel speech remains a significant challenge due to its\nnon-stationary characteristics. Prior work in noise suppression treats wind\nnoise as a general background noise without explicit modeling of its\ncharacteristics. In this paper, we leverage ultrasound as an auxiliary modality\nto explicitly sense the airflow and characterize the wind noise. We propose a\nmulti-modal deep-learning framework to fuse the ultrasonic Doppler features and\nspeech signals for wind noise reduction. Our results show that DeWinder can\nsignificantly improve the noise reduction capabilities of state-of-the-art\nspeech enhancement models.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing\",\"authors\":\"Kuang Yuan, Shuo Han, Swarun Kumar, Bhiksha Raj\",\"doi\":\"arxiv-2409.06137\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The quality of audio recordings in outdoor environments is often degraded by\\nthe presence of wind. Mitigating the impact of wind noise on the perceptual\\nquality of single-channel speech remains a significant challenge due to its\\nnon-stationary characteristics. Prior work in noise suppression treats wind\\nnoise as a general background noise without explicit modeling of its\\ncharacteristics. In this paper, we leverage ultrasound as an auxiliary modality\\nto explicitly sense the airflow and characterize the wind noise. We propose a\\nmulti-modal deep-learning framework to fuse the ultrasonic Doppler features and\\nspeech signals for wind noise reduction. Our results show that DeWinder can\\nsignificantly improve the noise reduction capabilities of state-of-the-art\\nspeech enhancement models.\",\"PeriodicalId\":501034,\"journal\":{\"name\":\"arXiv - EE - Signal Processing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - EE - Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.06137\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing
The quality of audio recordings in outdoor environments is often degraded by
the presence of wind. Mitigating the impact of wind noise on the perceptual
quality of single-channel speech remains a significant challenge due to its
non-stationary characteristics. Prior work in noise suppression treats wind
noise as a general background noise without explicit modeling of its
characteristics. In this paper, we leverage ultrasound as an auxiliary modality
to explicitly sense the airflow and characterize the wind noise. We propose a
multi-modal deep-learning framework to fuse the ultrasonic Doppler features and
speech signals for wind noise reduction. Our results show that DeWinder can
significantly improve the noise reduction capabilities of state-of-the-art
speech enhancement models.