{"title":"基于听觉雷达和网络摄像头的语音恢复","authors":"Yue Ma, Hong Hong, Heng Zhao, Hui Li, Chen Gu, Xiaohua Zhu","doi":"10.1109/IMBIOC.2019.8777840","DOIUrl":null,"url":null,"abstract":"This paper presents a speech recovery technology based on a 24-GHz portable auditory radar and webcam for noncontact robust speech recognition, recovery and surveillance. The time-varying vocal vibration signal obtained by the continuous-wave auditory radar is used as the sound source excitation while the fitted formant frequency extracted by webcam is used as the vocal tract resonance characteristics to synthesize and recover speech. Experiments of reading single English character are carried out. Compared with microphone-recorded results, the speech recovery technology can accurately extract the formant frequency and recover speech effectively. Subject evaluation results show a high relatively consistency between the synthesized speech and original acoustic speech.","PeriodicalId":171472,"journal":{"name":"2019 IEEE MTT-S International Microwave Biomedical Conference (IMBioC)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Speech Recovery Based On Auditory Radar and Webcam\",\"authors\":\"Yue Ma, Hong Hong, Heng Zhao, Hui Li, Chen Gu, Xiaohua Zhu\",\"doi\":\"10.1109/IMBIOC.2019.8777840\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a speech recovery technology based on a 24-GHz portable auditory radar and webcam for noncontact robust speech recognition, recovery and surveillance. The time-varying vocal vibration signal obtained by the continuous-wave auditory radar is used as the sound source excitation while the fitted formant frequency extracted by webcam is used as the vocal tract resonance characteristics to synthesize and recover speech. Experiments of reading single English character are carried out. Compared with microphone-recorded results, the speech recovery technology can accurately extract the formant frequency and recover speech effectively. Subject evaluation results show a high relatively consistency between the synthesized speech and original acoustic speech.\",\"PeriodicalId\":171472,\"journal\":{\"name\":\"2019 IEEE MTT-S International Microwave Biomedical Conference (IMBioC)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE MTT-S International Microwave Biomedical Conference (IMBioC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IMBIOC.2019.8777840\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE MTT-S International Microwave Biomedical Conference (IMBioC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMBIOC.2019.8777840","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech Recovery Based On Auditory Radar and Webcam
This paper presents a speech recovery technology based on a 24-GHz portable auditory radar and webcam for noncontact robust speech recognition, recovery and surveillance. The time-varying vocal vibration signal obtained by the continuous-wave auditory radar is used as the sound source excitation while the fitted formant frequency extracted by webcam is used as the vocal tract resonance characteristics to synthesize and recover speech. Experiments of reading single English character are carried out. Compared with microphone-recorded results, the speech recovery technology can accurately extract the formant frequency and recover speech effectively. Subject evaluation results show a high relatively consistency between the synthesized speech and original acoustic speech.