{"title":"Sound Source Localization and Speech Enhancement Algorithm Based on Fixed Beamforming","authors":"Fuchun Liu, Yang Yang, Q. Lin","doi":"10.1145/3351917.3351932","DOIUrl":null,"url":null,"abstract":"In voice-based human-computer interaction, various environmental noise may interfere with the normal transmission of information. Noises may typically lead to performance degradation and efficiency reduction of the voice interaction system. Aiming for reliable sound source localization and effective speech enhancement, we researched those algorithms in scenario of voice-based human-computer interaction with indoor intelligent robots, carried out analysis and implementation. In this paper, we analyzed the localization algorithms based on steered-response power, improved, implemented, and tested on it for its stability in practical application. The results indicated the good performance of the algorithm we proposed under the condition that no strong directional noise interferes with the target signal. Reliable two-dimensional sound localization information can be provided, ensuring real-time performance of the system. We also implemented the speech enhancement algorithm based on fixed beamforming, using the sound localization information provided by the two-dimensional sound localization algorithm. The results showed that the algorithm has a certain inhibitory effect on non-correlative noise on the premise of reliable sound source localization and undistorted signal of speech.","PeriodicalId":367885,"journal":{"name":"Proceedings of the 2019 4th International Conference on Automation, Control and Robotics Engineering","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 4th International Conference on Automation, Control and Robotics Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3351917.3351932","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In voice-based human-computer interaction, various environmental noise may interfere with the normal transmission of information. Noises may typically lead to performance degradation and efficiency reduction of the voice interaction system. Aiming for reliable sound source localization and effective speech enhancement, we researched those algorithms in scenario of voice-based human-computer interaction with indoor intelligent robots, carried out analysis and implementation. In this paper, we analyzed the localization algorithms based on steered-response power, improved, implemented, and tested on it for its stability in practical application. The results indicated the good performance of the algorithm we proposed under the condition that no strong directional noise interferes with the target signal. Reliable two-dimensional sound localization information can be provided, ensuring real-time performance of the system. We also implemented the speech enhancement algorithm based on fixed beamforming, using the sound localization information provided by the two-dimensional sound localization algorithm. The results showed that the algorithm has a certain inhibitory effect on non-correlative noise on the premise of reliable sound source localization and undistorted signal of speech.