{"title":"一种新的基于DOA估计的区域搜索方法用于SRP-PHAT语音源定位","authors":"A. D. Firoozabadi, H. R. Abutalebi","doi":"10.5281/ZENODO.41983","DOIUrl":null,"url":null,"abstract":"Steered Response Power-PHAse Transform (SRP-PHAT) method has been already proposed and investigated for the sound source localization. Grid search methods can be used to find global maximum of SRP, but they are so computationally expensive that can not be used in real-time applications. In this paper, we have proposed a SRP-based localization method which works in cascade with a DOA estimation module; i.e. first the direction of speaker is recognized by one of the DOA estimation methods; after that, we bound the search region to a space fragment around estimated direction of speaker; then we use SRP-PHAT algorithm computations and volume contraction methods (such as SRC and CFRC) on this fragmentized regions and decrease computational costs to a large extent. By use of the data collected from different (speaker) scenarios, we demonstrate the accuracy and speed gained by proposed method.","PeriodicalId":409817,"journal":{"name":"2010 18th European Signal Processing Conference","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A new region search method based on DOA estimation for speech source localization by SRP-PHAT method\",\"authors\":\"A. D. Firoozabadi, H. R. Abutalebi\",\"doi\":\"10.5281/ZENODO.41983\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Steered Response Power-PHAse Transform (SRP-PHAT) method has been already proposed and investigated for the sound source localization. Grid search methods can be used to find global maximum of SRP, but they are so computationally expensive that can not be used in real-time applications. In this paper, we have proposed a SRP-based localization method which works in cascade with a DOA estimation module; i.e. first the direction of speaker is recognized by one of the DOA estimation methods; after that, we bound the search region to a space fragment around estimated direction of speaker; then we use SRP-PHAT algorithm computations and volume contraction methods (such as SRC and CFRC) on this fragmentized regions and decrease computational costs to a large extent. By use of the data collected from different (speaker) scenarios, we demonstrate the accuracy and speed gained by proposed method.\",\"PeriodicalId\":409817,\"journal\":{\"name\":\"2010 18th European Signal Processing Conference\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 18th European Signal Processing Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5281/ZENODO.41983\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 18th European Signal Processing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.41983","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A new region search method based on DOA estimation for speech source localization by SRP-PHAT method
Steered Response Power-PHAse Transform (SRP-PHAT) method has been already proposed and investigated for the sound source localization. Grid search methods can be used to find global maximum of SRP, but they are so computationally expensive that can not be used in real-time applications. In this paper, we have proposed a SRP-based localization method which works in cascade with a DOA estimation module; i.e. first the direction of speaker is recognized by one of the DOA estimation methods; after that, we bound the search region to a space fragment around estimated direction of speaker; then we use SRP-PHAT algorithm computations and volume contraction methods (such as SRC and CFRC) on this fragmentized regions and decrease computational costs to a large extent. By use of the data collected from different (speaker) scenarios, we demonstrate the accuracy and speed gained by proposed method.