Blind 3D sound source direction using stereo microphones based on time-delay estimation and polar-pattern histogram
N. Tengtrairat, W. L. Woo
2017 2nd International Conference on Information Technology (INCIT), 2017-11-01
DOI: 10.1109/INCIT.2017.8257881 (https://doi.org/10.1109/INCIT.2017.8257881)
In this research, a new three-dimensional sound source direction algorithm is developed for computing the azimuth and elevation of a source when the spacing between two microphones is limited, in a noise-free environment. The proposed azimuth-estimation method is based on time-delay estimation using maximum likelihood estimation, which forms a histogram of the power-weighted spectrum indexed by attenuation and time delay. To estimate the elevation angle, the proposed method requires only a single recorded signal, namely the one of the two recorded signals with the higher sound pressure level. The elevation angle is computed from the phase delay of the windowed autocorrelated signal. Windowing the autocorrelation of the selected signal reduces the influence of artifacts. A polar-pattern histogram is employed to identify the source position relative to the microphone, i.e., above or below. The proposed method has been demonstrated on real audio signals. Given a narrow microphone spacing, the proposed azimuth-estimation method yields significantly better directional performance than the benchmarked method.
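The abstract gives no implementation details, so the following is only a minimal sketch of the general idea behind a power-weighted attenuation and time-delay histogram. It is written in the style of the well-known DUET approach and is not the authors' algorithm. All function names, the STFT settings, the histogram ranges, the 5 cm microphone spacing, and the far-field mapping from delay to azimuth, sin(theta) = c * tau / d, are assumptions made for illustration.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def stft(x, n_fft=256, hop=128):
    """Hann-windowed STFT; returns an array of shape (frames, n_fft // 2 + 1)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop:i * hop + n_fft] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)


def delay_attenuation_peak(x1, x2, fs, max_delay,
                           n_att=21, n_del=41, n_fft=256, hop=128):
    """Return the (delay, symmetric attenuation) at the histogram peak.

    Hypothetical helper: each time-frequency bin votes for a delay and an
    attenuation, weighted by the cross power |X1 * X2|.
    """
    X1, X2 = stft(x1, n_fft, hop), stft(x2, n_fft, hop)
    omega = 2.0 * np.pi * np.fft.rfftfreq(n_fft, d=1.0 / fs)  # rad/s
    # Skip DC and keep only bins whose phase cannot wrap for |delay| <= max_delay.
    valid = (omega > 0) & (omega * max_delay < np.pi)
    ratio = X2[:, valid] / (X1[:, valid] + 1e-12)
    # If x2(t) = a * x1(t - tau), then X2 / X1 = a * exp(-j * omega * tau).
    delay = -np.angle(ratio) / omega[valid]
    a = np.abs(ratio)
    alpha = a - 1.0 / (a + 1e-12)  # symmetric attenuation, 0 when levels match
    weight = np.abs(X1[:, valid] * X2[:, valid])
    hist, att_edges, del_edges = np.histogram2d(
        alpha.ravel(), delay.ravel(), bins=(n_att, n_del),
        range=[[-1.0, 1.0], [-max_delay, max_delay]], weights=weight.ravel())
    i, j = np.unravel_index(np.argmax(hist), hist.shape)
    tau = 0.5 * (del_edges[j] + del_edges[j + 1])
    att = 0.5 * (att_edges[i] + att_edges[i + 1])
    return tau, att


def delay_to_azimuth(tau, mic_spacing):
    """Far-field assumption: sin(theta) = c * tau / d, result in degrees."""
    s = np.clip(SPEED_OF_SOUND * tau / mic_spacing, -1.0, 1.0)
    return np.degrees(np.arcsin(s))


# Synthetic check: white noise where the second channel lags by one sample.
fs = 16000
mic_spacing = 0.05                       # metres, assumed narrow spacing
max_delay = mic_spacing / SPEED_OF_SOUND
rng = np.random.default_rng(0)
x1 = rng.standard_normal(fs)
x2 = np.roll(x1, 1)
tau_hat, alpha_hat = delay_attenuation_peak(x1, x2, fs, max_delay)
azimuth_deg = delay_to_azimuth(tau_hat, mic_spacing)
```

The resolution of the delay estimate is limited by the histogram bin width, here 2 * max_delay / 41. The frequency mask reflects the usual constraint that a narrow microphone spacing keeps the inter-channel phase unambiguous over a wider band.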