{"title":"Using Performance of ASR to Compute Optimal Location of Microphone","authors":"K. Nathwani, Bhavya Dixit, Sunil Kumar Kopparapu","doi":"10.1109/SPCOM55316.2022.9840766","DOIUrl":null,"url":null,"abstract":"It has been observed that the measurement error in the microphone position from a fixed source location affected the room impulse response (RIR). This in turn affects the single-channel close microphone and multi-channel distant microphone speech recognition. Toward this end, we systematically study to identify the optimal location of the microphone, given an approximate and hence erroneous location of the microphone in 3D space. The primary idea is to use Monte-Carlo technique to generate a large number of random microphone positions around the erroneous microphone position and select the microphone position that results in the best performance of a general purpose automatic speech recognition (ASR). We experiment with clean and noisy speech and show that the optimal location of the microphone that achieves the best ASR performance is not only affected by noise characteristics but is also dependent on the SNR of the noise.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPCOM55316.2022.9840766","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
It has been observed that the measurement error in the microphone position from a fixed source location affected the room impulse response (RIR). This in turn affects the single-channel close microphone and multi-channel distant microphone speech recognition. Toward this end, we systematically study to identify the optimal location of the microphone, given an approximate and hence erroneous location of the microphone in 3D space. The primary idea is to use Monte-Carlo technique to generate a large number of random microphone positions around the erroneous microphone position and select the microphone position that results in the best performance of a general purpose automatic speech recognition (ASR). We experiment with clean and noisy speech and show that the optimal location of the microphone that achieves the best ASR performance is not only affected by noise characteristics but is also dependent on the SNR of the noise.