改善社交互动机器人信噪比的仿生听觉定位

2006 IEEE/RSJ International Conference on Intelligent Robots and Systems Pub Date : 2006-10-09 DOI:10.1109/IROS.2006.281855

J. Murray, S. Wermter, H. Erwin

{"title":"改善社交互动机器人信噪比的仿生听觉定位","authors":"J. Murray, S. Wermter, H. Erwin","doi":"10.1109/IROS.2006.281855","DOIUrl":null,"url":null,"abstract":"In this paper we describe a bioinspired hybrid architecture for acoustic sound source localisation and tracking to increase the signal to noise ratio (SNR) between speaker and background sources for a socially interactive robot's speech recogniser system. The model presented incorporates the use of interaural time difference for azimuth estimation and recurrent neural networks for trajectory prediction. The results are then presented showing the difference in the SNR of a localised and non-localised speaker source, in addition to presenting the recognition rates between a localised and non-localised speaker source. From the results presented in this paper it can be seen that by orientating towards the sound source of interest the recognition rates of that source can be increased","PeriodicalId":237562,"journal":{"name":"2006 IEEE/RSJ International Conference on Intelligent Robots and Systems","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Bioinspired Auditory Sound Localisation for Improving the Signal to Noise Ratio of Socially Interactive Robots\",\"authors\":\"J. Murray, S. Wermter, H. Erwin\",\"doi\":\"10.1109/IROS.2006.281855\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we describe a bioinspired hybrid architecture for acoustic sound source localisation and tracking to increase the signal to noise ratio (SNR) between speaker and background sources for a socially interactive robot's speech recogniser system. The model presented incorporates the use of interaural time difference for azimuth estimation and recurrent neural networks for trajectory prediction. The results are then presented showing the difference in the SNR of a localised and non-localised speaker source, in addition to presenting the recognition rates between a localised and non-localised speaker source. From the results presented in this paper it can be seen that by orientating towards the sound source of interest the recognition rates of that source can be increased\",\"PeriodicalId\":237562,\"journal\":{\"name\":\"2006 IEEE/RSJ International Conference on Intelligent Robots and Systems\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 IEEE/RSJ International Conference on Intelligent Robots and Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IROS.2006.281855\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE/RSJ International Conference on Intelligent Robots and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IROS.2006.281855","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

摘要

在本文中，我们描述了一种生物启发的混合架构，用于声学声源定位和跟踪，以增加社交互动机器人语音识别系统中扬声器和背景源之间的信噪比(SNR)。该模型结合了利用间时差进行方位角估计和利用递归神经网络进行弹道预测。结果显示了局部化和非局部化说话源的信噪比差异，以及局部化和非局部化说话源之间的识别率。从本文给出的结果可以看出，通过对感兴趣的声源进行定向，可以提高该声源的识别率

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Bioinspired Auditory Sound Localisation for Improving the Signal to Noise Ratio of Socially Interactive Robots

In this paper we describe a bioinspired hybrid architecture for acoustic sound source localisation and tracking to increase the signal to noise ratio (SNR) between speaker and background sources for a socially interactive robot's speech recogniser system. The model presented incorporates the use of interaural time difference for azimuth estimation and recurrent neural networks for trajectory prediction. The results are then presented showing the difference in the SNR of a localised and non-localised speaker source, in addition to presenting the recognition rates between a localised and non-localised speaker source. From the results presented in this paper it can be seen that by orientating towards the sound source of interest the recognition rates of that source can be increased

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2006 IEEE/RSJ International Conference on Intelligent Robots and Systems

自引率

0.00%

发文量