基于可重构无线声传感器网络的实时文本和语言无关说话人识别

2008 IEEE Conference on Technologies for Homeland Security Pub Date : 2008-05-12 DOI:10.1109/THS.2008.4534491

M. Bocca, H. Koivo

{"title":"基于可重构无线声传感器网络的实时文本和语言无关说话人识别","authors":"M. Bocca, H. Koivo","doi":"10.1109/THS.2008.4534491","DOIUrl":null,"url":null,"abstract":"This paper describes a reconfigurable wireless network of acoustic sensors that records voice signals in different areas of a building and conveys them at the sink node. At their arrival, a light-weight text and language independent algorithm performs the speaker identification task in real time. The end-user can interrupt the normal operation mode of the network and require a signal to a particular node, specifying also sampling frequency and time length of the sampling period. In our simulations, we use a database composed of 200 signals, 60 individuals, and 15 languages. The total execution time is less than 2 seconds. We optimize the parameters of the algorithm, achieving 83% accuracy. We also evaluate its robustness when the sampling frequency and the time length of the signals are reduced. Finally, the power consumption of the operating nodes is analyzed.","PeriodicalId":366416,"journal":{"name":"2008 IEEE Conference on Technologies for Homeland Security","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Real-Time Text and Language Independent Speaker Identification with a Reconfigurable Wireless Network of Acoustic Sensors\",\"authors\":\"M. Bocca, H. Koivo\",\"doi\":\"10.1109/THS.2008.4534491\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes a reconfigurable wireless network of acoustic sensors that records voice signals in different areas of a building and conveys them at the sink node. At their arrival, a light-weight text and language independent algorithm performs the speaker identification task in real time. The end-user can interrupt the normal operation mode of the network and require a signal to a particular node, specifying also sampling frequency and time length of the sampling period. In our simulations, we use a database composed of 200 signals, 60 individuals, and 15 languages. The total execution time is less than 2 seconds. We optimize the parameters of the algorithm, achieving 83% accuracy. We also evaluate its robustness when the sampling frequency and the time length of the signals are reduced. Finally, the power consumption of the operating nodes is analyzed.\",\"PeriodicalId\":366416,\"journal\":{\"name\":\"2008 IEEE Conference on Technologies for Homeland Security\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE Conference on Technologies for Homeland Security\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/THS.2008.4534491\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Conference on Technologies for Homeland Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/THS.2008.4534491","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

本文描述了一种可重构的声传感器无线网络，该网络可记录建筑物不同区域的语音信号并在汇聚节点上传输。在他们到达时，一个轻量级的文本和语言无关的算法实时执行说话人识别任务。终端用户可以中断网络的正常运行模式，并要求向特定节点发送信号，同时指定采样频率和采样周期的时间长度。在我们的模拟中，我们使用了一个由200个信号、60个个体和15种语言组成的数据库。总执行时间小于2秒。对算法参数进行了优化，准确率达到83%。在降低信号采样频率和时间长度的情况下，对其鲁棒性进行了评价。最后，对运行节点的功耗进行了分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Real-Time Text and Language Independent Speaker Identification with a Reconfigurable Wireless Network of Acoustic Sensors

This paper describes a reconfigurable wireless network of acoustic sensors that records voice signals in different areas of a building and conveys them at the sink node. At their arrival, a light-weight text and language independent algorithm performs the speaker identification task in real time. The end-user can interrupt the normal operation mode of the network and require a signal to a particular node, specifying also sampling frequency and time length of the sampling period. In our simulations, we use a database composed of 200 signals, 60 individuals, and 15 languages. The total execution time is less than 2 seconds. We optimize the parameters of the algorithm, achieving 83% accuracy. We also evaluate its robustness when the sampling frequency and the time length of the signals are reduced. Finally, the power consumption of the operating nodes is analyzed.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2008 IEEE Conference on Technologies for Homeland Security

自引率

0.00%

发文量