{"title":"基于可重构无线声传感器网络的实时文本和语言无关说话人识别","authors":"M. Bocca, H. Koivo","doi":"10.1109/THS.2008.4534491","DOIUrl":null,"url":null,"abstract":"This paper describes a reconfigurable wireless network of acoustic sensors that records voice signals in different areas of a building and conveys them at the sink node. At their arrival, a light-weight text and language independent algorithm performs the speaker identification task in real time. The end-user can interrupt the normal operation mode of the network and require a signal to a particular node, specifying also sampling frequency and time length of the sampling period. In our simulations, we use a database composed of 200 signals, 60 individuals, and 15 languages. The total execution time is less than 2 seconds. We optimize the parameters of the algorithm, achieving 83% accuracy. We also evaluate its robustness when the sampling frequency and the time length of the signals are reduced. Finally, the power consumption of the operating nodes is analyzed.","PeriodicalId":366416,"journal":{"name":"2008 IEEE Conference on Technologies for Homeland Security","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Real-Time Text and Language Independent Speaker Identification with a Reconfigurable Wireless Network of Acoustic Sensors\",\"authors\":\"M. Bocca, H. Koivo\",\"doi\":\"10.1109/THS.2008.4534491\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes a reconfigurable wireless network of acoustic sensors that records voice signals in different areas of a building and conveys them at the sink node. At their arrival, a light-weight text and language independent algorithm performs the speaker identification task in real time. The end-user can interrupt the normal operation mode of the network and require a signal to a particular node, specifying also sampling frequency and time length of the sampling period. In our simulations, we use a database composed of 200 signals, 60 individuals, and 15 languages. The total execution time is less than 2 seconds. We optimize the parameters of the algorithm, achieving 83% accuracy. We also evaluate its robustness when the sampling frequency and the time length of the signals are reduced. Finally, the power consumption of the operating nodes is analyzed.\",\"PeriodicalId\":366416,\"journal\":{\"name\":\"2008 IEEE Conference on Technologies for Homeland Security\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE Conference on Technologies for Homeland Security\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/THS.2008.4534491\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Conference on Technologies for Homeland Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/THS.2008.4534491","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Real-Time Text and Language Independent Speaker Identification with a Reconfigurable Wireless Network of Acoustic Sensors
This paper describes a reconfigurable wireless network of acoustic sensors that records voice signals in different areas of a building and conveys them at the sink node. At their arrival, a light-weight text and language independent algorithm performs the speaker identification task in real time. The end-user can interrupt the normal operation mode of the network and require a signal to a particular node, specifying also sampling frequency and time length of the sampling period. In our simulations, we use a database composed of 200 signals, 60 individuals, and 15 languages. The total execution time is less than 2 seconds. We optimize the parameters of the algorithm, achieving 83% accuracy. We also evaluate its robustness when the sampling frequency and the time length of the signals are reduced. Finally, the power consumption of the operating nodes is analyzed.