{"title":"基于TESPAR s -矩阵和人工神经网络的野外入侵检测音频信号量化效应","authors":"L. Grama, C. Rusu, G. Oltean, L. Ivanciu","doi":"10.1109/SPED.2015.7343079","DOIUrl":null,"url":null,"abstract":"This paper analyses the influence of quantization of audio signals on the Time Encoding Signal Processing and Recognition S-matrix, in order to detect and classify intruders in wildlife areas. The intruder classification is performed with multilayer feed-forward neural networks. The databases involved in this work consist of 640 waveforms of audio signals originated from 4 different types of sources. The experimental results proves that in the proposed audio based wildlife intruder detection framework, the overall correct classification rates remain very high even if the number of bits used for quantization decreases from 16 to 4.","PeriodicalId":426074,"journal":{"name":"2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Quantization effects on audio signals for detecting intruders in wild areas using TESPAR S-matrix and artificial neural networks\",\"authors\":\"L. Grama, C. Rusu, G. Oltean, L. Ivanciu\",\"doi\":\"10.1109/SPED.2015.7343079\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper analyses the influence of quantization of audio signals on the Time Encoding Signal Processing and Recognition S-matrix, in order to detect and classify intruders in wildlife areas. The intruder classification is performed with multilayer feed-forward neural networks. The databases involved in this work consist of 640 waveforms of audio signals originated from 4 different types of sources. The experimental results proves that in the proposed audio based wildlife intruder detection framework, the overall correct classification rates remain very high even if the number of bits used for quantization decreases from 16 to 4.\",\"PeriodicalId\":426074,\"journal\":{\"name\":\"2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPED.2015.7343079\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPED.2015.7343079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Quantization effects on audio signals for detecting intruders in wild areas using TESPAR S-matrix and artificial neural networks
This paper analyses the influence of quantization of audio signals on the Time Encoding Signal Processing and Recognition S-matrix, in order to detect and classify intruders in wildlife areas. The intruder classification is performed with multilayer feed-forward neural networks. The databases involved in this work consist of 640 waveforms of audio signals originated from 4 different types of sources. The experimental results proves that in the proposed audio based wildlife intruder detection framework, the overall correct classification rates remain very high even if the number of bits used for quantization decreases from 16 to 4.