{"title":"基于深度尖峰神经网络的钢琴声音鲁棒多音高估计","authors":"Hanxiao Qian, Pengjie Gu, Rui Yan, Huajin Tang","doi":"10.1109/SSCI44817.2019.9003037","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a robust multi-label classification system based on deep spiking neural networks to handle multi-pitch estimation tasks. We employ constantQ transform spectrogram as a time-frequency representation. A keypoint detection technique is used for noise suppression and the extraction of relevant information. We also propose a novel biological spiking coding method that fits the expression of musical signals. This coding method can encode time, frequency, intensity information into spatiotemporal spike trains. And the spatio-temporal credit assignment (STCA) algorithm is used to train deep spiking neural networks. We perform the multipitch evaluation on the MAPS data set, and our work compares with the state-of-the-art methods by using the F1-score metric. Experimental results show that the proposed scheme has achieved better performance than other state-of-the-art methods and reveal the system’s robustness to environmental noise.","PeriodicalId":6729,"journal":{"name":"2019 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"6 1","pages":"2335-2341"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Robust Multipitch Estimation of Piano Sounds Using Deep Spiking Neural Networks\",\"authors\":\"Hanxiao Qian, Pengjie Gu, Rui Yan, Huajin Tang\",\"doi\":\"10.1109/SSCI44817.2019.9003037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a robust multi-label classification system based on deep spiking neural networks to handle multi-pitch estimation tasks. We employ constantQ transform spectrogram as a time-frequency representation. A keypoint detection technique is used for noise suppression and the extraction of relevant information. We also propose a novel biological spiking coding method that fits the expression of musical signals. This coding method can encode time, frequency, intensity information into spatiotemporal spike trains. And the spatio-temporal credit assignment (STCA) algorithm is used to train deep spiking neural networks. We perform the multipitch evaluation on the MAPS data set, and our work compares with the state-of-the-art methods by using the F1-score metric. Experimental results show that the proposed scheme has achieved better performance than other state-of-the-art methods and reveal the system’s robustness to environmental noise.\",\"PeriodicalId\":6729,\"journal\":{\"name\":\"2019 IEEE Symposium Series on Computational Intelligence (SSCI)\",\"volume\":\"6 1\",\"pages\":\"2335-2341\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE Symposium Series on Computational Intelligence (SSCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SSCI44817.2019.9003037\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Symposium Series on Computational Intelligence (SSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSCI44817.2019.9003037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Robust Multipitch Estimation of Piano Sounds Using Deep Spiking Neural Networks
In this paper, we propose a robust multi-label classification system based on deep spiking neural networks to handle multi-pitch estimation tasks. We employ constantQ transform spectrogram as a time-frequency representation. A keypoint detection technique is used for noise suppression and the extraction of relevant information. We also propose a novel biological spiking coding method that fits the expression of musical signals. This coding method can encode time, frequency, intensity information into spatiotemporal spike trains. And the spatio-temporal credit assignment (STCA) algorithm is used to train deep spiking neural networks. We perform the multipitch evaluation on the MAPS data set, and our work compares with the state-of-the-art methods by using the F1-score metric. Experimental results show that the proposed scheme has achieved better performance than other state-of-the-art methods and reveal the system’s robustness to environmental noise.