基于深度尖峰神经网络的钢琴声音鲁棒多音高估计

2019 IEEE Symposium Series on Computational Intelligence (SSCI) Pub Date : 2019-12-01 DOI:10.1109/SSCI44817.2019.9003037

Hanxiao Qian, Pengjie Gu, Rui Yan, Huajin Tang

{"title":"基于深度尖峰神经网络的钢琴声音鲁棒多音高估计","authors":"Hanxiao Qian, Pengjie Gu, Rui Yan, Huajin Tang","doi":"10.1109/SSCI44817.2019.9003037","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a robust multi-label classification system based on deep spiking neural networks to handle multi-pitch estimation tasks. We employ constantQ transform spectrogram as a time-frequency representation. A keypoint detection technique is used for noise suppression and the extraction of relevant information. We also propose a novel biological spiking coding method that fits the expression of musical signals. This coding method can encode time, frequency, intensity information into spatiotemporal spike trains. And the spatio-temporal credit assignment (STCA) algorithm is used to train deep spiking neural networks. We perform the multipitch evaluation on the MAPS data set, and our work compares with the state-of-the-art methods by using the F1-score metric. Experimental results show that the proposed scheme has achieved better performance than other state-of-the-art methods and reveal the system’s robustness to environmental noise.","PeriodicalId":6729,"journal":{"name":"2019 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"6 1","pages":"2335-2341"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Robust Multipitch Estimation of Piano Sounds Using Deep Spiking Neural Networks\",\"authors\":\"Hanxiao Qian, Pengjie Gu, Rui Yan, Huajin Tang\",\"doi\":\"10.1109/SSCI44817.2019.9003037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a robust multi-label classification system based on deep spiking neural networks to handle multi-pitch estimation tasks. We employ constantQ transform spectrogram as a time-frequency representation. A keypoint detection technique is used for noise suppression and the extraction of relevant information. We also propose a novel biological spiking coding method that fits the expression of musical signals. This coding method can encode time, frequency, intensity information into spatiotemporal spike trains. And the spatio-temporal credit assignment (STCA) algorithm is used to train deep spiking neural networks. We perform the multipitch evaluation on the MAPS data set, and our work compares with the state-of-the-art methods by using the F1-score metric. Experimental results show that the proposed scheme has achieved better performance than other state-of-the-art methods and reveal the system’s robustness to environmental noise.\",\"PeriodicalId\":6729,\"journal\":{\"name\":\"2019 IEEE Symposium Series on Computational Intelligence (SSCI)\",\"volume\":\"6 1\",\"pages\":\"2335-2341\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE Symposium Series on Computational Intelligence (SSCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SSCI44817.2019.9003037\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Symposium Series on Computational Intelligence (SSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSCI44817.2019.9003037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

本文提出了一种基于深度尖峰神经网络的鲁棒多标签分类系统来处理多基音估计任务。我们采用常数q变换谱图作为时频表示。采用关键点检测技术进行噪声抑制和相关信息的提取。我们还提出了一种新的适合音乐信号表达的生物尖峰编码方法。这种编码方法可以将时间、频率、强度信息编码成时空尖峰序列。利用时空信用分配(STCA)算法训练深度尖峰神经网络。我们对MAPS数据集进行了多音高评估，并通过使用f1评分指标将我们的工作与最先进的方法进行了比较。实验结果表明，该方法取得了比现有方法更好的性能，并显示了系统对环境噪声的鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Robust Multipitch Estimation of Piano Sounds Using Deep Spiking Neural Networks

In this paper, we propose a robust multi-label classification system based on deep spiking neural networks to handle multi-pitch estimation tasks. We employ constantQ transform spectrogram as a time-frequency representation. A keypoint detection technique is used for noise suppression and the extraction of relevant information. We also propose a novel biological spiking coding method that fits the expression of musical signals. This coding method can encode time, frequency, intensity information into spatiotemporal spike trains. And the spatio-temporal credit assignment (STCA) algorithm is used to train deep spiking neural networks. We perform the multipitch evaluation on the MAPS data set, and our work compares with the state-of-the-art methods by using the F1-score metric. Experimental results show that the proposed scheme has achieved better performance than other state-of-the-art methods and reveal the system’s robustness to environmental noise.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE Symposium Series on Computational Intelligence (SSCI)

自引率

0.00%

发文量