Gamelan Onset Detection Based on DWPT and BLSTM

Hisyam Mustofa, A. E. Putra
Journal: IJEIS Indonesian Journal of Electronics and Instrumentation Systems
Publication date: 2023-04-30
Publication type: Journal Article
DOI: https://doi.org/10.22146/ijeis.79534
Citations: 0

Abstract

Original title: Deteksi Onset Gamelan Bebasis DWPT dan BLSTM
A gamelan ensemble consists of instruments with differing characteristics: fundamental frequency, amplitude, signal envelope, and playing technique all vary, so the sustain of the signal differs from one instrument to another. In previous studies, these characteristics caused a vanishing gradient problem in the Elman network model used for onset detection on Saron signals, whose onsets are on average more than 0.6 seconds apart. This study uses a BLSTM (Bidirectional Long Short-Term Memory) network as the trained model and the Wavelet Packet Transform to construct psychoacoustic critical bands for feature extraction. Peak picking uses a fixed threshold of 0.25. The BLSTM model, supported by the Wavelet Packet Transform, is expected to overcome the vanishing gradient found in a simple RNN architecture. The model was evaluated on three metrics: precision, recall, and F-measure. In the test scenario, the model overcame the vanishing gradient problem on the Saron instrument, which has an average inter-onset interval of 600 ms. Of 428 onsets on the Saron instrument, the model detected 426 correctly, with 4 false detections and 2 missed onsets. The overall evaluation yielded a precision, recall, and F-measure of 0.975, 0.945, and 0.960, respectively.
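
The peak-picking and scoring steps mentioned in the abstract can be illustrated with a minimal sketch. Only the threshold of 0.25 and the Saron counts (426 correct, 4 false, 2 missed) come from the abstract. The local-maximum rule, the function names, and the toy activation values are assumptions made for illustration and are not taken from the paper.

```python
from typing import List, Sequence, Tuple

THRESHOLD = 0.25  # fixed threshold stated in the abstract


def pick_onsets(activation: Sequence[float], threshold: float = THRESHOLD) -> List[int]:
    """Return frame indices that are local maxima at or above the threshold.

    The local-maximum condition is an assumption; the abstract only says
    that a fixed threshold of 0.25 is used.
    """
    peaks = []
    for i in range(1, len(activation) - 1):
        value = activation[i]
        if value >= threshold and value > activation[i - 1] and value >= activation[i + 1]:
            peaks.append(i)
    return peaks


def precision_recall_f1(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Compute precision, recall, and F-measure from detection counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy onset activation (hypothetical network output, one value per frame).
activation = [0.02, 0.10, 0.80, 0.30, 0.05, 0.20, 0.24, 0.10, 0.05, 0.60, 0.90, 0.40]
onset_frames = pick_onsets(activation)

# Saron counts from the abstract: 426 correct, 4 false detections, 2 missed.
saron_scores = precision_recall_f1(tp=426, fp=4, fn=2)

print(onset_frames)
print(tuple(round(score, 3) for score in saron_scores))
```

Scores computed from the Saron counts alone come out higher than the overall figures of 0.975, 0.945, and 0.960, which suggests the overall figures cover more than the Saron test set; the abstract does not say so explicitly. The reported F-measure of 0.960 is consistent with the harmonic mean of the reported precision and recall.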