Andi Rokhman Hermawan, E. M. Yuniarno, D. Wulandari
{"title":"Gamelan Demung Music Transcription Based on STFT Using Deep Learning","authors":"Andi Rokhman Hermawan, E. M. Yuniarno, D. Wulandari","doi":"10.12962/jaree.v6i2.276","DOIUrl":null,"url":null,"abstract":"Learning to play a gamelan instrument would be easier when there’s a musical notation guide. The process of converting a musical signal into a notation guide is called transcription. In this paper, we would like to transcript the gamelan music especially the Demung instrument using the Deep Learning method. Each Demung’s note from 6-low until 1-high would be converted to the time-frequency domain using STFT (Short-Time Fourier Transform). Then, those data will be treated as an input for the multilayers perceptron. The training method is a single label of each notation. The output returned by the model is a music roll transcription.","PeriodicalId":32708,"journal":{"name":"JAREE Journal on Advanced Research in Electrical Engineering","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAREE Journal on Advanced Research in Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12962/jaree.v6i2.276","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Learning to play a gamelan instrument would be easier when there’s a musical notation guide. The process of converting a musical signal into a notation guide is called transcription. In this paper, we would like to transcript the gamelan music especially the Demung instrument using the Deep Learning method. Each Demung’s note from 6-low until 1-high would be converted to the time-frequency domain using STFT (Short-Time Fourier Transform). Then, those data will be treated as an input for the multilayers perceptron. The training method is a single label of each notation. The output returned by the model is a music roll transcription.