Automatic Music Transcription for the Thai Xylophone Played with Soft Mallets

Apichai Huaysrijan, S. Pongpinigpinyo
Published in: 2022 19th International Joint Conference on Computer Science and Software Engineering (JCSSE)
Publication date: 2022-06-22
DOI: https://doi.org/10.1109/jcsse54890.2022.9836266
Citations: 3

Abstract

Automatic music transcription (AMT) converts audio into music notation and supports music education, music production, and music creation. The Thai xylophone is a Thai classical instrument that is commonly played with one of two types of mallets: soft or hard. This paper studies AMT for the Thai xylophone played with soft mallets. We compare two feature-extraction methods, the Mel-spectrogram and Mel-frequency cepstral coefficients (MFCC), each used with the Onsets and Frames (OaF) deep learning model, which is the state of the art for AMT. The dataset consists of 30 songs played on the Thai xylophone with soft mallets, each accompanied by music notation. The results show that the Mel-spectrogram outperforms MFCC: the Mel-spectrogram with the OaF model performed best, with an F1-score of 87.04% on the frame detector and 94.35% on the onset detector. We also conduct an ablation study.
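The abstract reports its results as F1-scores for the frame and onset detectors. As a rough illustration of that metric only, the sketch below computes precision, recall, and F1 from two binary activation sequences. The function name `f1_score` and the example sequences are hypothetical and are not taken from the paper; the authors' actual evaluation procedure (for example, any onset timing tolerance) is not described in the abstract.

```python
from typing import Sequence, Tuple


def f1_score(reference: Sequence[int], predicted: Sequence[int]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for two equal-length binary sequences."""
    if len(reference) != len(predicted):
        raise ValueError("sequences must have the same length")
    # Count true positives, false positives, and false negatives element by element.
    tp = sum(1 for r, p in zip(reference, predicted) if r == 1 and p == 1)
    fp = sum(1 for r, p in zip(reference, predicted) if r == 0 and p == 1)
    fn = sum(1 for r, p in zip(reference, predicted) if r == 1 and p == 0)
    # Guard against division by zero when nothing is predicted or nothing is active.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Hypothetical frame activations for a single pitch (1 = note sounding in that frame).
reference_frames = [0, 1, 1, 1, 0, 0, 1, 1]
predicted_frames = [0, 1, 1, 0, 0, 1, 1, 1]
precision, recall, f1 = f1_score(reference_frames, predicted_frames)
print(f"precision={precision:.2f} recall={recall:.2f} F1={f1:.2f}")
```

In this toy case four frames are correctly detected, one is a false alarm, and one is missed, so precision, recall, and F1 all come out at 0.80.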