基于solfa名字识别的脱机视唱乐谱跟踪中的正则化DTW

Rongfeng Li, Kuoxi Yu
{"title":"基于solfa名字识别的脱机视唱乐谱跟踪中的正则化DTW","authors":"Rongfeng Li, Kuoxi Yu","doi":"10.1109/ICMEW56448.2022.9859398","DOIUrl":null,"url":null,"abstract":"The automatic scoring of singing evaluation is a hot topic in recent years. Improving the score following effect is the first step to improve the accuracy of evaluation. Most of the commonly used methods are based on DTW, but for audios with low singing quality and inaccurate pitch, DTW often predicts the onset incorrectly. In order to solve the above problems, this paper focus on the offline following, mainly improves from two aspects: 1. Sol-fa name recognition is done before pitch tracking as preprocess. We cannot guarantee that the pitch of the singer is correct, but we can assume that the singer pronounces the sol-fa name correctly, so we use sol-fa name recognition as preprocessing; 2. Regularized DTW is proposed based on the basis of sol-fa name recognition. The results show that for general audio, under the condition of a tolerance of 20ms, compared with about 86% accuracy of ordinary DTW algorithm, our algorithm has improved to about 92%, while the average error of predicted notes is reduced by about 23ms. For audio with low signal-to-noise ratio and unstable voice frequency, the alignment effect is improved by about 20% compared with ordinary DTW.","PeriodicalId":106759,"journal":{"name":"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Regularized DTW in Offline Music Score-Following for Sight-Singing Based on Sol-fa Name Recognition\",\"authors\":\"Rongfeng Li, Kuoxi Yu\",\"doi\":\"10.1109/ICMEW56448.2022.9859398\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The automatic scoring of singing evaluation is a hot topic in recent years. Improving the score following effect is the first step to improve the accuracy of evaluation. Most of the commonly used methods are based on DTW, but for audios with low singing quality and inaccurate pitch, DTW often predicts the onset incorrectly. In order to solve the above problems, this paper focus on the offline following, mainly improves from two aspects: 1. Sol-fa name recognition is done before pitch tracking as preprocess. We cannot guarantee that the pitch of the singer is correct, but we can assume that the singer pronounces the sol-fa name correctly, so we use sol-fa name recognition as preprocessing; 2. Regularized DTW is proposed based on the basis of sol-fa name recognition. The results show that for general audio, under the condition of a tolerance of 20ms, compared with about 86% accuracy of ordinary DTW algorithm, our algorithm has improved to about 92%, while the average error of predicted notes is reduced by about 23ms. For audio with low signal-to-noise ratio and unstable voice frequency, the alignment effect is improved by about 20% compared with ordinary DTW.\",\"PeriodicalId\":106759,\"journal\":{\"name\":\"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMEW56448.2022.9859398\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMEW56448.2022.9859398","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

歌唱评价自动评分是近年来的研究热点。提高分数跟踪效果是提高评价准确性的第一步。常用的方法大多基于DTW,但对于演唱质量低、音高不准确的音频,DTW往往无法准确预测起跳。为了解决以上问题,本文重点对线下进行了如下改进,主要从两个方面进行了改进:1.线下;球场名称识别是在球场跟踪之前进行的预处理。我们不能保证演唱者的音高是正确的,但我们可以假设演唱者正确地读出了sol-fa的名字,所以我们使用sol-fa的名字识别作为预处理;2. 在sol-fa名称识别的基础上,提出了正则化DTW。结果表明,对于一般音频,在容差为20ms的情况下,与普通DTW算法86%的准确率相比,我们的算法提高到了92%左右,预测音符的平均误差降低了23ms左右。对于信噪比低、话音频率不稳定的音频,与普通DTW相比,对准效果提高了20%左右。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Regularized DTW in Offline Music Score-Following for Sight-Singing Based on Sol-fa Name Recognition
The automatic scoring of singing evaluation is a hot topic in recent years. Improving the score following effect is the first step to improve the accuracy of evaluation. Most of the commonly used methods are based on DTW, but for audios with low singing quality and inaccurate pitch, DTW often predicts the onset incorrectly. In order to solve the above problems, this paper focus on the offline following, mainly improves from two aspects: 1. Sol-fa name recognition is done before pitch tracking as preprocess. We cannot guarantee that the pitch of the singer is correct, but we can assume that the singer pronounces the sol-fa name correctly, so we use sol-fa name recognition as preprocessing; 2. Regularized DTW is proposed based on the basis of sol-fa name recognition. The results show that for general audio, under the condition of a tolerance of 20ms, compared with about 86% accuracy of ordinary DTW algorithm, our algorithm has improved to about 92%, while the average error of predicted notes is reduced by about 23ms. For audio with low signal-to-noise ratio and unstable voice frequency, the alignment effect is improved by about 20% compared with ordinary DTW.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信