复调音乐转录中非负矩阵分解的合适期望因子的确定

2007 International Symposium on Information Technology Convergence (ISITC 2007) Pub Date : 2007-11-23 DOI:10.1109/ISITC.2007.41

S. Sophea, S. Phon-Amnuaisuk

{"title":"复调音乐转录中非负矩阵分解的合适期望因子的确定","authors":"S. Sophea, S. Phon-Amnuaisuk","doi":"10.1109/ISITC.2007.41","DOIUrl":null,"url":null,"abstract":"Nonnegative matrix factorization (NMF) is a technique that factorizes a given matrix V into W and H: NMF(Vmn )rarrWmr Hrn . NMF has been applied to various problem domains. So far there has been no fixed rule to select the number of NMF factoring (i.e. the desired factor r). In music transcription, the value r has the semantic of the desired musical line. If the r is not set correctly, the transcription output will not be accurate. In this paper, we present a tactic to get around this issue. We found that by concatenating an input audio stream with another audio stream with prior knowledge of r, where r ges the number of musical lines; we are guaranteed to get the correct number of musical lines in the transcription output. In our approach, NMF is applied to extract musical notes from FFT coefficients extracted from input polyphonic audio streams. The output from our experiment is encouraging and we hope to explore this further in the future.","PeriodicalId":394071,"journal":{"name":"2007 International Symposium on Information Technology Convergence (ISITC 2007)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Determining a Suitable Desired Factors for Nonnegative Matrix Factorization in Polyphonic Music Transcription\",\"authors\":\"S. Sophea, S. Phon-Amnuaisuk\",\"doi\":\"10.1109/ISITC.2007.41\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nonnegative matrix factorization (NMF) is a technique that factorizes a given matrix V into W and H: NMF(Vmn )rarrWmr Hrn . NMF has been applied to various problem domains. So far there has been no fixed rule to select the number of NMF factoring (i.e. the desired factor r). In music transcription, the value r has the semantic of the desired musical line. If the r is not set correctly, the transcription output will not be accurate. In this paper, we present a tactic to get around this issue. We found that by concatenating an input audio stream with another audio stream with prior knowledge of r, where r ges the number of musical lines; we are guaranteed to get the correct number of musical lines in the transcription output. In our approach, NMF is applied to extract musical notes from FFT coefficients extracted from input polyphonic audio streams. The output from our experiment is encouraging and we hope to explore this further in the future.\",\"PeriodicalId\":394071,\"journal\":{\"name\":\"2007 International Symposium on Information Technology Convergence (ISITC 2007)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-11-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 International Symposium on Information Technology Convergence (ISITC 2007)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISITC.2007.41\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 International Symposium on Information Technology Convergence (ISITC 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISITC.2007.41","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

摘要

非负矩阵分解(NMF)是一种将给定矩阵V分解为W和H的技术:NMF(Vmn)rarrWmr Hrn。NMF已被应用于各种问题领域。到目前为止，还没有固定的规则来选择NMF因子的数量(即期望因子r)。在音乐转录中，r的值具有期望的音乐线的语义。如果r设置不正确，转录输出将不准确。在本文中，我们提出了一种策略来解决这个问题。我们发现，通过将输入音频流与另一个具有先验知识r的音频流连接起来，其中r是音乐线的数量;我们保证在转录输出中得到正确的音乐行数。在我们的方法中，NMF被应用于从输入复调音频流中提取的FFT系数中提取音符。我们的实验结果令人鼓舞，我们希望在未来进一步探索。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Determining a Suitable Desired Factors for Nonnegative Matrix Factorization in Polyphonic Music Transcription

Nonnegative matrix factorization (NMF) is a technique that factorizes a given matrix V into W and H: NMF(Vmn )rarrWmr Hrn . NMF has been applied to various problem domains. So far there has been no fixed rule to select the number of NMF factoring (i.e. the desired factor r). In music transcription, the value r has the semantic of the desired musical line. If the r is not set correctly, the transcription output will not be accurate. In this paper, we present a tactic to get around this issue. We found that by concatenating an input audio stream with another audio stream with prior knowledge of r, where r ges the number of musical lines; we are guaranteed to get the correct number of musical lines in the transcription output. In our approach, NMF is applied to extract musical notes from FFT coefficients extracted from input polyphonic audio streams. The output from our experiment is encouraging and we hope to explore this further in the future.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2007 International Symposium on Information Technology Convergence (ISITC 2007)

自引率

0.00%

发文量