Determining a Suitable Desired Factors for Nonnegative Matrix Factorization in Polyphonic Music Transcription
Authors: S. Sophea, S. Phon-Amnuaisuk
Published in: 2007 International Symposium on Information Technology Convergence (ISITC 2007), 2007-11-23
DOI: 10.1109/ISITC.2007.41 (https://doi.org/10.1109/ISITC.2007.41)
Nonnegative matrix factorization (NMF) is a technique that factorizes a given matrix $V$ into $W$ and $H$: $\mathrm{NMF}(V_{m \times n}) \rightarrow W_{m \times r} H_{r \times n}$. NMF has been applied to various problem domains. So far, there has been no fixed rule for selecting the number of NMF factors (i.e., the desired factor $r$). In music transcription, $r$ corresponds to the desired number of musical lines. If $r$ is not set correctly, the transcription output will not be accurate. In this paper, we present a tactic to get around this issue. We found that by concatenating an input audio stream with another audio stream for which $r$ is known in advance, where $r \geq$ the number of musical lines, we are guaranteed to get the correct number of musical lines in the transcription output. In our approach, NMF is applied to extract musical notes from FFT coefficients computed from the input polyphonic audio streams. The results of our experiment are encouraging, and we hope to explore this approach further in the future.
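The concatenation idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the NMF algorithm, the contents of the second stream, or how the streams are joined. The sketch assumes Lee-Seung multiplicative updates, a toy binary "spectrogram" in place of real FFT magnitudes, and concatenation along the time axis. The names `nmf` and `toy_spectrogram`, the bin indices, and the choice r = 3 are all hypothetical.

```python
import numpy as np


def nmf(V, r, n_iter=500, eps=1e-9, seed=0):
    """Factorize a nonnegative V (m x n) into W (m x r) and H (r x n).

    Uses Lee-Seung multiplicative updates for the Frobenius-norm
    objective; this is an assumed choice, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    m, n = V.shape
    W = rng.random((m, r)) + eps
    H = rng.random((r, n)) + eps
    for _ in range(n_iter):
        # eps keeps the denominators strictly positive.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def toy_spectrogram(active_bins, n_bins=32):
    """Hypothetical stand-in for FFT magnitudes.

    Each entry of active_bins lists the frequency bins that carry
    energy in one time frame; the result has one column per frame.
    """
    V = np.zeros((n_bins, len(active_bins)))
    for t, bins in enumerate(active_bins):
        V[list(bins), t] = 1.0
    return V


# Input stream: two "notes" (bins 3 and 7), sometimes sounding together.
V_input = toy_spectrogram([(3,), (7,), (3, 7), (3,), (7,), (3, 7)])

# Second stream with known content: three notes played one at a time,
# so r = 3 is known in advance and r >= the number of input lines.
V_ref = toy_spectrogram([(3,), (7,), (12,), (3,), (7,), (12,)])
r = 3

# Concatenate along the time axis (an assumption) and factorize once.
V = np.hstack([V_input, V_ref])
W, H = nmf(V, r)

# Activations belonging to the original input stream only.
H_input = H[:, : V_input.shape[1]]

rel_error = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
print(f"W: {W.shape}, H: {H.shape}, relative error: {rel_error:.4f}")
```

In this sketch the columns of `W` play the role of spectral templates and the rows of `H_input` their activations over the input frames. Whether the known stream should contain isolated notes, as assumed here, is not stated in the abstract.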