{"title":"基于梯度下降优化线性回归的单耳语音分离","authors":"Belhedi Wiem, M. B. Messaoud, A. Bouzid","doi":"10.1109/ATSIP49331.2020.9231542","DOIUrl":null,"url":null,"abstract":"Monaural speech separation (MSS) is useful for many real-world applications. In this work, we propose a novel method for MSS based on the observation that a composite speech signals can be modeled as the linear summation of each speaker with respect to participation coefficients. Hence, speech signals are separated using linear regression. Partial derivative with respect to each variable is then used to perform gradient descent in order to optimize the estimation and therefore the separation. The proposed speech separation method for is applicable to known speakers.The proposed method was assessed using metrics characterized by good correlation coefficients with subjective listening tests. Evaluation results reveal the effectiveness of the proposed approach.","PeriodicalId":384018,"journal":{"name":"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Monaural speech separation based on linear regression optimized using gradient descent\",\"authors\":\"Belhedi Wiem, M. B. Messaoud, A. Bouzid\",\"doi\":\"10.1109/ATSIP49331.2020.9231542\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Monaural speech separation (MSS) is useful for many real-world applications. In this work, we propose a novel method for MSS based on the observation that a composite speech signals can be modeled as the linear summation of each speaker with respect to participation coefficients. Hence, speech signals are separated using linear regression. Partial derivative with respect to each variable is then used to perform gradient descent in order to optimize the estimation and therefore the separation. The proposed speech separation method for is applicable to known speakers.The proposed method was assessed using metrics characterized by good correlation coefficients with subjective listening tests. Evaluation results reveal the effectiveness of the proposed approach.\",\"PeriodicalId\":384018,\"journal\":{\"name\":\"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ATSIP49331.2020.9231542\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ATSIP49331.2020.9231542","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Monaural speech separation based on linear regression optimized using gradient descent
Monaural speech separation (MSS) is useful for many real-world applications. In this work, we propose a novel method for MSS based on the observation that a composite speech signals can be modeled as the linear summation of each speaker with respect to participation coefficients. Hence, speech signals are separated using linear regression. Partial derivative with respect to each variable is then used to perform gradient descent in order to optimize the estimation and therefore the separation. The proposed speech separation method for is applicable to known speakers.The proposed method was assessed using metrics characterized by good correlation coefficients with subjective listening tests. Evaluation results reveal the effectiveness of the proposed approach.