{"title":"基于典型相关分析的跨性别语音变形","authors":"I. Baseer, Rabeea Basir","doi":"10.1109/C-CODE.2017.7918947","DOIUrl":null,"url":null,"abstract":"Voice morphing one of the speech synthesis frameworks, in simplest term aim to transforms speaker's identity from source to target speaker while preserving the original content of message. This paper presents a novel spectral envelope mapping algorithm based on Canonical Correlation Analysis(CCA) that find the association between spectral envelope characteristics of source speaker and target speaker in terms of correlation as a similarity metric. Moreover, the speech also undergoes to prosodic modification using PSOLA as pitch frequency is also an important parameter for varying identity. This morphing algorithm is evaluated by taking the utterances from freely available CMU-ARCTIC speech dataset. The subjective experiment shows that the proposed method successfully transforms speaker identity and produced high-quality morphed signal.","PeriodicalId":344222,"journal":{"name":"2017 International Conference on Communication, Computing and Digital Systems (C-CODE)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cross gender voice morphing using Canonical Correlation Analysis\",\"authors\":\"I. Baseer, Rabeea Basir\",\"doi\":\"10.1109/C-CODE.2017.7918947\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Voice morphing one of the speech synthesis frameworks, in simplest term aim to transforms speaker's identity from source to target speaker while preserving the original content of message. This paper presents a novel spectral envelope mapping algorithm based on Canonical Correlation Analysis(CCA) that find the association between spectral envelope characteristics of source speaker and target speaker in terms of correlation as a similarity metric. Moreover, the speech also undergoes to prosodic modification using PSOLA as pitch frequency is also an important parameter for varying identity. This morphing algorithm is evaluated by taking the utterances from freely available CMU-ARCTIC speech dataset. The subjective experiment shows that the proposed method successfully transforms speaker identity and produced high-quality morphed signal.\",\"PeriodicalId\":344222,\"journal\":{\"name\":\"2017 International Conference on Communication, Computing and Digital Systems (C-CODE)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Communication, Computing and Digital Systems (C-CODE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/C-CODE.2017.7918947\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Communication, Computing and Digital Systems (C-CODE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/C-CODE.2017.7918947","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cross gender voice morphing using Canonical Correlation Analysis
Voice morphing one of the speech synthesis frameworks, in simplest term aim to transforms speaker's identity from source to target speaker while preserving the original content of message. This paper presents a novel spectral envelope mapping algorithm based on Canonical Correlation Analysis(CCA) that find the association between spectral envelope characteristics of source speaker and target speaker in terms of correlation as a similarity metric. Moreover, the speech also undergoes to prosodic modification using PSOLA as pitch frequency is also an important parameter for varying identity. This morphing algorithm is evaluated by taking the utterances from freely available CMU-ARCTIC speech dataset. The subjective experiment shows that the proposed method successfully transforms speaker identity and produced high-quality morphed signal.