{"title":"Robust voice conversion systems using MFDWC","authors":"M. Farhid, M. Tinati","doi":"10.1109/ISTEL.2008.4651405","DOIUrl":null,"url":null,"abstract":"Voice conversion is a method used to transform one speakerpsilas voice into another speakerpsilas voice. New modification approach for voice conversion is proposed in this paper. We take Mel-frequency Discrete Wavelet coefficients (MFDWC) as the basic feature. This feature copes well with small training sets of high dimension, which is a problem often encountered in voice conversion. The proposed voice conversion system assumes parallel training data from source and target speakers and uses the theory of wavelets in order to extract speaker feature information. The satisfactory performance of the voice conversion system can be confirmed through ABX listening test and MOS grade.","PeriodicalId":133602,"journal":{"name":"2008 International Symposium on Telecommunications","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Symposium on Telecommunications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISTEL.2008.4651405","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Voice conversion is a method used to transform one speakerpsilas voice into another speakerpsilas voice. New modification approach for voice conversion is proposed in this paper. We take Mel-frequency Discrete Wavelet coefficients (MFDWC) as the basic feature. This feature copes well with small training sets of high dimension, which is a problem often encountered in voice conversion. The proposed voice conversion system assumes parallel training data from source and target speakers and uses the theory of wavelets in order to extract speaker feature information. The satisfactory performance of the voice conversion system can be confirmed through ABX listening test and MOS grade.