P. Chan, M. Dong, Yi Qian Lim, A. Toh, E. Chong, Mantita Yeo, Megan Chua, Haizhou Li
{"title":"歌唱合成中的共振峰偏移","authors":"P. Chan, M. Dong, Yi Qian Lim, A. Toh, E. Chong, Mantita Yeo, Megan Chua, Haizhou Li","doi":"10.1109/ICDSP.2015.7251852","DOIUrl":null,"url":null,"abstract":"This paper presents our work in formant excursion in the human singing voice. In singing voice synthesis, numerous methods have been proposed to modify pitch and energy over time in order to achieve better expressiveness and naturalness [1]-[3]. Methods to modify the spectral envelop, however, remain conservative [4], [5]. An expressive singer, nevertheless, employs different techniques to modify his vocal spectra extensively throughout a song [6]. This motivates our study on formant excursion. We hypothesize that the level of semantic reliance on vowels limits the range of formant excursion and develop a method to find |Ξ|, a measure of isolated spectral distortion attributed to singing expressiveness, independent of the spectral differences inherent between speech and singing. With this, we are able to better parameterize spectral modifications in the singing voice towards a dynamic spectral model for singing synthesis.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Formant excursion in singing synthesis\",\"authors\":\"P. Chan, M. Dong, Yi Qian Lim, A. Toh, E. Chong, Mantita Yeo, Megan Chua, Haizhou Li\",\"doi\":\"10.1109/ICDSP.2015.7251852\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents our work in formant excursion in the human singing voice. In singing voice synthesis, numerous methods have been proposed to modify pitch and energy over time in order to achieve better expressiveness and naturalness [1]-[3]. Methods to modify the spectral envelop, however, remain conservative [4], [5]. An expressive singer, nevertheless, employs different techniques to modify his vocal spectra extensively throughout a song [6]. This motivates our study on formant excursion. We hypothesize that the level of semantic reliance on vowels limits the range of formant excursion and develop a method to find |Ξ|, a measure of isolated spectral distortion attributed to singing expressiveness, independent of the spectral differences inherent between speech and singing. With this, we are able to better parameterize spectral modifications in the singing voice towards a dynamic spectral model for singing synthesis.\",\"PeriodicalId\":216293,\"journal\":{\"name\":\"2015 IEEE International Conference on Digital Signal Processing (DSP)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-07-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE International Conference on Digital Signal Processing (DSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSP.2015.7251852\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Digital Signal Processing (DSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSP.2015.7251852","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This paper presents our work in formant excursion in the human singing voice. In singing voice synthesis, numerous methods have been proposed to modify pitch and energy over time in order to achieve better expressiveness and naturalness [1]-[3]. Methods to modify the spectral envelop, however, remain conservative [4], [5]. An expressive singer, nevertheless, employs different techniques to modify his vocal spectra extensively throughout a song [6]. This motivates our study on formant excursion. We hypothesize that the level of semantic reliance on vowels limits the range of formant excursion and develop a method to find |Ξ|, a measure of isolated spectral distortion attributed to singing expressiveness, independent of the spectral differences inherent between speech and singing. With this, we are able to better parameterize spectral modifications in the singing voice towards a dynamic spectral model for singing synthesis.