歌唱合成中的共振峰偏移

P. Chan, M. Dong, Yi Qian Lim, A. Toh, E. Chong, Mantita Yeo, Megan Chua, Haizhou Li
{"title":"歌唱合成中的共振峰偏移","authors":"P. Chan, M. Dong, Yi Qian Lim, A. Toh, E. Chong, Mantita Yeo, Megan Chua, Haizhou Li","doi":"10.1109/ICDSP.2015.7251852","DOIUrl":null,"url":null,"abstract":"This paper presents our work in formant excursion in the human singing voice. In singing voice synthesis, numerous methods have been proposed to modify pitch and energy over time in order to achieve better expressiveness and naturalness [1]-[3]. Methods to modify the spectral envelop, however, remain conservative [4], [5]. An expressive singer, nevertheless, employs different techniques to modify his vocal spectra extensively throughout a song [6]. This motivates our study on formant excursion. We hypothesize that the level of semantic reliance on vowels limits the range of formant excursion and develop a method to find |Ξ|, a measure of isolated spectral distortion attributed to singing expressiveness, independent of the spectral differences inherent between speech and singing. With this, we are able to better parameterize spectral modifications in the singing voice towards a dynamic spectral model for singing synthesis.","PeriodicalId":216293,"journal":{"name":"2015 IEEE International Conference on Digital Signal Processing (DSP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Formant excursion in singing synthesis\",\"authors\":\"P. Chan, M. Dong, Yi Qian Lim, A. Toh, E. Chong, Mantita Yeo, Megan Chua, Haizhou Li\",\"doi\":\"10.1109/ICDSP.2015.7251852\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents our work in formant excursion in the human singing voice. In singing voice synthesis, numerous methods have been proposed to modify pitch and energy over time in order to achieve better expressiveness and naturalness [1]-[3]. Methods to modify the spectral envelop, however, remain conservative [4], [5]. An expressive singer, nevertheless, employs different techniques to modify his vocal spectra extensively throughout a song [6]. This motivates our study on formant excursion. We hypothesize that the level of semantic reliance on vowels limits the range of formant excursion and develop a method to find |Ξ|, a measure of isolated spectral distortion attributed to singing expressiveness, independent of the spectral differences inherent between speech and singing. With this, we are able to better parameterize spectral modifications in the singing voice towards a dynamic spectral model for singing synthesis.\",\"PeriodicalId\":216293,\"journal\":{\"name\":\"2015 IEEE International Conference on Digital Signal Processing (DSP)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-07-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE International Conference on Digital Signal Processing (DSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSP.2015.7251852\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Digital Signal Processing (DSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSP.2015.7251852","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

本文介绍了我们在人类歌唱声音的共振偏移方面的工作。在歌唱声音合成中,为了获得更好的表现力和自然度,已经提出了许多方法来随时间调整音高和能量[1]-[3]。然而,修改谱包络的方法仍然是保守的[4],[5]。然而,一个富有表现力的歌手在整首歌中使用不同的技巧来广泛地修改他的声谱[6]。这激发了我们对峰漂移的研究。我们假设对元音的语义依赖程度限制了形成峰偏移的范围,并开发了一种方法来找到|Ξ|,这是一种独立于言语和歌唱之间固有的频谱差异的、归因于歌唱表现力的孤立频谱失真的测量。这样,我们就可以更好地参数化歌唱声音中的频谱变化,从而建立一个用于歌唱合成的动态频谱模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Formant excursion in singing synthesis
This paper presents our work in formant excursion in the human singing voice. In singing voice synthesis, numerous methods have been proposed to modify pitch and energy over time in order to achieve better expressiveness and naturalness [1]-[3]. Methods to modify the spectral envelop, however, remain conservative [4], [5]. An expressive singer, nevertheless, employs different techniques to modify his vocal spectra extensively throughout a song [6]. This motivates our study on formant excursion. We hypothesize that the level of semantic reliance on vowels limits the range of formant excursion and develop a method to find |Ξ|, a measure of isolated spectral distortion attributed to singing expressiveness, independent of the spectral differences inherent between speech and singing. With this, we are able to better parameterize spectral modifications in the singing voice towards a dynamic spectral model for singing synthesis.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信