Digital singing voice synthesis using a new alternating reflection model

2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353) Pub Date : 2002-08-07 DOI:10.1109/ISCAS.2002.1011490

M.E. Lee, M.J.T. Smith

{"title":"Digital singing voice synthesis using a new alternating reflection model","authors":"M.E. Lee, M.J.T. Smith","doi":"10.1109/ISCAS.2002.1011490","DOIUrl":null,"url":null,"abstract":"Many models for computer generated singing voices have been proposed in the past and have been shown to produce a wide variety of synthesized voices. While many of these models are capable of synthesizing a particular singing voice with high musical quality, they typically are challenged with respect to naturalness, range, the ability to synthesize both male and female voices, as well as the ability to capture the identity of the singer. The analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model has proven to be effective in producing high quality voices with manageable computational cost. It is based on the combination of a block overlap-add sinusoidal representation and an analysis-by-synthesis parameter estimation technique. ABS/OLA is flexible enough to allow for modifications such as time and pitch scaling; however, it can suffer from quality degradation under such conditions. This paper presents an analysis/synthesis model that incorporates new methods to improve synthesis. These improvements add to the naturalness and flexibility in controlling perceptually important musical characteristics.","PeriodicalId":203750,"journal":{"name":"2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCAS.2002.1011490","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Many models for computer generated singing voices have been proposed in the past and have been shown to produce a wide variety of synthesized voices. While many of these models are capable of synthesizing a particular singing voice with high musical quality, they typically are challenged with respect to naturalness, range, the ability to synthesize both male and female voices, as well as the ability to capture the identity of the singer. The analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model has proven to be effective in producing high quality voices with manageable computational cost. It is based on the combination of a block overlap-add sinusoidal representation and an analysis-by-synthesis parameter estimation technique. ABS/OLA is flexible enough to allow for modifications such as time and pitch scaling; however, it can suffer from quality degradation under such conditions. This paper presents an analysis/synthesis model that incorporates new methods to improve synthesis. These improvements add to the naturalness and flexibility in controlling perceptually important musical characteristics.

查看原文本刊更多论文

使用一种新的交替反射模型的数字歌声合成

过去已经提出了许多计算机生成歌声的模型，并已被证明可以产生各种各样的合成声音。虽然这些模型中的许多都能够合成具有高音乐质量的特定歌唱声音，但它们通常在自然性，音域，合成男声和女声的能力以及捕捉歌手身份的能力方面受到挑战。合成分析/叠加(ABS/OLA)正弦模型已被证明可以有效地产生高质量的声音，并且计算成本可控。它是基于块重叠加正弦表示和综合分析参数估计技术的结合。ABS/OLA足够灵活，允许修改，如时间和音调缩放;然而，在这种条件下，它的质量会下降。本文提出了一个分析/合成模型，该模型包含了改进合成的新方法。这些改进增加了控制感知上重要的音乐特征的自然性和灵活性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353)

自引率

0.00%

发文量