德国串联合成的混合库存结构

T. Portele, F. Höfer, W. Hess
{"title":"德国串联合成的混合库存结构","authors":"T. Portele, F. Höfer, W. Hess","doi":"10.22028/D291-25294","DOIUrl":null,"url":null,"abstract":"In speech synthesis by unit concatenation a major point is the definition of the unit inventory. Diphone or demisyliable inventories are widely used but both unit types have their drawbacks. This chapter describes a mixed inventory structure that is syllable-oriented but does not demand a definite decision about the position of a syllable boundary. In the definition process of the inventory the results of a comprehensive investigation of coarticulatory phenomena at syllable boundaries were used as well as a machine-readable pronunciation dictionary. An evaluation comparing the mixed inventory with a demisyllable and a diphone inventory confirms that speech generated with the mixed inventory is superior regarding general acceptance. A segmental intelligibility test shows the high intelligibility of the synthetic speech.","PeriodicalId":340820,"journal":{"name":"Speech Synthesis Workshop","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"A mixed inventory structure for German concatenative synthesis\",\"authors\":\"T. Portele, F. Höfer, W. Hess\",\"doi\":\"10.22028/D291-25294\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In speech synthesis by unit concatenation a major point is the definition of the unit inventory. Diphone or demisyliable inventories are widely used but both unit types have their drawbacks. This chapter describes a mixed inventory structure that is syllable-oriented but does not demand a definite decision about the position of a syllable boundary. In the definition process of the inventory the results of a comprehensive investigation of coarticulatory phenomena at syllable boundaries were used as well as a machine-readable pronunciation dictionary. An evaluation comparing the mixed inventory with a demisyllable and a diphone inventory confirms that speech generated with the mixed inventory is superior regarding general acceptance. A segmental intelligibility test shows the high intelligibility of the synthetic speech.\",\"PeriodicalId\":340820,\"journal\":{\"name\":\"Speech Synthesis Workshop\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Speech Synthesis Workshop\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.22028/D291-25294\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Speech Synthesis Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22028/D291-25294","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

摘要

在单元拼接语音合成中,一个重要的问题是单元清单的定义。Diphone或可销毁库存被广泛使用,但这两种单位类型都有其缺点。本章描述了一种以音节为导向但不需要明确决定音节边界位置的混合清单结构。在清单的定义过程中,使用了音节边界处的协同发音现象的综合调查结果以及机器可读的发音词典。将混合清单与半音节清单和双音节清单进行比较的评估确认,混合清单生成的语音在一般接受度方面优于双音节清单。片段可理解性测试表明合成语音具有较高的可理解性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A mixed inventory structure for German concatenative synthesis
In speech synthesis by unit concatenation a major point is the definition of the unit inventory. Diphone or demisyliable inventories are widely used but both unit types have their drawbacks. This chapter describes a mixed inventory structure that is syllable-oriented but does not demand a definite decision about the position of a syllable boundary. In the definition process of the inventory the results of a comprehensive investigation of coarticulatory phenomena at syllable boundaries were used as well as a machine-readable pronunciation dictionary. An evaluation comparing the mixed inventory with a demisyllable and a diphone inventory confirms that speech generated with the mixed inventory is superior regarding general acceptance. A segmental intelligibility test shows the high intelligibility of the synthetic speech.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信