{"title":"西班牙语文本到语音转换中的数据驱动联合f/sub /和持续时间建模","authors":"E. López-Gonzalo, L. Hernández-Gómez","doi":"10.1109/ICASSP.1994.389225","DOIUrl":null,"url":null,"abstract":"The aim of the proposed paper is to discuss how to model representations of both fundamental frequency and suprasegmental duration in TTS converters for Spanish. For this purpose we use a data-driven methodology that is able to represent both fundamental frequency and suprasegmental duration in order to model the prosody of a text-to-speech system for Spanish.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"197 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Data-driven joint f/sub 0/ and duration modeling in text to speech conversion for Spanish\",\"authors\":\"E. López-Gonzalo, L. Hernández-Gómez\",\"doi\":\"10.1109/ICASSP.1994.389225\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The aim of the proposed paper is to discuss how to model representations of both fundamental frequency and suprasegmental duration in TTS converters for Spanish. For this purpose we use a data-driven methodology that is able to represent both fundamental frequency and suprasegmental duration in order to model the prosody of a text-to-speech system for Spanish.<<ETX>>\",\"PeriodicalId\":290798,\"journal\":{\"name\":\"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing\",\"volume\":\"197 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1994-04-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASSP.1994.389225\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1994.389225","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data-driven joint f/sub 0/ and duration modeling in text to speech conversion for Spanish
The aim of the proposed paper is to discuss how to model representations of both fundamental frequency and suprasegmental duration in TTS converters for Spanish. For this purpose we use a data-driven methodology that is able to represent both fundamental frequency and suprasegmental duration in order to model the prosody of a text-to-speech system for Spanish.<>