{"title":"语音识别模型中语速的介绍","authors":"A. Yousfi, A. Meziane","doi":"10.1109/PCEE.2000.873603","DOIUrl":null,"url":null,"abstract":"We propose an improvement to the centisecond TLHMM model applied to the sound duration. Indeed, the distribution of the sound duration depends on the speaking rate. An adaptation in a post-processing step is needed. This adaptation is studied by proposing a model of the speaking rate based on average syllabic duration. The experiments elaborated on a set of BDSONS show the interest of this approach. This work is a continuation of those of (Meziane et al., 1999) and (Suaudeau, 1994).","PeriodicalId":369394,"journal":{"name":"Proceedings International Conference on Parallel Computing in Electrical Engineering. PARELEC 2000","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Introduction of the speaking rate in the model of speech recognition\",\"authors\":\"A. Yousfi, A. Meziane\",\"doi\":\"10.1109/PCEE.2000.873603\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose an improvement to the centisecond TLHMM model applied to the sound duration. Indeed, the distribution of the sound duration depends on the speaking rate. An adaptation in a post-processing step is needed. This adaptation is studied by proposing a model of the speaking rate based on average syllabic duration. The experiments elaborated on a set of BDSONS show the interest of this approach. This work is a continuation of those of (Meziane et al., 1999) and (Suaudeau, 1994).\",\"PeriodicalId\":369394,\"journal\":{\"name\":\"Proceedings International Conference on Parallel Computing in Electrical Engineering. PARELEC 2000\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings International Conference on Parallel Computing in Electrical Engineering. PARELEC 2000\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PCEE.2000.873603\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings International Conference on Parallel Computing in Electrical Engineering. PARELEC 2000","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PCEE.2000.873603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
我们提出了一种改进的厘秒TLHMM模型适用于声音的持续时间。实际上,声音持续时间的分布取决于说话的速度。需要在后处理步骤中进行调整。通过提出一个基于平均音节时长的语速模型来研究这种适应性。在一组bdson上进行的实验表明了这种方法的可行性。这项工作是(Meziane et al., 1999)和(Suaudeau, 1994)的工作的延续。
Introduction of the speaking rate in the model of speech recognition
We propose an improvement to the centisecond TLHMM model applied to the sound duration. Indeed, the distribution of the sound duration depends on the speaking rate. An adaptation in a post-processing step is needed. This adaptation is studied by proposing a model of the speaking rate based on average syllabic duration. The experiments elaborated on a set of BDSONS show the interest of this approach. This work is a continuation of those of (Meziane et al., 1999) and (Suaudeau, 1994).