N. Saimai, C. Tantibundhit, C. Onsuwan, N. Thatphithakkul
2012 9th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, pp. 1-4. Published 2012-05-16.
DOI: 10.1109/ECTICON.2012.6254287 (https://doi.org/10.1109/ECTICON.2012.6254287)
Citations: 2
Speech synthesis algorithm for Thai cochlear implants
Enhancing speech perception is a central goal of cochlear implant (CI) technology. In a tonal language such as Thai, which has both segments (consonants and vowels) and suprasegmentals (tones), a speech processing strategy must take several crucial acoustic cues into account, namely amplitude envelopes and temporal fine structure. This paper presents a new speech synthesis algorithm for CI. It extracts each band's amplitude envelope with the continuous interleaved sampling (CIS) strategy, extracts temporal fine structure to construct carrier signals, and amplitude-modulates each carrier with its band's envelope. Initial phonemes synthesized by three algorithms (Fu et al., Chen and Zhang, and the proposed algorithm), each with a six-band speech processor, were evaluated by eight native Thai speakers with normal hearing. The results show that the proposed algorithm produces the most significant improvement in the intelligibility of initial phonemes among the three algorithms. This suggests that the proposed algorithm could be used in CI devices designed specifically for tonal languages such as Thai to improve speech perception in CI patients.
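The processing chain described in the abstract (band splitting, envelope extraction, fine-structure carriers, per-band amplitude modulation) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no filter designs or cutoffs, so the band range (200 to 5000 Hz), the second-order band-pass filters, the 400 Hz envelope smoothing, and every function name below are assumptions chosen for illustration. The fine-structure carrier is also a stand-in: it keeps only the zero crossings of each band signal, where a fuller implementation might use a Hilbert-transform phase instead.

```python
import math

def band_edges(n_bands=6, f_lo=200.0, f_hi=5000.0):
    """Log-spaced band edges in Hz (assumed range, not taken from the paper)."""
    ratio = (f_hi / f_lo) ** (1.0 / n_bands)
    return [f_lo * ratio ** i for i in range(n_bands + 1)]

def bandpass(x, fs, lo, hi):
    """Second-order band-pass biquad with unity peak gain (assumed filter)."""
    f0 = math.sqrt(lo * hi)          # geometric centre frequency
    q = f0 / (hi - lo)               # approximate quality factor
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    b0, b2 = alpha, -alpha
    a0, a1, a2 = 1.0 + alpha, -2.0 * math.cos(w0), 1.0 - alpha
    y, x1, x2, y1, y2 = [], 0.0, 0.0, 0.0, 0.0
    for xn in x:
        yn = (b0 * xn + b2 * x2 - a1 * y1 - a2 * y2) / a0
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        y.append(yn)
    return y

def envelope(band, fs, cutoff=400.0):
    """CIS-style envelope: full-wave rectification, then one-pole low-pass."""
    a = math.exp(-2.0 * math.pi * cutoff / fs)
    env, state = [], 0.0
    for v in band:
        state = (1.0 - a) * abs(v) + a * state
        env.append(state)
    return env

def fine_structure_carrier(band):
    """Unit-amplitude carrier keeping only the band's zero crossings (stand-in)."""
    return [1.0 if v >= 0.0 else -1.0 for v in band]

def synthesize(x, fs, n_bands=6):
    """Sum over bands of (envelope * fine-structure carrier)."""
    edges = band_edges(n_bands)
    out = [0.0] * len(x)
    envelopes = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        band = bandpass(x, fs, lo, hi)
        env = envelope(band, fs)
        carrier = fine_structure_carrier(band)
        envelopes.append(env)
        for n in range(len(x)):
            out[n] += env[n] * carrier[n]
    return out, envelopes

# Usage: a 750 Hz test tone, 0.1 s at 16 kHz (hypothetical input).
fs = 16000
tone = [math.sin(2.0 * math.pi * 750.0 * n / fs) for n in range(1600)]
out, envelopes = synthesize(tone, fs)
```

With six log-spaced bands over the assumed range, the 750 Hz tone falls in the third band, so that band's envelope should dominate. A conventional noise or sine vocoder would replace `fine_structure_carrier` with a carrier that is independent of the input, which discards the fine-structure cue that the abstract identifies as important for tonal languages.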