S. Rallabandi, Ayushi Pandey, Sai Krishna Rallabandi, Tejas Godambe, S. Gangashetty
{"title":"声音上升:在基于音节的语音合成中帮助后退","authors":"S. Rallabandi, Ayushi Pandey, Sai Krishna Rallabandi, Tejas Godambe, S. Gangashetty","doi":"10.1109/NCC.2016.7561189","DOIUrl":null,"url":null,"abstract":"Back off techniques are employed in syllable based unit selection speech synthesis systems to maintain the naturalness of the speech in spite of the missing syllables. In synthesizing the missing complex consonant clusters syllables of Telugu, we introduced reduced vowel epenthesis as a rule-based backoff strategy[1]. In this paper, we refine the scope of the approach in selectively applying vowel epenthesis only in cases of sonority rise between adjacent consonants. When the sonority does not rise (stop-stop, liquid-stop clusters), we increase the duration of the consonant. Owing to specific patterns of vowel epenthesis observed in languages, we conduct a subjective evaluation to determine the identity of the epenthetic vowel in Hindi. From the inferences of the listening test, we devise a class based rule to perform epenthesis. Further, to evaluate the performance of the designed system, we perform both subjective as well as an objective evaluation based on confidence measures from an ASR system. We conduct a phone level automatic speech recognition task on the intelligibility of the words synthesized using epenthesis as a cluster-repair strategy. The results show that the proposed back off method helps in producing more natural-sounding speech compared to the conventional backoffs.","PeriodicalId":279637,"journal":{"name":"2016 Twenty Second National Conference on Communication (NCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Sonority rise: Aiding backoff in syllable-based speech synthesis\",\"authors\":\"S. Rallabandi, Ayushi Pandey, Sai Krishna Rallabandi, Tejas Godambe, S. Gangashetty\",\"doi\":\"10.1109/NCC.2016.7561189\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Back off techniques are employed in syllable based unit selection speech synthesis systems to maintain the naturalness of the speech in spite of the missing syllables. In synthesizing the missing complex consonant clusters syllables of Telugu, we introduced reduced vowel epenthesis as a rule-based backoff strategy[1]. In this paper, we refine the scope of the approach in selectively applying vowel epenthesis only in cases of sonority rise between adjacent consonants. When the sonority does not rise (stop-stop, liquid-stop clusters), we increase the duration of the consonant. Owing to specific patterns of vowel epenthesis observed in languages, we conduct a subjective evaluation to determine the identity of the epenthetic vowel in Hindi. From the inferences of the listening test, we devise a class based rule to perform epenthesis. Further, to evaluate the performance of the designed system, we perform both subjective as well as an objective evaluation based on confidence measures from an ASR system. We conduct a phone level automatic speech recognition task on the intelligibility of the words synthesized using epenthesis as a cluster-repair strategy. The results show that the proposed back off method helps in producing more natural-sounding speech compared to the conventional backoffs.\",\"PeriodicalId\":279637,\"journal\":{\"name\":\"2016 Twenty Second National Conference on Communication (NCC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-03-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 Twenty Second National Conference on Communication (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC.2016.7561189\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Twenty Second National Conference on Communication (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2016.7561189","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Sonority rise: Aiding backoff in syllable-based speech synthesis
Back off techniques are employed in syllable based unit selection speech synthesis systems to maintain the naturalness of the speech in spite of the missing syllables. In synthesizing the missing complex consonant clusters syllables of Telugu, we introduced reduced vowel epenthesis as a rule-based backoff strategy[1]. In this paper, we refine the scope of the approach in selectively applying vowel epenthesis only in cases of sonority rise between adjacent consonants. When the sonority does not rise (stop-stop, liquid-stop clusters), we increase the duration of the consonant. Owing to specific patterns of vowel epenthesis observed in languages, we conduct a subjective evaluation to determine the identity of the epenthetic vowel in Hindi. From the inferences of the listening test, we devise a class based rule to perform epenthesis. Further, to evaluate the performance of the designed system, we perform both subjective as well as an objective evaluation based on confidence measures from an ASR system. We conduct a phone level automatic speech recognition task on the intelligibility of the words synthesized using epenthesis as a cluster-repair strategy. The results show that the proposed back off method helps in producing more natural-sounding speech compared to the conventional backoffs.