Altangerel Chagnaa, Purev Jaimai, Kerey Yesyenbyek, C. Hansakunbuntheung
2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), published 2013-11-01
DOI: 10.1109/ICSDA.2013.6709881 (https://doi.org/10.1109/ICSDA.2013.6709881)
An evaluation of Mongolian data-driven Text-to-Speech
This paper presents a first attempt to evaluate data-driven speech synthesis of Mongolian, trained on a 1500-sentence female speech corpus. The corpus contains nearly 6 hours of Mongolian female speech and is designed to cover all Mongolian phones. The evaluation is carried out on two levels. For the overall quality evaluation, we generated 25 sentences and asked raters to judge their quality on the Mean Opinion Score (MOS) scale. The second evaluation is a phoneme confusion test that covers the full Mongolian phoneme set.
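
The two evaluation levels described above can be scored roughly as follows. This is a minimal sketch and not the authors' procedure: the function names, the 1-5 rating scale, the normal-approximation 95% interval, and the (intended, perceived) pair format are all assumptions made for illustration, and no data from the paper is reproduced here.

```python
from collections import Counter
from math import sqrt
from statistics import mean, stdev


def mean_opinion_score(ratings):
    """Return (MOS, half-width of an approximate 95% confidence interval).

    Assumes ratings are on the usual 1-5 MOS scale (an assumption; the
    abstract does not state the scale).
    """
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(r < 1 or r > 5 for r in ratings):
        raise ValueError("ratings must lie on the 1-5 scale")
    mos = mean(ratings)
    # Normal approximation; with a single rating no spread can be estimated.
    half_width = 1.96 * stdev(ratings) / sqrt(len(ratings)) if len(ratings) > 1 else 0.0
    return mos, half_width


def confusion_counts(pairs):
    """Count how often each intended phoneme was heard as each perceived phoneme."""
    return Counter(pairs)


def phoneme_accuracy(pairs):
    """Fraction of (intended, perceived) pairs where the listener was correct."""
    if not pairs:
        raise ValueError("at least one response is required")
    correct = sum(1 for intended, perceived in pairs if intended == perceived)
    return correct / len(pairs)
```

In use, each rater's score for each of the generated sentences would be pooled into one list for `mean_opinion_score`, and each listener response in the confusion test would contribute one (intended, perceived) pair. The off-diagonal entries of `confusion_counts` then show which phonemes the synthesizer renders ambiguously.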