{"title":"HuZhouSpeech: A Huzhou Dialect Speech Recognition Corpus","authors":"Yejin Wang, Maonian Wu, Bo Zheng, Shaojun Zhu","doi":"10.1109/ICICSP55539.2022.10050614","DOIUrl":null,"url":null,"abstract":"In this paper, a new open source dialect speech corpus is proposed. It is by far the only open source Goetian dialect corpus, suitable for conducting speech recognition research and building a speech recognition system for Huzhou dialect. This corpus contains about 184 hours of speech and corresponding subtitle transcriptions. In addition, an automated method for building speech recognition corpus is proposed. Only the film and television materials with subtitles are needed to build the speech recognition corpus automatically. Tested on four classical speech recognition frameworks to demonstrate the effectiveness of the corpus for training different speech recognition systems.","PeriodicalId":281095,"journal":{"name":"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)","volume":"2 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICSP55539.2022.10050614","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, a new open source dialect speech corpus is proposed. It is by far the only open source Goetian dialect corpus, suitable for conducting speech recognition research and building a speech recognition system for Huzhou dialect. This corpus contains about 184 hours of speech and corresponding subtitle transcriptions. In addition, an automated method for building speech recognition corpus is proposed. Only the film and television materials with subtitles are needed to build the speech recognition corpus automatically. Tested on four classical speech recognition frameworks to demonstrate the effectiveness of the corpus for training different speech recognition systems.