Rupam Das, Abhishek Dey, Wendy Lalhminghlui, Priyankoo Sarmah, K. Samudravijaya, R. Sinha
{"title":"Mizo口语查询系统与韵律信息增强","authors":"Rupam Das, Abhishek Dey, Wendy Lalhminghlui, Priyankoo Sarmah, K. Samudravijaya, R. Sinha","doi":"10.1109/O-COCOSDA50338.2020.9295007","DOIUrl":null,"url":null,"abstract":"We report the development of a Mizo spoken language interface to an online database of commodity prices in various agricultural markets of Mizoram province. The spoken query system is designed to be used over mobile phone networks. It is inspired by earlier such systems developed for languages such as Assamese and Marathi. However, considering the manifold increase in the performance of the current Mizo language spoken query system, we were motivated to report the whole system implementation in detail. The average word error rate of the DNN-HMM speech recognition system in a 5-fold cross validation experiment is 2.2%. The word error rate of the Mizo commodity name recognition system is 2.75 times smaller than that reported for a similar system for Assamese language. The reduction in error rate is attributed to data collection from diverse environmental situations with different noise levels and further processing of the data. The word error rate of the Mizo ASR system increases to 8.5% in field trials conducted with several users.","PeriodicalId":385266,"journal":{"name":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mizo Spoken Query System Enhanced with Prosodic Information\",\"authors\":\"Rupam Das, Abhishek Dey, Wendy Lalhminghlui, Priyankoo Sarmah, K. Samudravijaya, R. Sinha\",\"doi\":\"10.1109/O-COCOSDA50338.2020.9295007\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We report the development of a Mizo spoken language interface to an online database of commodity prices in various agricultural markets of Mizoram province. The spoken query system is designed to be used over mobile phone networks. It is inspired by earlier such systems developed for languages such as Assamese and Marathi. However, considering the manifold increase in the performance of the current Mizo language spoken query system, we were motivated to report the whole system implementation in detail. The average word error rate of the DNN-HMM speech recognition system in a 5-fold cross validation experiment is 2.2%. The word error rate of the Mizo commodity name recognition system is 2.75 times smaller than that reported for a similar system for Assamese language. The reduction in error rate is attributed to data collection from diverse environmental situations with different noise levels and further processing of the data. The word error rate of the Mizo ASR system increases to 8.5% in field trials conducted with several users.\",\"PeriodicalId\":385266,\"journal\":{\"name\":\"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/O-COCOSDA50338.2020.9295007\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/O-COCOSDA50338.2020.9295007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Mizo Spoken Query System Enhanced with Prosodic Information
We report the development of a Mizo spoken language interface to an online database of commodity prices in various agricultural markets of Mizoram province. The spoken query system is designed to be used over mobile phone networks. It is inspired by earlier such systems developed for languages such as Assamese and Marathi. However, considering the manifold increase in the performance of the current Mizo language spoken query system, we were motivated to report the whole system implementation in detail. The average word error rate of the DNN-HMM speech recognition system in a 5-fold cross validation experiment is 2.2%. The word error rate of the Mizo commodity name recognition system is 2.75 times smaller than that reported for a similar system for Assamese language. The reduction in error rate is attributed to data collection from diverse environmental situations with different noise levels and further processing of the data. The word error rate of the Mizo ASR system increases to 8.5% in field trials conducted with several users.