Mizo Spoken Query System Enhanced with Prosodic Information

Rupam Das, Abhishek Dey, Wendy Lalhminghlui, Priyankoo Sarmah, K. Samudravijaya, R. Sinha
Published in: 2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)
Publication date: 2020-11-05
DOI: 10.1109/O-COCOSDA50338.2020.9295007 (https://doi.org/10.1109/O-COCOSDA50338.2020.9295007)
Citations: 0

Abstract

We report the development of a Mizo spoken language interface to an online database of commodity prices in agricultural markets across the state of Mizoram. The spoken query system is designed for use over mobile phone networks and is inspired by earlier systems developed for languages such as Assamese and Marathi. Given the manifold improvement in performance achieved by the current Mizo spoken query system, we describe the whole system implementation in detail. The DNN-HMM speech recognition system achieves an average word error rate (WER) of 2.2% in a 5-fold cross-validation experiment. The WER of the Mizo commodity name recognition system is lower by a factor of 2.75 than that reported for a similar system for the Assamese language. We attribute the reduction in error rate to collecting data in diverse environments with different noise levels and to further processing of the data. In field trials conducted with several users, the WER of the Mizo ASR system increases to 8.5%.
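The abstract reports its results as word error rates. As a minimal sketch of how that metric is commonly computed (the abstract does not describe the authors' scoring tool, so this is an assumption), the Python below counts word-level substitutions, deletions and insertions with a Levenshtein alignment and divides by the number of reference words. The function names and the sample utterances are hypothetical placeholders, not data from the paper.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two word sequences (each edit costs 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word is missing
                curr[j - 1] + 1,     # insertion: extra hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def word_error_rate(refs: Sequence[str], hyps: Sequence[str]) -> float:
    """Corpus-level WER in percent: total edits / total reference words."""
    errors = sum(edit_distance(r.split(), h.split()) for r, h in zip(refs, hyps))
    words = sum(len(r.split()) for r in refs)
    if words == 0:
        raise ValueError("reference transcripts contain no words")
    return 100.0 * errors / words


# Hypothetical transcripts, for illustration only (not from the paper).
references = ["price of rice in market a", "price of ginger"]
hypotheses = ["price of rice in market", "price of finger"]
print(f"WER = {word_error_rate(references, hypotheses):.2f}%")
```

In a k-fold setup, a function like this would be applied to each held-out fold. Whether the reported 2.2% is a mean of per-fold rates or a pooled rate is not stated in the abstract.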