基于领域聚焦文档的非英语母语者语音纠错

K. Radzikowski, Le Wang, O. Yoshie
{"title":"基于领域聚焦文档的非英语母语者语音纠错","authors":"K. Radzikowski, Le Wang, O. Yoshie","doi":"10.1145/3011141.3011169","DOIUrl":null,"url":null,"abstract":"With the increase in exchange programs, many international students worldwide can face communication problems. During lectures, usually English language is used for the communication between students and teachers. However both sides, not necessarily being native speakers of English, may misunderstand each other. In this paper we propose a method for correction of non-native English speakers' speech, based on the domain focused electronic document. The method relies on the results of speech recognition (SR) software, and uses them altogether with the document. Our approach consists of three steps. Firstly, document analysis in the preprocessing phase. Secondly, finding the document part corresponding to sentence from SR software, realised using the Hidden Markov Model (HMM) based method. Finally, the correction by calculating the score for each of candidate sentences, based on the result of SR software. The probability score combines keywords comparison, BM25F method and HMM based method scores. Highest score candidate is chosen as replacement.","PeriodicalId":247823,"journal":{"name":"Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Non-native English speakers' speech correction, based on domain focused document\",\"authors\":\"K. Radzikowski, Le Wang, O. Yoshie\",\"doi\":\"10.1145/3011141.3011169\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the increase in exchange programs, many international students worldwide can face communication problems. During lectures, usually English language is used for the communication between students and teachers. However both sides, not necessarily being native speakers of English, may misunderstand each other. In this paper we propose a method for correction of non-native English speakers' speech, based on the domain focused electronic document. The method relies on the results of speech recognition (SR) software, and uses them altogether with the document. Our approach consists of three steps. Firstly, document analysis in the preprocessing phase. Secondly, finding the document part corresponding to sentence from SR software, realised using the Hidden Markov Model (HMM) based method. Finally, the correction by calculating the score for each of candidate sentences, based on the result of SR software. The probability score combines keywords comparison, BM25F method and HMM based method scores. Highest score candidate is chosen as replacement.\",\"PeriodicalId\":247823,\"journal\":{\"name\":\"Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-11-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3011141.3011169\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3011141.3011169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

随着交流项目的增加,世界各地的许多国际学生都面临着沟通问题。在课堂上,学生和老师之间的交流通常使用英语。然而,双方不一定以英语为母语,可能会误解对方。本文提出了一种基于领域聚焦电子文档的非英语母语者语音纠错方法。该方法依赖于语音识别(SR)软件的结果,并将其与文档一起使用。我们的方法包括三个步骤。首先是预处理阶段的文档分析。其次,利用基于隐马尔可夫模型(HMM)的方法,从SR软件中找到句子对应的文档部分。最后,根据SR软件的结果,通过计算每个候选句子的分数来进行校正。概率评分结合关键词比较、BM25F方法和基于HMM的方法评分。选择得分最高的候选人作为替补。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Non-native English speakers' speech correction, based on domain focused document
With the increase in exchange programs, many international students worldwide can face communication problems. During lectures, usually English language is used for the communication between students and teachers. However both sides, not necessarily being native speakers of English, may misunderstand each other. In this paper we propose a method for correction of non-native English speakers' speech, based on the domain focused electronic document. The method relies on the results of speech recognition (SR) software, and uses them altogether with the document. Our approach consists of three steps. Firstly, document analysis in the preprocessing phase. Secondly, finding the document part corresponding to sentence from SR software, realised using the Hidden Markov Model (HMM) based method. Finally, the correction by calculating the score for each of candidate sentences, based on the result of SR software. The probability score combines keywords comparison, BM25F method and HMM based method scores. Highest score candidate is chosen as replacement.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信