斯洛伐克语听写语音识别的假设组合

M. Lojka, J. Juhár
{"title":"斯洛伐克语听写语音识别的假设组合","authors":"M. Lojka, J. Juhár","doi":"10.1109/ELMAR.2014.6923311","DOIUrl":null,"url":null,"abstract":"Combination of multiple speech recognition systems is the most used method for improving speech recognition accuracy. The combination is performed at the feature level or the systems are exchanging informations between each other during decoding process or are combined afterwards using their outputs in form of N-best hypothesis or lattices. This paper provides initial experiments with system combination for Slovak language speech recognition using well known combination tool, the Recognition Output Voting Error Reduction (ROVER) from National Institute of Standards and Technology (NIST). Two kinds of scores provided to ROVER are here explored. The first one is based on normalized posterior probabilities and the second one on confidence scores of words in recognized sentence. Also new method for improving the efficiency of combination by smoothing the scores is presented.","PeriodicalId":424325,"journal":{"name":"Proceedings ELMAR-2014","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Hypothesis combination for Slovak dictation speech recognition\",\"authors\":\"M. Lojka, J. Juhár\",\"doi\":\"10.1109/ELMAR.2014.6923311\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Combination of multiple speech recognition systems is the most used method for improving speech recognition accuracy. The combination is performed at the feature level or the systems are exchanging informations between each other during decoding process or are combined afterwards using their outputs in form of N-best hypothesis or lattices. This paper provides initial experiments with system combination for Slovak language speech recognition using well known combination tool, the Recognition Output Voting Error Reduction (ROVER) from National Institute of Standards and Technology (NIST). Two kinds of scores provided to ROVER are here explored. The first one is based on normalized posterior probabilities and the second one on confidence scores of words in recognized sentence. Also new method for improving the efficiency of combination by smoothing the scores is presented.\",\"PeriodicalId\":424325,\"journal\":{\"name\":\"Proceedings ELMAR-2014\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-10-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings ELMAR-2014\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ELMAR.2014.6923311\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings ELMAR-2014","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELMAR.2014.6923311","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

多个语音识别系统的组合是提高语音识别精度最常用的方法。组合在特征级执行,或者系统在解码过程中相互交换信息,或者之后使用它们的输出以n -最优假设或格的形式进行组合。本文提供了斯洛伐克语语音识别系统组合的初步实验,使用著名的组合工具,来自美国国家标准与技术研究所(NIST)的识别输出投票错误减少(ROVER)。这里将探讨提供给ROVER的两种分数。第一个是基于归一化后验概率,第二个是基于识别句子中单词的置信度得分。提出了一种通过对分数进行平滑处理来提高组合效率的新方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Hypothesis combination for Slovak dictation speech recognition
Combination of multiple speech recognition systems is the most used method for improving speech recognition accuracy. The combination is performed at the feature level or the systems are exchanging informations between each other during decoding process or are combined afterwards using their outputs in form of N-best hypothesis or lattices. This paper provides initial experiments with system combination for Slovak language speech recognition using well known combination tool, the Recognition Output Voting Error Reduction (ROVER) from National Institute of Standards and Technology (NIST). Two kinds of scores provided to ROVER are here explored. The first one is based on normalized posterior probabilities and the second one on confidence scores of words in recognized sentence. Also new method for improving the efficiency of combination by smoothing the scores is presented.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信