Hypothesis combination for Slovak dictation speech recognition

Proceedings ELMAR-2014 | Pub Date: 2014-10-16 | DOI: 10.1109/ELMAR.2014.6923311

M. Lojka, J. Juhár


Citations: 10

Abstract



Combining multiple speech recognition systems is one of the most widely used methods for improving speech recognition accuracy. The combination can be performed at the feature level, during decoding by having the systems exchange information with each other, or afterwards on the systems' outputs in the form of N-best hypotheses or lattices. This paper presents initial experiments with system combination for Slovak speech recognition using a well-known combination tool, Recognizer Output Voting Error Reduction (ROVER) from the National Institute of Standards and Technology (NIST). Two kinds of scores supplied to ROVER are explored: the first is based on normalized posterior probabilities, and the second on confidence scores of the words in the recognized sentence. A new method for improving the efficiency of the combination by smoothing the scores is also presented.
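To make the voting step concrete, here is a minimal sketch of ROVER-style voting over slots that have already been aligned. It is not the authors' implementation and does not call the NIST tool. It assumes the commonly cited scoring rule, in which a word's score is a weighted sum of its relative frequency across systems and its confidence. The names `vote_slot`, `combine`, `NULL`, `alpha`, and `null_conf` are hypothetical, and averaging confidence over only the systems that proposed a word is a simplification. The score smoothing mentioned in the abstract is not described there, so it is left out.

```python
from collections import defaultdict

NULL = "@"  # hypothetical marker for "no word" in an aligned slot


def vote_slot(candidates, num_systems, alpha=0.5, null_conf=0.3, use_max=False):
    """Pick the winning word for one aligned slot.

    candidates: list of (word, confidence) pairs, one per system.
    alpha: weight of word frequency versus confidence (assumed parameter).
    null_conf: confidence assigned to the NULL word (assumed parameter).
    """
    confs = defaultdict(list)
    for word, conf in candidates:
        confs[word].append(null_conf if word == NULL else conf)

    def score(word):
        values = confs[word]
        frequency = len(values) / num_systems
        # Simplification: average (or max) over systems that proposed the word.
        confidence = max(values) if use_max else sum(values) / len(values)
        return alpha * frequency + (1 - alpha) * confidence

    # sorted() makes tie-breaking deterministic (alphabetical order).
    return max(sorted(confs), key=score)


def combine(slots, num_systems, **kwargs):
    """Vote in every slot and drop slots where NULL wins."""
    winners = (vote_slot(slot, num_systems, **kwargs) for slot in slots)
    return [word for word in winners if word != NULL]


if __name__ == "__main__":
    # Three hypothetical systems, two aligned slots; the scores are made up.
    slots = [
        [("dobrý", 0.9), ("dobrý", 0.8), ("dobrí", 0.7)],
        [("deň", 0.6), (NULL, 0.0), ("deň", 0.9)],
    ]
    print(" ".join(combine(slots, num_systems=3)))
```

With `alpha=1.0` the rule reduces to plain majority voting, and with `alpha=0.0` only the confidence scores decide, which is where the choice between posterior-based and confidence-based scores would matter.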
