{"title":"斯洛伐克语听写语音识别的假设组合","authors":"M. Lojka, J. Juhár","doi":"10.1109/ELMAR.2014.6923311","DOIUrl":null,"url":null,"abstract":"Combination of multiple speech recognition systems is the most used method for improving speech recognition accuracy. The combination is performed at the feature level or the systems are exchanging informations between each other during decoding process or are combined afterwards using their outputs in form of N-best hypothesis or lattices. This paper provides initial experiments with system combination for Slovak language speech recognition using well known combination tool, the Recognition Output Voting Error Reduction (ROVER) from National Institute of Standards and Technology (NIST). Two kinds of scores provided to ROVER are here explored. The first one is based on normalized posterior probabilities and the second one on confidence scores of words in recognized sentence. Also new method for improving the efficiency of combination by smoothing the scores is presented.","PeriodicalId":424325,"journal":{"name":"Proceedings ELMAR-2014","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Hypothesis combination for Slovak dictation speech recognition\",\"authors\":\"M. Lojka, J. Juhár\",\"doi\":\"10.1109/ELMAR.2014.6923311\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Combination of multiple speech recognition systems is the most used method for improving speech recognition accuracy. The combination is performed at the feature level or the systems are exchanging informations between each other during decoding process or are combined afterwards using their outputs in form of N-best hypothesis or lattices. This paper provides initial experiments with system combination for Slovak language speech recognition using well known combination tool, the Recognition Output Voting Error Reduction (ROVER) from National Institute of Standards and Technology (NIST). Two kinds of scores provided to ROVER are here explored. The first one is based on normalized posterior probabilities and the second one on confidence scores of words in recognized sentence. Also new method for improving the efficiency of combination by smoothing the scores is presented.\",\"PeriodicalId\":424325,\"journal\":{\"name\":\"Proceedings ELMAR-2014\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-10-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings ELMAR-2014\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ELMAR.2014.6923311\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings ELMAR-2014","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELMAR.2014.6923311","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hypothesis combination for Slovak dictation speech recognition
Combination of multiple speech recognition systems is the most used method for improving speech recognition accuracy. The combination is performed at the feature level or the systems are exchanging informations between each other during decoding process or are combined afterwards using their outputs in form of N-best hypothesis or lattices. This paper provides initial experiments with system combination for Slovak language speech recognition using well known combination tool, the Recognition Output Voting Error Reduction (ROVER) from National Institute of Standards and Technology (NIST). Two kinds of scores provided to ROVER are here explored. The first one is based on normalized posterior probabilities and the second one on confidence scores of words in recognized sentence. Also new method for improving the efficiency of combination by smoothing the scores is presented.