2013 IEEE Workshop on Automatic Speech Recognition and Understanding: Latest Publications

Search results based N-best hypothesis rescoring with maximum entropy classification
2013 IEEE Workshop on Automatic Speech Recognition and Understanding · Pub Date: 2013-12-01 · DOI: 10.1109/ASRU.2013.6707767
Fuchun Peng, Scott Roy, B. Shahshahani, F. Beaufays
Abstract: We propose a simple yet effective method for improving speech recognition by reranking the N-best speech recognition hypotheses using search results. We model N-best reranking as a binary classification problem and select the hypothesis with the highest classification confidence. We use query-specific features extracted from the search results to encode domain knowledge and combine them with a maximum entropy classifier to rescore the N-best list. We show that by rescoring even only the top two hypotheses, we obtain a significant 3% absolute sentence accuracy (SACC) improvement over a strong baseline on production traffic from an entertainment domain.
Citations: 20
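For readers who want a concrete picture of the reranking scheme above, here is a minimal sketch, assuming the "hypothesis is correct" classifier is realised as logistic regression (a standard form of maximum entropy classification); the feature names and training data are invented for illustration and are not the features used in the paper.

```python
# Illustrative sketch only: rerank N-best ASR hypotheses with a maximum-entropy
# (logistic regression) classifier over search-result features. Feature names
# and data are hypothetical; the paper's features come from its search backend.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Each row: query-specific features for one hypothesis, e.g.
# [ASR confidence, number of search results returned, top-result match score].
X_train = np.array([
    [0.92, 120, 0.8],   # hypothesis whose search results look good  -> correct
    [0.85,   3, 0.1],   # hypothesis with poor search results        -> incorrect
    [0.70,  95, 0.7],
    [0.88,   0, 0.0],
])
y_train = np.array([1, 0, 1, 0])  # 1 = hypothesis judged correct

# Binary logistic regression is one standard realisation of a maximum-entropy
# classifier.
clf = LogisticRegression().fit(X_train, y_train)

def rerank(nbest):
    """nbest: list of (hypothesis_text, feature_vector); return best hypothesis."""
    feats = np.array([f for _, f in nbest])
    confidence = clf.predict_proba(feats)[:, 1]   # P(correct | features)
    return nbest[int(np.argmax(confidence))][0]

print(rerank([("play frozen soundtrack",  [0.81, 150, 0.9]),
              ("play frozen sound check", [0.86,  12, 0.2])]))
```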
The IBM keyword search system for the DARPA RATS program
2013 IEEE Workshop on Automatic Speech Recognition and Understanding · Pub Date: 2013-12-01 · DOI: 10.1109/ASRU.2013.6707730
L. Mangu, H. Soltau, H. Kuo, G. Saon
Abstract: The paper describes a state-of-the-art keyword search (KWS) system in which significant improvements are obtained by using Convolutional Neural Network acoustic models, a two-step speech segmentation approach and a simplified ASR architecture optimized for KWS. The system described in this paper had the best performance in the 2013 DARPA RATS evaluation for both Levantine and Farsi.
Citations: 4
Learning filter banks within a deep neural network framework
2013 IEEE Workshop on Automatic Speech Recognition and Understanding · Pub Date: 2013-12-01 · DOI: 10.1109/ASRU.2013.6707746
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, B. Ramabhadran
Abstract: Mel-filter banks are commonly used in speech recognition, as they are motivated by theory related to speech production and perception. While features derived from mel-filter banks are quite popular, we argue that this filter bank is not really an appropriate choice, as it is not learned for the objective at hand, i.e. speech recognition. In this paper, we explore replacing the filter bank with a filter bank layer that is learned jointly with the rest of a deep neural network. Thus, the filter bank is learned to minimize cross-entropy, which is more closely tied to the speech recognition objective. On a 50-hour English Broadcast News task, we show that we can achieve a 5% relative improvement in word error rate (WER) using the filter bank learning approach, compared to having a fixed set of filters.
Citations: 170
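A minimal sketch of the core idea, a filter bank layer trained jointly with the network under cross-entropy; the layer sizes, non-negativity handling and dummy data below are assumptions for illustration, not the paper's configuration.

```python
# Sketch: a learnable (non-negative) filter bank applied to the power spectrum,
# followed by log compression and a small classifier, trained with cross-entropy.
import torch
import torch.nn as nn

class LearnableFilterbankNet(nn.Module):
    def __init__(self, n_fft_bins=257, n_filters=40, n_states=500):
        super().__init__()
        # Filter bank weights; kept non-negative by clamping in forward().
        self.fbank = nn.Parameter(torch.rand(n_fft_bins, n_filters) * 0.01)
        self.dnn = nn.Sequential(
            nn.Linear(n_filters, 256), nn.Sigmoid(),
            nn.Linear(256, n_states),
        )

    def forward(self, power_spec):              # (batch, n_fft_bins)
        weights = self.fbank.clamp(min=0.0)     # non-negative filters
        energies = power_spec @ weights         # filter bank energies
        features = torch.log(energies + 1e-6)   # log compression, as with mel features
        return self.dnn(features)               # HMM-state logits

model = LearnableFilterbankNet()
optim = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

# Dummy batch: random power spectra and state targets, for illustration only.
spec = torch.rand(8, 257)
target = torch.randint(0, 500, (8,))
loss = loss_fn(model(spec), target)
loss.backward()     # gradients flow into the filter bank as well as the DNN
optim.step()
```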
Learning a subword vocabulary based on unigram likelihood
2013 IEEE Workshop on Automatic Speech Recognition and Understanding · Pub Date: 2013-12-01 · DOI: 10.1109/ASRU.2013.6707697
Matti Varjokallio, M. Kurimo, Sami Virpioja
Abstract: Using words as vocabulary units for tasks like speech recognition is infeasible for many morphologically rich languages, including Finnish. Thus, subword units are commonly used for language modeling. This work presents a novel algorithm for creating a subword vocabulary, based on the unigram likelihood of a text corpus. The method is evaluated with an entropy measure and a Finnish LVCSR task. Unigram entropy of the text corpus is shown to be a good indicator of the quality of higher-order n-gram models, also resulting in high speech recognition accuracy.
Citations: 19
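A small sketch of the underlying criterion, assuming a toy corpus and toy unit counts: score a candidate subword vocabulary by the unigram likelihood of the corpus under its best segmentation, and drop the multi-character unit whose removal costs the least likelihood. This only illustrates the objective, not the authors' algorithm.

```python
# Sketch: unigram likelihood of a corpus under Viterbi segmentation, used to
# compare candidate subword vocabularies. Corpus and counts are toy examples.
import math
from collections import Counter

def viterbi_loglik(word, logprob):
    """Best unigram log-likelihood of segmenting `word` with the given units."""
    best = [0.0] + [-math.inf] * len(word)
    for end in range(1, len(word) + 1):
        for start in range(end):
            piece = word[start:end]
            if piece in logprob and best[start] + logprob[piece] > best[end]:
                best[end] = best[start] + logprob[piece]
    return best[-1]

def corpus_loglik(corpus, vocab_counts):
    total = sum(vocab_counts.values())
    logprob = {u: math.log(c / total) for u, c in vocab_counts.items()}
    return sum(viterbi_loglik(w, logprob) for w in corpus)

corpus = ["talossa", "talo", "talossakin"]        # toy Finnish-like word list
vocab = Counter({"t": 5, "a": 8, "l": 4, "o": 5, "s": 4, "k": 1, "i": 1, "n": 1,
                 "talo": 3, "ssa": 2, "kin": 1})

# Greedily identify the multi-character unit that is cheapest to prune.
base = corpus_loglik(corpus, vocab)
losses = {}
for unit in [u for u in vocab if len(u) > 1]:
    reduced = Counter({u: c for u, c in vocab.items() if u != unit})
    losses[unit] = base - corpus_loglik(corpus, reduced)
print("cheapest unit to prune:", min(losses, key=losses.get))
```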
Acoustic unit discovery and pronunciation generation from a grapheme-based lexicon
2013 IEEE Workshop on Automatic Speech Recognition and Understanding · Pub Date: 2013-12-01 · DOI: 10.1109/ASRU.2013.6707760
William Hartmann, A. Roy, L. Lamel, J. Gauvain
Abstract: We present a framework for discovering acoustic units and generating an associated pronunciation lexicon from an initial grapheme-based recognition system. Our approach consists of two distinct contributions. First, context-dependent grapheme models are clustered using a spectral clustering approach to create a set of phone-like acoustic units. Next, we transform the pronunciation lexicon using a statistical machine translation-based approach. Pronunciation hypotheses generated from a decoding of the training set are used to create a phrase-based translation table. We propose a novel method for scoring the phrase-based rules that significantly improves the output of the transformation process. Results on an English language dataset demonstrate that the combined methods provide a 13% relative reduction in word error rate compared to a baseline grapheme-based system. Our approach could potentially be applied to low-resource languages without existing lexicons, such as in the Babel project.
Citations: 19
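To illustrate just the clustering step, the sketch below runs spectral clustering on a precomputed similarity matrix between context-dependent grapheme models; the random affinity matrix and cluster count are placeholders, not the paper's setup.

```python
# Sketch: group context-dependent grapheme models into phone-like acoustic units
# via spectral clustering of a precomputed model-similarity (affinity) matrix.
import numpy as np
from sklearn.cluster import SpectralClustering

rng = np.random.default_rng(0)
n_grapheme_models = 60

# Symmetric non-negative affinity matrix; in practice this would come from a
# distance between the grapheme models' acoustic distributions.
A = rng.random((n_grapheme_models, n_grapheme_models))
affinity = (A + A.T) / 2
np.fill_diagonal(affinity, 1.0)

labels = SpectralClustering(
    n_clusters=20,            # target number of phone-like acoustic units
    affinity="precomputed",
    assign_labels="kmeans",
    random_state=0,
).fit_predict(affinity)

for unit_id in range(3):
    members = np.flatnonzero(labels == unit_id)
    print(f"acoustic unit {unit_id}: grapheme models {members.tolist()}")
```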
The second ‘CHiME’ speech separation and recognition challenge: An overview of challenge systems and outcomes
2013 IEEE Workshop on Automatic Speech Recognition and Understanding · Pub Date: 2013-12-01 · DOI: 10.1109/ASRU.2013.6707723
Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, Marco Matassoni
Abstract: Distant-microphone automatic speech recognition (ASR) remains a challenging goal in everyday environments involving multiple background sources and reverberation. This paper reports on the results of the 2nd ‘CHiME’ Challenge, an initiative designed to analyse and evaluate the performance of ASR systems in a real-world domestic environment. We discuss the rationale for the challenge and provide a summary of the datasets, tasks and baseline systems. The paper overviews the systems that were entered for the two challenge tracks: small-vocabulary with moving talker and medium-vocabulary with stationary talker. We present a summary of the challenge findings, including novel results produced by challenge system combination. Possible directions for future challenges are discussed.
Citations: 94
Language style and domain adaptation for cross-language SLU porting
2013 IEEE Workshop on Automatic Speech Recognition and Understanding · Pub Date: 2013-12-01 · DOI: 10.1109/ASRU.2013.6707720
Evgeny A. Stepanov, Ilya Kashkarev, Ali Orkan Bayer, G. Riccardi, Arindam Ghosh
Abstract: Automatic cross-language Spoken Language Understanding (SLU) porting is plagued by two limitations. First, SLU models are usually trained on limited-domain corpora. Second, language-pair resources (e.g. aligned corpora) are scarce or unmatched in style (e.g. news vs. conversation). We present experiments on automatic style adaptation of the input to the translation systems and of their output for SLU. We approach the problem of scarce aligned data by adapting the available parallel data to the target domain using limited in-domain and larger web-crawled close-to-domain corpora. SLU performance is optimized by reranking its output with a Recurrent Neural Network-based joint language model. We evaluate end-to-end SLU porting on close and distant language pairs, Spanish-Italian and Turkish-Italian, and achieve significant improvements in both translation quality and SLU performance.
Citations: 16
A propagation approach to modelling the joint distributions of clean and corrupted speech in the Mel-Cepstral domain
2013 IEEE Workshop on Automatic Speech Recognition and Understanding · Pub Date: 2013-12-01 · DOI: 10.1109/ASRU.2013.6707726
Ramón Fernández Astudillo
Abstract: This paper presents a closed-form solution relating the joint distributions of corrupted and clean speech in the short-time Fourier Transform (STFT) and Mel-Frequency Cepstral Coefficient (MFCC) domains. This makes possible a tighter integration of STFT-domain speech enhancement with feature- and model-compensation techniques for robust automatic speech recognition. The approach directly utilizes the conventional speech distortion model for STFT speech enhancement, allowing for low-cost, single-pass, causal implementations. Compared to similar uncertainty propagation approaches, it provides the full joint distribution, rather than just the posterior distribution, which offers additional model-compensation possibilities. The method is exemplified by deriving an MMSE-MFCC estimator from the propagated joint distribution. It is shown that performance similar to that of STFT uncertainty propagation (STFT-UP) can be obtained on AURORA4, while deriving the full joint distribution.
Citations: 2
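As a hedged reminder of the quantities involved (the notation below is assumed for illustration, not taken from the paper): the MFCC vector is a deterministic function of the clean STFT frame, and an MMSE estimator in the cepstral domain is the expectation of that function under the posterior propagated from the corrupted observation.

```latex
% Assumed notation: X = clean STFT frame, Y = corrupted STFT frame,
% M = mel filter bank matrix, D = DCT matrix.
\begin{aligned}
  \mathbf{c}(X) &= D \log\!\bigl(M\,|X|^{2}\bigr)
     && \text{(MFCC as a deterministic function of the clean STFT)}\\
  \hat{\mathbf{c}}_{\mathrm{MMSE}} &= \mathrm{E}\bigl[\mathbf{c}(X) \mid Y\bigr]
     = \int \mathbf{c}(x)\, p(x \mid Y)\, dx
     && \text{(MMSE-MFCC estimate under the propagated posterior)}
\end{aligned}
```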
Multi-stream temporally varying weight regression for cross-lingual speech recognition
2013 IEEE Workshop on Automatic Speech Recognition and Understanding · Pub Date: 2013-12-01 · DOI: 10.1109/ASRU.2013.6707769
Shilin Liu, K. Sim
Abstract: Building a good Automatic Speech Recognition (ASR) system with limited resources is a very challenging task due to the many existing speech variations. Multilingual and cross-lingual speech recognition techniques are commonly used for this task. This paper investigates the recently proposed Temporally Varying Weight Regression (TVWR) method for cross-lingual speech recognition. TVWR uses posterior features to implicitly model the long-term temporal structures in acoustic patterns. By leveraging well-trained foreign recognizers, high-quality monophone/state posteriors can be easily incorporated into TVWR to boost ASR performance on low-resource languages. Furthermore, multi-stream TVWR is proposed, where multiple sets of posterior features are used to incorporate richer (temporal and spatial) context information. Finally, a separate state-tying for the TVWR regression parameters is used to better utilize the more reliable posterior features. Experimental results are evaluated for English and Malay speech recognition with limited resources. By using the Czech, Hungarian and Russian posterior features, TVWR was found to consistently outperform the tandem systems trained on the same features.
Citations: 3
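For orientation, one generic way to write a time-varying mixture-weight model of this kind is sketched below; h_t denotes the frame's posterior-feature vector from the foreign recognizer, and this parameterisation is an illustrative assumption rather than the paper's exact formulation.

```latex
% Illustrative form only: the static mixture weights c_{jm} of state j are
% modulated per frame by a regression on the posterior features h_t and
% renormalised; the paper's exact regression may differ.
p(\mathbf{o}_t \mid j)
  = \sum_{m} c_{jm}(t)\,
    \mathcal{N}\!\bigl(\mathbf{o}_t ; \boldsymbol{\mu}_{jm}, \boldsymbol{\Sigma}_{jm}\bigr),
\qquad
c_{jm}(t)
  = \frac{c_{jm}\,\mathbf{w}_{jm}^{\top}\mathbf{h}_t}
         {\sum_{m'} c_{jm'}\,\mathbf{w}_{jm'}^{\top}\mathbf{h}_t}
```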
Cross-lingual context sharing and parameter-tying for multi-lingual speech recognition
2013 IEEE Workshop on Automatic Speech Recognition and Understanding · Pub Date: 2013-12-01 · DOI: 10.1109/ASRU.2013.6707717
Aanchan Mohan, R. Rose
Abstract: This paper is concerned with the problem of building acoustic models for automatic speech recognition (ASR) using speech data from multiple languages. Techniques for multi-lingual ASR are developed in the context of the subspace Gaussian mixture model (SGMM) [2, 3]. Multi-lingual SGMM-based ASR systems have been configured with shared subspace parameters trained from multiple languages but with distinct language-dependent phonetic contexts and states [11, 12]. First, an approach for sharing state-level target-language and foreign-language SGMM parameters is described. Second, semi-tied covariance transformations are applied as an alternative to full-covariance Gaussians to make acoustic model training less sensitive to issues of insufficient training data. These techniques are applied to Hindi and Marathi language data obtained for an agricultural commodities dialog task in multiple Indian languages.
Citations: 2
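For context, the basic SGMM state likelihood (summarised here from the general SGMM literature rather than from this paper) shows which parameters are globally shared and which remain state- and language-specific:

```latex
% Basic SGMM state likelihood (without substates). The subspace parameters
% M_i, w_i and Sigma_i are globally shared and can be trained on pooled
% multilingual data; the state vectors v_j remain language dependent.
p(\mathbf{x} \mid j)
  = \sum_{i=1}^{I} w_{ji}\,
    \mathcal{N}\!\bigl(\mathbf{x} ; \boldsymbol{\mu}_{ji}, \boldsymbol{\Sigma}_i\bigr),
\qquad
\boldsymbol{\mu}_{ji} = \mathbf{M}_i \mathbf{v}_j,
\qquad
w_{ji} = \frac{\exp\bigl(\mathbf{w}_i^{\top}\mathbf{v}_j\bigr)}
              {\sum_{i'=1}^{I}\exp\bigl(\mathbf{w}_{i'}^{\top}\mathbf{v}_j\bigr)}
```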