Latest papers from the 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)

Soundbite identification using reference and automatic transcripts of broadcast news speech
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430189
F. Liu, Yang Liu
Abstract: Soundbite identification in broadcast news is important for locating information useful for question answering, mining the opinions of a particular person, and enriching speech recognition output with quotation marks. This paper presents a systematic study of the problem under a classification framework, covering problem formulation, feature extraction, and the effect of using automatic speech recognition (ASR) output and automatic sentence boundary detection. Experiments on a Mandarin broadcast news speech corpus show that the three-way classification framework outperforms binary classification, and that the entropy-based feature weighting method generally performs better than the others. Using ASR output degrades system performance, with more degradation coming from automatic sentence segmentation than from speech recognition errors, especially on the recall rate.
Citations: 8
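The entropy-based feature weighting the abstract above mentions is not spelled out there; a common formulation (an assumption here, not necessarily the authors' exact scheme) weights a feature by one minus the normalized entropy of its distribution across the classes, so class-discriminative cues score high. A minimal Python sketch:

```python
import math

def entropy_weight(counts):
    """Weight a feature by 1 - normalized entropy of its count
    distribution across classes: features concentrated in one class
    score near 1, uniformly spread features score near 0."""
    total = sum(counts)
    if total == 0 or len(counts) < 2:
        return 0.0
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    max_entropy = math.log(len(counts))
    return 1.0 - entropy / max_entropy

# A cue word appearing only in "soundbite" sentences gets weight 1.0:
print(entropy_weight([12, 0, 0]))  # → 1.0
# One spread evenly across three classes gets weight 0.0
# (up to floating-point rounding):
print(entropy_weight([4, 4, 4]))
```

The counts here are hypothetical per-class occurrence counts of a candidate feature.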
Robust speech recognition with on-line unsupervised acoustic feature compensation
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430092
L. Buera, A. Miguel, EDUARDO LLEIDA SOLANO, Oscar Saz-Torralba, A. Ortega
Abstract: An on-line unsupervised hybrid compensation technique is proposed to reduce the mismatch between training and testing conditions. It combines multi-environment model-based linear normalization with a cross-probability model based on GMMs (MEMLIN CPM) and a novel acoustic model adaptation method based on rotation transformations. A set of rotation transformations is estimated from clean and MEMLIN CPM-normalized training data by linear regression in an unsupervised process. In testing, each MEMLIN CPM-normalized frame is decoded using a modified Viterbi algorithm and expanded acoustic models, which are obtained from the reference models and the set of rotation transformations. Experiments were carried out on the Spanish SpeechDat Car database: MEMLIN CPM over standard ETSI front-end parameters achieves an average WER improvement of 83.89%, while the proposed hybrid solution reaches 92.07%. The hybrid technique was also tested on the Aurora 2 database, obtaining an average improvement of 68.88% with clean training.
Citations: 6
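The WER figures in the abstract above are relative improvements. As a quick aid to reading them (the baseline and system numbers below are hypothetical, not from the paper), relative WER improvement is the fraction of the baseline's errors that the compensated system removes:

```python
def relative_wer_improvement(baseline_wer, system_wer):
    """Relative WER reduction, as commonly reported: the percentage of
    the baseline's errors that the system removes."""
    return 100.0 * (baseline_wer - system_wer) / baseline_wer

# Hypothetical numbers: a baseline at 25.0% WER reduced to 2.0% WER is
# a 92.0% relative improvement, the scale on which the paper's 83.89%
# and 92.07% figures are reported.
print(relative_wer_improvement(25.0, 2.0))  # → 92.0
```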
A multi-layer architecture for semi-synchronous event-driven dialogue management
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430165
Antoine Raux, M. Eskénazi
Abstract: We present a new architecture for spoken dialogue systems that explicitly separates the discrete, abstract representation used by the high-level dialogue manager from the continuous, real-time nature of real-world events. We propose the concept of the conversational floor as a means of synchronizing the internal state of the dialogue manager with the real world. As the interface between these two layers, we introduce a new component called the Interaction Manager. The proposed architecture was implemented as a new version of the Olympus framework, which can be used across different domains and modalities. We confirmed the practicality of the approach by porting Let's Go, an existing deployed dialogue system, to the new architecture.
Citations: 49
Implicit user-adaptive system engagement in speech, pen and multimodal interfaces
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430162
S. Oviatt
Abstract: The present research contributes new empirical research, theory, and prototyping toward developing implicit user-adaptive techniques for system engagement based exclusively on speech amplitude and pen pressure. The results reveal that people will spontaneously adapt their communicative energy level reliably, substantially, and in different modalities to designate and repair an intended interlocutor in a computer-mediated group setting. Furthermore, this behavior alone can be harnessed to achieve system engagement accuracies in the 75-86% range. In short, there was a high level of correct system engagement based exclusively on implicit cues in users' energy level during communication.
Citations: 2
Analytical comparison between position specific posterior lattices and confusion networks based on words and subword units for spoken document indexing
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430193
Yi-Cheng Pan, Hung-lin Chang, Lin-Shan Lee
Abstract: In this paper we analytically compare two widely accepted approaches to spoken document indexing, position-specific posterior lattices (PSPL) and confusion networks (CN), in terms of retrieval accuracy and index size. The fundamental distinctions between the two approaches (construction units, posterior probabilities, number of clusters, indexing coverage, and space requirements) are discussed in detail. A new approach to approximating subword posterior probabilities in a word lattice is also incorporated into PSPL/CN to handle OOV and rare-word problems, which were unaddressed in the original PSPL and CN approaches. Extensive experimental results on Chinese broadcast news segments indicate that PSPL offers higher accuracy than CN but requires much larger disk space, while subword-based PSPL is very attractive because it lowers storage cost while offering even higher accuracy.
Citations: 26
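As a rough illustration of the PSPL idea described above (a toy sketch under simplifying assumptions, not the paper's implementation): each word keeps (document, position, posterior) postings, and a query phrase matches a document only where its words occur at consecutive positions, with the match scored by the product of posteriors. The class name and toy data are invented:

```python
from collections import defaultdict

class PSPLIndex:
    """Toy position-specific posterior index: word -> list of
    (doc_id, position, posterior) postings."""
    def __init__(self):
        self.postings = defaultdict(list)

    def add(self, doc, pos, word, posterior):
        self.postings[word].append((doc, pos, posterior))

    def phrase_score(self, doc, words):
        """Best product of posteriors for the query words at
        consecutive positions in `doc`; 0.0 if no such match exists."""
        best = 0.0
        for d, start, p in self.postings[words[0]]:
            if d != doc:
                continue
            score, pos = p, start
            for w in words[1:]:
                # posteriors of w at the immediately following position
                nxt = [q for (dd, pp, q) in self.postings[w]
                       if dd == doc and pp == pos + 1]
                if not nxt:
                    score = 0.0
                    break
                score *= max(nxt)
                pos += 1
            best = max(best, score)
        return best

idx = PSPLIndex()
idx.add("doc1", 0, "speech", 0.9)
idx.add("doc1", 1, "recognition", 0.8)
print(idx.phrase_score("doc1", ["speech", "recognition"]))  # ≈ 0.72
```

Reversing the query order yields 0.0, since no consecutive match exists; this position sensitivity is what distinguishes PSPL-style indexes from a bag-of-words index.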
Type-II dialogue systems for information access from unstructured knowledge sources
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430170
Yi-Cheng Pan, Lin-Shan Lee
Abstract: In this paper, we present a new formulation and framework for a new type of dialogue system, referred to here as type-II dialogue systems. Their distinct feature is the task of information access from unstructured knowledge sources, that is, the lack of a well-organized back-end database offering the information the user needs. Typical example tasks for this type of system include retrieval, browsing, and question answering. Mainstream dialogue systems with a well-organized back-end database are referred to as type-I dialogue systems. The functionalities of each module in type-II dialogue systems are analyzed, presented, and compared with the corresponding modules in type-I systems. A preliminary type-II dialogue system recently developed at National Taiwan University is presented at the end as a typical example.
Citations: 8
Efficient combination of parametric spaces, models and metrics for speaker diarization
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430120
Themos Stafylakis, V. Katsouros, G. Carayannis
Abstract: In this paper we present a method for combining several acoustic parametric spaces, statistical models, and distance metrics in the speaker diarization task. Focusing on the post-segmentation part of the problem, we adopt an incremental feature selection and fusion algorithm, based on the Maximum Entropy Principle and the Iterative Scaling Algorithm, that combines several statistical distance measures on speech-chunk pairs. This approach places the merging-of-chunks clustering process in a probabilistic framework. We also propose a decomposition of the input space according to gender, recording conditions, and chunk lengths. The algorithm produced highly competitive results compared with state-of-the-art GMM-UBM methods.
Citations: 0
A Mandarin lecture speech transcription system for speech summarization
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430157
R. Chan, J. Zhang, Pascale Fung, Lu Cao
Abstract: This paper introduces our work on Mandarin lecture speech transcription. In particular, we present our work on a small database containing only 16 hours of audio data and 0.16M words of text data. A range of experiments was carried out to improve the performance of the acoustic and language models, including adapting lecture speech data to reading speech data for acoustic modeling, and using lecture conference papers, presentation slides, and similar-domain web data for language modeling. We also study the effects of automatic segmentation, unsupervised acoustic model adaptation, and language model adaptation in our recognition system. Using a 3×RT multiple-pass decoding strategy, our final system obtains 70.3% accuracy. Finally, we apply our speech transcription system to an SVM summarizer and obtain a ROUGE-L F-measure of 66.5%.
Citations: 5
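ROUGE-L, the summarization metric quoted above, scores a candidate summary by the longest common subsequence (LCS) it shares with a reference summary, combining LCS-based recall and precision into an F-measure. A self-contained sketch (the example sentences are invented, not from the paper's data):

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of token lists a, b."""
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a, 1):
        for j, y in enumerate(b, 1):
            dp[i][j] = dp[i-1][j-1] + 1 if x == y else max(dp[i-1][j], dp[i][j-1])
    return dp[len(a)][len(b)]

def rouge_l_f(candidate, reference, beta=1.0):
    """ROUGE-L F-measure: weighted harmonic mean of LCS-based recall
    (against the reference) and precision (against the candidate)."""
    lcs = lcs_length(candidate, reference)
    if lcs == 0:
        return 0.0
    recall = lcs / len(reference)
    precision = lcs / len(candidate)
    return (1 + beta**2) * recall * precision / (recall + beta**2 * precision)

cand = "the lecture covers speech summarization".split()
ref = "the lecture is about speech summarization".split()
print(round(rouge_l_f(cand, ref), 3))  # → 0.727
```

Here the LCS is "the lecture speech summarization" (4 tokens), giving recall 4/6 and precision 4/5.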
The GALE project: A description and an update
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430115
Jordan Cohen
Abstract: Summary form only given. The GALE (Global Autonomous Language Exploitation) program is a DARPA program to develop and apply computer software technologies to absorb, translate, analyze, and interpret huge volumes of speech and text in multiple languages. The program has been active for two years, and the GALE contractors have been engaged in developing highly robust speech recognition, machine translation, and information delivery systems in Chinese and Arabic. Several GALE-developed talks will be given in this workshop. This overview talk reviews the program goals, the technical highlights, and the technical issues remaining in the GALE project.
Citations: 18
Spoken document summarization using relevant information
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430107
Yi-Ting Chen, Shih-Hsiang Lin, H. Wang, Berlin Chen
Abstract: Extractive summarization usually selects indicative sentences from a document automatically according to a target summarization ratio, and then sequences them to form a summary. In this paper, we investigate using information from relevant documents, retrieved from a contemporary text collection for each sentence of the spoken document to be summarized, in a probabilistic generative framework for extractive spoken document summarization. In the proposed methods, the probability of a document being generated by a sentence is modeled by a hidden Markov model (HMM), while the retrieved relevant text documents are used to estimate the HMM's parameters and the sentence's prior probability. Experiments on Chinese broadcast news compiled in Taiwan show that the new methods outperform the previous HMM approach.
Citations: 3
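The generative idea in the abstract above can be illustrated with a deliberately simplified two-component mixture in place of the paper's full HMM: each document word is generated either by the candidate sentence's unigram model or by a background collection model, and sentences are ranked by how well they "generate" the document. This is a hypothetical sketch; the function name, toy document, and background model are invented:

```python
import math
from collections import Counter

def doc_generation_log_prob(sentence, document, background, lam=0.5):
    """Simplified generative score in the spirit of the HMM approach:
    each document word is drawn from the sentence's unigram model with
    weight `lam`, or from a background model with weight 1 - lam."""
    sent_counts = Counter(sentence)
    sent_len = len(sentence)
    logp = 0.0
    for w in document:
        p_sent = sent_counts[w] / sent_len if sent_len else 0.0
        p_bg = background.get(w, 1e-6)  # floor for unseen words
        logp += math.log(lam * p_sent + (1 - lam) * p_bg)
    return logp

# Toy example: rank two candidate sentences by how well each generates
# the document; the higher-scoring one is the better extract.
doc = "the hmm generates the document words".split()
bg = {w: 0.01 for w in doc}
s1 = "hmm generates document".split()
s2 = "something else entirely".split()
assert doc_generation_log_prob(s1, doc, bg) > doc_generation_log_prob(s2, doc, bg)
print("sentence 1 ranks higher")
```

In the paper itself the mixture weights and sentence priors are estimated from the retrieved relevant documents; here they are fixed constants for illustration.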