2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)最新文献_第9页

Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition 两个扩展集成扬声器和说话环境建模鲁棒自动语音识别

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430087

Yu Tsao, Chin-Hui Lee

引用次数: 10

A fast-match approach for robust, faster than real-time speaker diarization 一种鲁棒的快速匹配方法，比实时扬声器拨号更快

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430196

Yan Huang, Oriol Vinyals, G. Friedland, Christian A. Müller, Nikki Mirghafori, Chuck Wooters

{"title":"A fast-match approach for robust, faster than real-time speaker diarization","authors":"Yan Huang, Oriol Vinyals, G. Friedland, Christian A. Müller, Nikki Mirghafori, Chuck Wooters","doi":"10.1109/ASRU.2007.4430196","DOIUrl":"https://doi.org/10.1109/ASRU.2007.4430196","url":null,"abstract":"During the past few years, speaker diarization has achieved satisfying accuracy in terms of speaker Diarization Error Rate (DER). The most successful approaches, based on agglomerative clustering, however, exhibit an inherent computational complexity which makes real-time processing, especially in combination with further processing steps, almost impossible. In this article we present a framework to speed up agglomerative clustering speaker diarization. The basic idea is to adopt a computationally cheap method to reduce the hypothesis space of the more expensive and accurate model selection via Bayesian Information Criterion (BIC). Two strategies based on the pitch-correlogram and the unscented-trans-form based approximation of KL-divergence are used independently as a fast-match approach to select the most likely clusters to merge. We performed the experiments using the existing ICSI speaker diarization system. The new system using KL-divergence fast-match strategy only performs 14% of total BIC comparisons needed in the baseline system, speeds up the system by 41% without affecting the speaker Diarization Error Rate (DER). The result is a robust and faster than real-time speaker diarization system.","PeriodicalId":371729,"journal":{"name":"2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)","volume":"26 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132352790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 45

Automatic lexical pronunciations generation and update 自动词汇发音生成和更新

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430113

Ghinwa F. Choueiter, S. Seneff, James R. Glass

引用次数: 5

Semantic translation error rate for evaluating translation systems 评价翻译系统的语义翻译错误率

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430144

Krishna Subramanian, D. Stallard, R. Prasad, S. Saleem, P. Natarajan

引用次数: 7

Towards robust automatic evaluation of pathologic telephone speech 病态电话语音的鲁棒自动评价

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430200

K. Riedhammer, G. Stemmer, T. Haderlein, M. Schuster, F. Rosanowski, E. Nöth, A. Maier

引用次数: 15

Phonological feature based variable frame rate scheme for improved speech recognition 基于语音特征的变帧率语音识别改进方案

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430177

A. Sangwan, J. Hansen

引用次数: 2

A language modeling approach to question answering on speech transcripts 语音答疑的语言建模方法

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430112

Matthias H. Heie, E. Whittaker, Josef R. Novak, S. Furui

引用次数: 2

Call classification for automated troubleshooting on large corpora 呼叫分类用于大型语料库的自动故障排除

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430110

Keelan Evanini, David Suendermann-Oeft, R. Pieraccini

引用次数: 19

Combining statistical models with symbolic grammar in parsing 将统计模型与符号语法相结合进行语法分析

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430140

Junichi Tsujii

引用次数: 0

Variational Kullback-Leibler divergence for Hidden Markov models 隐马尔可夫模型的变分Kullback-Leibler散度

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI: 10.1109/ASRU.2007.4430132

J. Hershey, P. Olsen, Steven J. Rennie

引用次数: 22