2016 IEEE Spoken Language Technology Workshop (SLT): Latest Publications

The MGB-2 challenge: Arabic multi-dialect broadcast media recognition
2016 IEEE Spoken Language Technology Workshop (SLT). Pub Date: 2016-09-19. DOI: 10.1109/SLT.2016.7846277
Ahmed M. Ali, P. Bell, James R. Glass, Yacine Messaoui, Hamdy Mubarak, S. Renals, Yifan Zhang
Abstract: This paper describes the Arabic Multi-Genre Broadcast (MGB-2) Challenge for SLT-2016. Unlike last year's English MGB Challenge, which focused on recognition of diverse TV genres, this year's challenge emphasises handling dialect diversity in Arabic speech. Audio data comes from 19 distinct programmes broadcast on the Aljazeera Arabic TV channel between March 2005 and December 2015. Programmes are split into three groups: conversations, interviews, and reports. A total of 1,200 hours have been released with lightly supervised transcriptions for acoustic modelling. For language modelling, over 110M words crawled from the Aljazeera Arabic website Aljazeera.net, covering the period 2000-2011, have been made available. Two lexicons have been provided, one phoneme-based and one grapheme-based. Finally, two tasks were proposed for this year's challenge: standard speech transcription and word alignment. This paper describes the task data and evaluation process used in the MGB challenge, and summarises the results obtained.
Citations: 93
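The transcription task is scored with word error rate. For reference, here is a minimal sketch of the standard WER computation via Levenshtein alignment; it is not the official MGB-2 scoring tooling, which also has to handle the lightly supervised reference segmentation.

```python
def wer(ref, hyp):
    """Word error rate via Levenshtein alignment of reference and hypothesis."""
    r, h = ref.split(), hyp.split()
    # d[i][j] = edit distance between first i reference and first j hypothesis words
    d = [[0] * (len(h) + 1) for _ in range(len(r) + 1)]
    for i in range(len(r) + 1):
        d[i][0] = i
    for j in range(len(h) + 1):
        d[0][j] = j
    for i in range(1, len(r) + 1):
        for j in range(1, len(h) + 1):
            sub = d[i - 1][j - 1] + (r[i - 1] != h[j - 1])        # substitution or match
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)  # deletion, insertion
    return d[len(r)][len(h)] / max(len(r), 1)

print(wer("the cat sat", "the cat sat down"))  # one insertion over 3 words -> 0.333...
```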
Speech enhancement using Long Short-Term Memory based recurrent Neural Networks for noise robust Speaker Verification
2016 IEEE Spoken Language Technology Workshop (SLT). Pub Date: 2016-09-16. DOI: 10.1109/SLT.2016.7846281
Morten Kolbæk, Z. Tan, J. Jensen
Abstract: In this paper we propose to use a state-of-the-art Deep Recurrent Neural Network (DRNN) based Speech Enhancement (SE) algorithm for noise robust Speaker Verification (SV). Specifically, we study the performance of an i-vector based SV system when tested in noisy conditions using a DRNN based SE front-end utilizing a Long Short-Term Memory (LSTM) architecture. We make comparisons to systems using a Non-negative Matrix Factorization (NMF) based front-end and a Short-Time Spectral Amplitude Minimum Mean Square Error (STSA-MMSE) based front-end, respectively. We show in simulation experiments that a male-speaker, text-independent DRNN based SE front-end, without specific a priori knowledge about the noise type, outperforms both a text-, noise-type- and speaker-dependent NMF based front-end and a STSA-MMSE based front-end in terms of Equal Error Rates for a large range of noise types and signal-to-noise ratios on the RSR2015 speech corpus.
Citations: 53
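To make the pipeline concrete, here is a minimal sketch of an LSTM-based enhancement front-end of the kind described: a recurrent network predicts a time-frequency mask from noisy magnitude spectra, and the masked spectra feed the downstream speaker-verification features. The layer sizes, sigmoid masking, and MSE loss are illustrative assumptions, not the authors' exact configuration.

```python
import torch
import torch.nn as nn

class LSTMEnhancer(nn.Module):
    """Predicts a [0, 1] time-frequency mask from noisy magnitude spectra."""
    def __init__(self, n_freq=257, hidden=256, layers=2):
        super().__init__()
        self.lstm = nn.LSTM(n_freq, hidden, num_layers=layers, batch_first=True)
        self.mask = nn.Linear(hidden, n_freq)

    def forward(self, noisy):                         # noisy: (batch, frames, n_freq)
        h, _ = self.lstm(noisy)
        return torch.sigmoid(self.mask(h)) * noisy    # masked (enhanced) spectra

# Training against clean spectra with an MSE objective (illustrative):
model = LSTMEnhancer()
noisy, clean = torch.rand(4, 100, 257), torch.rand(4, 100, 257)
loss = nn.functional.mse_loss(model(noisy), clean)
loss.backward()
```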
Approaches for language identification in mismatched environments
2016 IEEE Spoken Language Technology Workshop (SLT). Pub Date: 2016-09-08. DOI: 10.1109/SLT.2016.7846286
S. Nercessian, P. Torres-Carrasquillo, Gabriel Martinez-Montes
Abstract: In this paper, we consider the task of language identification in the context of mismatch conditions. Specifically, we address the issue of using unlabeled data in the domain of interest to improve the performance of a state-of-the-art system. The evaluation is performed on a 9-language set that includes data in both conversational telephone speech and narrowband broadcast speech. Multiple experiments are conducted to assess the performance of the system in this condition and to evaluate a number of alternatives for ameliorating the drop in performance. The best system evaluated is based on deep neural network (DNN) bottleneck features with i-vectors, utilizing a combination of all the approaches proposed in this work. The resulting system improved baseline DNN system performance by 30%.
Citations: 11
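A sketch of the bottleneck-feature idea underlying the best system: a DNN trained on a supervised objective exposes a narrow hidden layer whose activations become the inputs to the i-vector language-ID backend. All layer sizes here are illustrative assumptions.

```python
import torch
import torch.nn as nn

class BottleneckDNN(nn.Module):
    """DNN with a narrow hidden layer whose activations serve as features."""
    def __init__(self, n_in=80, hidden=1024, bottleneck=64, n_out=3000):
        super().__init__()
        self.pre = nn.Sequential(nn.Linear(n_in, hidden), nn.ReLU())
        self.bn = nn.Linear(hidden, bottleneck)        # the bottleneck layer
        self.post = nn.Sequential(nn.ReLU(), nn.Linear(bottleneck, hidden),
                                  nn.ReLU(), nn.Linear(hidden, n_out))

    def forward(self, x):                  # training path: predict supervised targets
        return self.post(self.bn(self.pre(x)))

    def features(self, x):                 # extraction path for the i-vector backend
        return self.bn(self.pre(x))

net = BottleneckDNN()
frames = torch.rand(32, 80)                # a batch of acoustic feature frames
bnf = net.features(frames)                 # (32, 64) bottleneck features
```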
Hierarchical attention model for improved machine comprehension of spoken content
2016 IEEE Spoken Language Technology Workshop (SLT). Pub Date: 2016-08-28. DOI: 10.1109/SLT.2016.7846270
Wei Fang, Juei-Yang Hsu, Hung-yi Lee, Lin-Shan Lee
Abstract: Multimedia or spoken content presents more attractive information than plain text content, but the former is more difficult to display on a screen and for a user to select. As a result, accessing large collections of spoken content is much more difficult and time-consuming for humans than accessing text. It is therefore highly attractive to develop machines which can automatically understand spoken content and summarize the key information for humans to browse. In this endeavor, a new task of machine comprehension of spoken content was proposed recently. The initial goal was defined as the listening comprehension test of TOEFL, a challenging academic English examination for English learners whose native languages are not English. An Attention-based Multi-hop Recurrent Neural Network (AMRNN) architecture was previously proposed for this task, which considered only the sequential relationship within the speech utterances. In this paper, we propose a new Hierarchical Attention Model (HAM), which constructs a multi-hop attention mechanism over tree-structured rather than sequential representations of the utterances. Improved comprehension performance, robust with respect to ASR errors, was obtained.
Citations: 13
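A minimal sketch of one attention hop over tree-structured representations, in the spirit of the HAM: the query attends over node vectors from a tree encoder rather than a flat word sequence, and the attended summary refines the query for the next hop. The shapes and the additive query update are assumptions for illustration.

```python
import torch

def attention_hop(query, nodes):
    """One hop: attend over tree-node vectors, then refine the query."""
    scores = nodes @ query                 # dot-product relevance, one score per node
    weights = torch.softmax(scores, dim=0)
    summary = weights @ nodes              # attention-weighted tree summary
    return query + summary                 # refined query for the next hop

query = torch.randn(128)                   # e.g. an encoded question
tree_nodes = torch.randn(20, 128)          # vectors for nodes of an utterance tree
for _ in range(3):                         # multi-hop refinement
    query = attention_hop(query, tree_nodes)
```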
Median-based generation of synthetic speech durations using a non-parametric approach
2016 IEEE Spoken Language Technology Workshop (SLT). Pub Date: 2016-08-22. DOI: 10.1109/SLT.2016.7846337
S. Ronanki, O. Watts, Simon King, G. Henter
Abstract: This paper proposes a new approach to duration modelling for statistical parametric speech synthesis, in which a recurrent statistical model is trained to output a phone transition probability at each timestep (acoustic frame). Unlike conventional approaches to duration modelling, which assume that duration distributions have a particular form (e.g., a Gaussian) and use the mean of that distribution for synthesis, our approach can in principle model any distribution supported on the non-negative integers. Generation from this model can be performed in many ways; here we consider output generation based on the median predicted duration. The median is more typical (more probable) than the conventional mean duration, is robust to training-data irregularities, and enables incremental generation. Furthermore, a frame-level approach to duration prediction is consistent with a longer-term goal of modelling durations and acoustic features together. Results indicate that the proposed method is competitive with baseline approaches in approximating the median duration of held-out natural speech.
Citations: 16
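The median selection described in the abstract can be computed directly from the per-frame transition probabilities: accumulate the implied duration CDF frame by frame and stop at the first frame where it reaches 0.5. A sketch under that reading follows; the variable names are ours, not the authors'.

```python
def median_duration(transition_probs):
    """transition_probs[t] = P(phone ends at frame t | it survived to frame t)."""
    survive, cdf = 1.0, 0.0
    for t, p in enumerate(transition_probs, start=1):
        cdf += survive * p            # mass of the phone ending exactly at frame t
        if cdf >= 0.5:
            return t                  # median duration in frames
        survive *= 1.0 - p
    return len(transition_probs)      # horizon reached before the CDF hit 0.5

print(median_duration([0.05, 0.1, 0.2, 0.4, 0.6]))  # -> 4
```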
Multi-lingual deep neural networks for language recognition
2016 IEEE Spoken Language Technology Workshop (SLT). Pub Date: 2016-08-08. DOI: 10.1109/SLT.2016.7846285
Luis Murphy Marcos, F. Richardson
Abstract: Multi-lingual feature extraction using bottleneck layers in deep neural networks (BN-DNNs) has proven to be an effective technique for low-resource speech recognition and, more recently, for language recognition. In this work we investigate the impact of the multi-lingual BN-DNN architecture and training configurations on language recognition performance for the NIST 2011 and 2015 language recognition evaluations (LRE11 and LRE15). The best performing multi-lingual BN-DNN configuration yields relative performance gains of 50% on LRE11 and 40% on LRE15 compared to a standard MFCC/SDC baseline system, and 17% on LRE11 and 7% on LRE15 relative to a single-language BN-DNN system. Detailed performance analysis using data from all 24 Babel languages, Fisher Spanish and Switchboard English shows the impact of language selection and the amount of training data on overall BN-DNN performance.
Citations: 4
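A sketch of the multi-lingual BN-DNN training layout: hidden and bottleneck layers are shared across languages, with one output layer per training language, so every language shapes the shared bottleneck representation. The two-language setup and the layer sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn

class MultiLingualBNDNN(nn.Module):
    """Shared trunk and bottleneck; one softmax head per training language."""
    def __init__(self, n_in=80, hidden=1024, bottleneck=64, targets=(3000, 2500)):
        super().__init__()
        self.shared = nn.Sequential(nn.Linear(n_in, hidden), nn.ReLU(),
                                    nn.Linear(hidden, bottleneck), nn.ReLU())
        self.heads = nn.ModuleList(nn.Linear(bottleneck, n) for n in targets)

    def forward(self, x, lang):
        return self.heads[lang](self.shared(x))   # language-specific target logits

net = MultiLingualBNDNN()
logits = net(torch.rand(16, 80), lang=0)           # a batch from language 0
```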
Sequence training and adaptation of highway deep neural networks
2016 IEEE Spoken Language Technology Workshop (SLT). Pub Date: 2016-07-07. DOI: 10.1109/SLT.2016.7846304
Liang Lu
Abstract: The highway deep neural network (HDNN) is a type of depth-gated feedforward neural network, which has been shown to be easier to train with more hidden layers and to generalise better than conventional plain deep neural networks (DNNs). Previously, we investigated a structured HDNN architecture for speech recognition, in which the two gate functions were tied across all the hidden layers, and we were able to train a much smaller model without sacrificing recognition accuracy. In this paper, we continue the study of this architecture with a sequence-discriminative training criterion and speaker adaptation techniques on the AMI meeting speech recognition corpus. We show that these two techniques improve speech recognition accuracy on top of the model trained with the cross-entropy criterion. Furthermore, we demonstrate that the two gate functions tied across all the hidden layers are able to control the information flow over the whole network, and that we can achieve considerable improvements by updating only these gate functions in both sequence training and adaptation experiments.
Citations: 6
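A sketch of the tied-gate structure the paper builds on: per-layer transforms, but a single transform gate and a single carry gate shared by all hidden layers. The sizes are illustrative; the point is that adaptation can then update only the two gate functions.

```python
import torch
import torch.nn as nn

class TiedGateHDNN(nn.Module):
    """Highway layers with the transform and carry gates tied across depth."""
    def __init__(self, dim=512, n_layers=6):
        super().__init__()
        self.transforms = nn.ModuleList(nn.Linear(dim, dim) for _ in range(n_layers))
        self.T = nn.Linear(dim, dim)      # transform gate, shared by all layers
        self.C = nn.Linear(dim, dim)      # carry gate, shared by all layers

    def forward(self, x):
        for transform in self.transforms:
            h = torch.relu(transform(x))
            x = torch.sigmoid(self.T(x)) * h + torch.sigmoid(self.C(x)) * x
        return x

net = TiedGateHDNN()
gate_params = list(net.T.parameters()) + list(net.C.parameters())
# Adaptation in the spirit of the paper would optimize only gate_params.
```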
DialPort: Connecting the spoken dialog research community to real user data
2016 IEEE Spoken Language Technology Workshop (SLT). Pub Date: 2016-06-08. DOI: 10.1109/SLT.2016.7846249
Tiancheng Zhao, Kyusong Lee, M. Eskénazi
Abstract: This paper describes a new spoken dialog portal that connects systems produced by the spoken dialog academic research community and gives them access to real users. We introduce a distributed, multi-modal, multi-agent prototype dialog framework that affords easy integration with various remote resources, ranging from end-to-end dialog systems to external knowledge APIs. The portal provides seamless passage from one spoken dialog system to another. To date, the DialPort portal has successfully connected to the multi-domain spoken dialog system at Cambridge University, the NOAA (National Oceanic and Atmospheric Administration) weather API and the Yelp API. We present statistics derived from log data gathered during preliminary tests of the portal, on the performance of the portal and on the quality (seamlessness) of transitions from one system to another.
Citations: 20
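For intuition, a toy sketch of the routing step a portal like this must perform: a master agent inspects each user turn and hands the session to whichever registered system claims the domain. The keyword matching below is a stand-in, not DialPort's actual routing logic.

```python
class DialogPortal:
    """Routes each user turn to whichever registered agent claims the domain."""
    def __init__(self):
        self.agents = {}                       # domain -> (keywords, handler)

    def register(self, domain, keywords, handler):
        self.agents[domain] = (set(keywords), handler)

    def route(self, turn):
        words = set(turn.lower().split())
        for domain, (keys, handler) in self.agents.items():
            if words & keys:                   # toy domain detection
                return handler(turn)           # hand the turn to that agent
        return "Which service would you like: weather, restaurants, ...?"

portal = DialogPortal()
portal.register("weather", {"weather", "rain", "forecast"},
                lambda t: "(weather agent reply, e.g. via the NOAA API)")
portal.register("restaurants", {"restaurant", "food", "eat"},
                lambda t: "(restaurant agent reply, e.g. via the Yelp API)")
print(portal.route("will it rain tomorrow"))
```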
Deep neural network driven mixture of PLDA for robust i-vector speaker verification
2016 IEEE Spoken Language Technology Workshop (SLT). DOI: 10.1109/SLT.2016.7846263
N. Li, M. Mak, Jen-Tzung Chien
Abstract: In speaker recognition, the mismatch between enrollment and test utterances due to noise at different signal-to-noise ratios (SNRs) is a great challenge. Based on the observation that noise-level variability causes the i-vectors to form heterogeneous clusters, this paper proposes using an SNR-aware deep neural network (DNN) to guide the training of PLDA mixture models. Specifically, given an i-vector, the SNR posterior probabilities produced by the DNN are used as the posteriors of the indicator variables of the mixture model. As a result, the proposed model provides a more reasonable soft division of the i-vector space compared to the conventional mixture of PLDA. During verification, given a test trial, the marginal likelihoods from the individual PLDA models are linearly combined, weighted by the posterior probabilities of the SNR levels computed by the DNN. Experimental results for SNR-mismatch tasks based on NIST 2012 SRE suggest that the proposed model is more effective than PLDA and the conventional mixture of PLDA for handling heterogeneous corpora.
Citations: 9
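The verification-time combination described in the abstract is a posterior-weighted sum of per-SNR-level PLDA scores. A sketch under that reading, where ToyPLDA and its likelihood_ratio method are a hypothetical stand-in for trained PLDA components:

```python
import numpy as np

class ToyPLDA:
    """Toy stand-in for a trained PLDA model; not a real PLDA likelihood."""
    def __init__(self, shift):
        self.shift = shift
    def likelihood_ratio(self, enroll, test):
        return float(enroll @ test) + self.shift

def mixture_plda_score(enroll, test, plda_models, snr_posteriors):
    """Posterior-weighted combination of per-SNR-level PLDA scores."""
    scores = np.array([m.likelihood_ratio(enroll, test) for m in plda_models])
    return float(np.dot(snr_posteriors, scores))

enroll, test = np.random.randn(400), np.random.randn(400)   # i-vectors
models = [ToyPLDA(0.0), ToyPLDA(0.5), ToyPLDA(1.0)]         # one per SNR level
posteriors = np.array([0.1, 0.7, 0.2])                      # from the SNR DNN
print(mixture_plda_score(enroll, test, models, posteriors))
```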
The fifth dialog state tracking challenge
2016 IEEE Spoken Language Technology Workshop (SLT). DOI: 10.1109/SLT.2016.7846311
Seokhwan Kim, L. F. D’Haro, Rafael E. Banchs, J. Williams, Matthew Henderson, Koichiro Yoshino
Abstract: Dialog state tracking, the process of updating the dialog state after each interaction with the user, is a key component of most dialog systems. Following a similar scheme to the fourth dialog state tracking challenge, this edition again focused on human-human dialogs, but introduced the task of cross-lingual adaptation of trackers. The challenge received a total of 32 entries from 9 research groups. In addition, several pilot track evaluations were also proposed, receiving a total of 16 entries from 4 groups. In both cases, the results show that most of the groups were able to outperform the provided baselines for each task.
Citations: 69
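For orientation, a toy rule-based tracker skeleton showing what is updated at each turn: a slot-value state derived from an ontology and the user's utterance. It mirrors the kind of simple baseline such challenges provide, not any participant's system; the ontology below is invented for illustration.

```python
def update_state(state, utterance, ontology):
    """state: dict slot -> value; ontology: dict slot -> set of candidate values."""
    words = utterance.lower()
    for slot, values in ontology.items():
        for v in values:
            if v.lower() in words:
                state[slot] = v          # most recent mention wins
    return state

ontology = {"area": {"Chinatown", "Orchard"}, "cuisine": {"seafood", "laksa"}}
state = {}
state = update_state(state, "I want seafood near Chinatown", ontology)
print(state)   # {'area': 'Chinatown', 'cuisine': 'seafood'}
```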