Workshop on Spoken Language Technologies for Under-resourced Languages: Latest Publications

Visually Grounded Cross-Lingual Keyword Spotting in Speech
Workshop on Spoken Language Technologies for Under-resourced Languages. Pub Date: 2018-08-29. DOI: 10.21437/SLTU.2018-53
H. Kamper, Michael Roth
Abstract: Recent work considered how images paired with speech can be used as supervision for building speech systems when transcriptions are not available. We ask whether visual grounding can be used for cross-lingual keyword spotting: given a text keyword in one language, the task is to retrieve spoken utterances containing that keyword in another language. This could enable searching through speech in a low-resource language using text queries in a high-resource language. As a proof of concept, we use English speech with German queries: we use a German visual tagger to add keyword labels to each training image, and then train a neural network to map English speech to German keywords. Without seeing parallel speech transcriptions or translations, the model achieves a precision at ten of 58%. We show that most erroneous retrievals contain equivalent or semantically relevant keywords; excluding these would improve P@10 to 91%.
Citations: 3
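As an illustration of the retrieval setup described in the abstract, the sketch below is an assumed outline rather than the authors' implementation: a small PyTorch encoder maps log-mel features of an English utterance to sigmoid scores over a German keyword vocabulary (the labels the abstract obtains from a visual tagger), and a helper computes the precision-at-ten figure quoted above. All names and sizes (N_KEYWORDS, N_MEL, SpeechEncoder, precision_at_10) are illustrative assumptions.

```python
# Minimal sketch (not the authors' code) of the cross-lingual keyword-spotting
# idea: map speech features to scores over German keyword labels from a visual
# tagger, then rank utterances per query keyword.
import torch
import torch.nn as nn

N_KEYWORDS = 1000          # size of the German keyword vocabulary (assumed)
N_MEL = 40                 # log-mel filterbank dimension (assumed)

class SpeechEncoder(nn.Module):
    """Maps a batch of log-mel features to per-keyword detection scores."""
    def __init__(self):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(N_MEL, 64, kernel_size=9, padding=4), nn.ReLU(),
            nn.Conv1d(64, 128, kernel_size=9, padding=4), nn.ReLU(),
        )
        self.out = nn.Linear(128, N_KEYWORDS)

    def forward(self, feats):                      # feats: (batch, N_MEL, frames)
        h = self.conv(feats).max(dim=2).values     # max-pool over time
        return torch.sigmoid(self.out(h))          # (batch, N_KEYWORDS)

def precision_at_10(scores, relevant):
    """scores: (n_utts,) per-utterance score for one query keyword;
    relevant: boolean tensor marking utterances that truly contain it."""
    top = scores.topk(10).indices
    return relevant[top].float().mean().item()

# Training would minimise binary cross-entropy between the sigmoid outputs and
# the visual tagger's keyword labels, e.g.:
# loss = nn.functional.binary_cross_entropy(model(feats), visual_labels)
```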
Prosodic Analysis of Non-Native South Indian English Speech
Workshop on Spoken Language Technologies for Under-resourced Languages. Pub Date: 2018-08-29. DOI: 10.21437/SLTU.2018-15
Radha Krishna Guntur, R. Krishnan, V. K. Mittal
Abstract: Investigations of linguistic prosody in non-native English speech by South Indian speakers were carried out using a database collected specifically for this study. Prosodic differences between native and non-native speech samples from three regional language groups (Kannada, Tamil, and Telugu) were evaluated and compared; this information is useful in applications such as native language identification. The mean pitch and the overall variation of the pitch contour are higher in non-native English speech for all three speaker groups, indicating accommodation of speaking manner. The dynamic variation of pitch is lowest in English speech by native Kannada speakers: the increase in the standard deviation of the pitch contour for their non-native English is only about 3.7% on average, compared with 9.5% for Tamil and 27% for Telugu native speakers.
Citations: 4
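The pitch statistics reported above (mean F0, the standard deviation of the pitch contour, and its percentage increase from native to non-native speech) can be reproduced in outline as follows. This is a minimal sketch assuming librosa's pYIN pitch tracker and placeholder file names, not the study's own pipeline.

```python
# Minimal sketch of the reported pitch statistics: mean F0, pitch-contour
# standard deviation, and its native-to-non-native percentage change.
import librosa
import numpy as np

def pitch_stats(wav_path):
    y, sr = librosa.load(wav_path, sr=None)
    f0, voiced, _ = librosa.pyin(y, fmin=librosa.note_to_hz("C2"),
                                 fmax=librosa.note_to_hz("C7"), sr=sr)
    f0 = f0[~np.isnan(f0)]                 # keep voiced frames only
    return float(np.mean(f0)), float(np.std(f0))

# file names are placeholders for native-language and non-native English recordings
native_mean, native_std = pitch_stats("kannada_native.wav")
l2_mean, l2_std = pitch_stats("kannada_english.wav")

# e.g. the ~3.7% figure quoted for Kannada speakers corresponds to:
std_increase = 100.0 * (l2_std - native_std) / native_std
print(f"mean F0: {native_mean:.1f} -> {l2_mean:.1f} Hz, "
      f"pitch-contour std change: {std_increase:+.1f}%")
```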
Post-Processing Using Speech Enhancement Techniques for Unit Selection and Hidden Markov Model Based Low Resource Language Marathi Text-to-Speech System
Workshop on Spoken Language Technologies for Under-resourced Languages. Pub Date: 2018-08-29. DOI: 10.21437/SLTU.2018-20
Sangramsing Kayte, Monica R. Mundada
Citations: 1
IIITH-ILSC Speech Database for Indian Language Identification
Workshop on Spoken Language Technologies for Under-resourced Languages. Pub Date: 2018-08-29. DOI: 10.21437/SLTU.2018-12
R. Vuddagiri, K. Gurugubelli, P. Jain, Hari Krishna Vydana, A. Vuppala
Abstract: This work focuses on the development of a speech corpus comprising 23 Indian languages for building language identification (LID) systems. Since large amounts of data are a prerequisite for state-of-the-art LID systems, the task of developing a multilingual speech corpus for Indian languages was initiated. This paper describes the composition of the data and the performance of various LID systems developed on it. Mel-frequency cepstral coefficients are used as the feature representation, and state-of-the-art LID systems are built using i-vectors, deep neural networks (DNN), and deep neural networks with attention (DNN-WA). Measured as equal error rate, the performance of the i-vector, DNN, and DNN-WA systems is 17.77%, 17.95%, and 15.18% respectively, so the attention-based model outperforms both the i-vector and DNN models.
Citations: 17
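The equal error rate figures quoted above can be computed from per-trial verification scores as in the following sketch; the scoring setup and the toy data are assumptions, not the paper's evaluation code.

```python
# Minimal sketch (assumed) of equal error rate (EER) computation from
# per-trial LID scores: the operating point where false-positive and
# false-negative rates coincide.
import numpy as np
from sklearn.metrics import roc_curve

def equal_error_rate(labels, scores):
    """labels: 1 for target-language trials, 0 for non-target;
    scores: higher means more likely target."""
    fpr, tpr, _ = roc_curve(labels, scores)
    fnr = 1.0 - tpr
    idx = np.nanargmin(np.abs(fnr - fpr))   # where FPR and FNR are closest
    return float((fpr[idx] + fnr[idx]) / 2.0)

# toy usage with random scores
rng = np.random.default_rng(0)
labels = rng.integers(0, 2, size=1000)
scores = rng.normal(loc=labels.astype(float), scale=1.0)
print(f"EER = {100 * equal_error_rate(labels, scores):.2f}%")
```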
A Human Quality Text to Speech System for Sinhala
Workshop on Spoken Language Technologies for Under-resourced Languages. Pub Date: 2018-08-29. DOI: 10.21437/SLTU.2018-33
L. Nanayakkara, Chamila Liyanage, Pubudu Tharaka Viswakula, Thilini Nagungodage, Randil Pushpananda, R. Weerasinghe
Abstract: This paper proposes an approach to implementing a text-to-speech system for the Sinhala language using the MaryTTS framework. A set of rules for mapping text to sound was identified, and synthesis proceeded with a unit selection mechanism. The data used for this study were gathered from newspaper articles, and the corresponding sentences were recorded by a professional speaker. A user-level evaluation was conducted with 20 participants: the intelligibility and naturalness of the developed Sinhala TTS system each received a score of approximately 70%, and the overall speech quality scored approximately 60%.
Citations: 7
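The rule-based text-to-sound mapping mentioned above can be illustrated with a minimal greedy lookup; the rule table below is hypothetical and does not reproduce the paper's actual Sinhala letter-to-sound rules.

```python
# Minimal sketch of a rule-based grapheme-to-phone mapping mechanism.
# The rule table is illustrative only, not the paper's Sinhala rule set.
RULES = {            # grapheme -> phone (hypothetical examples)
    "ක": "k",
    "ම": "m",
    "ා": "aa",
}

def graphemes_to_phones(text):
    """Greedy longest-match lookup over the rule table."""
    phones, i = [], 0
    graphemes = sorted(RULES, key=len, reverse=True)
    while i < len(text):
        for g in graphemes:
            if text.startswith(g, i):
                phones.append(RULES[g])
                i += len(g)
                break
        else:
            i += 1               # skip characters without a rule
    return phones

print(graphemes_to_phones("කමා"))   # -> ['k', 'm', 'aa']
```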
Predicting the Features of World Atlas of Language Structures from Speech
Workshop on Spoken Language Technologies for Under-resourced Languages. Pub Date: 2018-08-29. DOI: 10.21437/SLTU.2018-52
Alexander Gutkin, Tatiana Merkulova, Martin Jansche
Citations: 0
Low-resource Tibetan Dialect Acoustic Modeling Based on Transfer Learning
Workshop on Spoken Language Technologies for Under-resourced Languages. Pub Date: 2018-08-29. DOI: 10.21437/SLTU.2018-2
Jinghao Yan, Zhiqiang Lv, Shen Huang, Hongzhi Yu
Citations: 2
Incorporating Speaker Normalizing Capabilities to an End-to-End Speech Recognition System
Workshop on Spoken Language Technologies for Under-resourced Languages. Pub Date: 2018-08-29. DOI: 10.21437/sltu.2018-36
Hari Krishna Vydana, Sivanand Achanta, A. Vuppala
Abstract: Speaker normalization is one of the crucial aspects of an automatic speech recognition (ASR) system; it is employed to reduce the performance drop caused by speaker variability. Traditional speaker normalization methods are mostly linear transforms over the input data estimated per speaker, and such transforms are effective only with sufficient data, whereas in practical scenarios only a single utterance from the test speaker is accessible. The present study explores speaker normalization methods for end-to-end speech recognition that remain effective even when a single utterance from an unseen speaker is available. It is hypothesized that by suitably providing information about the speaker's identity while training an end-to-end neural network, the capability to normalize speaker variability can be incorporated into the ASR system. The efficiency of these normalization methods depends on the representation used for unseen speakers. The identity of a training speaker is represented in two ways: (i) a one-hot speaker code, and (ii) a weighted combination of the identities of all training speakers; unseen test speakers are represented by a weighted combination of training-speaker representations. The two approaches reduce the word error rate (WER) by 0.6% and 1.3% respectively on the WSJ corpus.
Citations: 1
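A minimal sketch of the speaker-representation idea described above follows, assuming a PyTorch encoder whose input frames are concatenated with either a one-hot training-speaker code or a weighted combination of training-speaker codes for an unseen speaker; sizes and class names are placeholders, not the authors' architecture.

```python
# Minimal sketch (assumed) of feeding a speaker representation alongside
# acoustic features: one-hot codes for training speakers, and a weighted
# combination of training-speaker codes for unseen test speakers.
import torch
import torch.nn as nn

N_SPEAKERS, N_FEATS, N_HIDDEN = 283, 40, 256   # sizes are placeholders

class SpeakerAwareEncoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.rnn = nn.LSTM(N_FEATS + N_SPEAKERS, N_HIDDEN, batch_first=True)

    def forward(self, feats, speaker_code):
        # feats: (batch, frames, N_FEATS); speaker_code: (batch, N_SPEAKERS)
        code = speaker_code.unsqueeze(1).expand(-1, feats.size(1), -1)
        out, _ = self.rnn(torch.cat([feats, code], dim=-1))
        return out

# training speaker: one-hot code
train_code = nn.functional.one_hot(torch.tensor([7]), N_SPEAKERS).float()
# unseen test speaker: weights over training speakers (random here)
test_code = torch.softmax(torch.randn(1, N_SPEAKERS), dim=-1)

enc = SpeakerAwareEncoder()
feats = torch.randn(1, 120, N_FEATS)
print(enc(feats, train_code).shape, enc(feats, test_code).shape)
```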
A small Griko-Italian speech translation corpus
Workshop on Spoken Language Technologies for Under-resourced Languages. Pub Date: 2018-07-27. DOI: 10.21437/SLTU.2018-8
Marcely Zanon Boito, Antonios Anastasopoulos, M. Lekakou, A. Villavicencio, L. Besacier
Abstract: This paper presents an extension to a very low-resource parallel corpus collected in an endangered language, Griko, making it useful for computational research. The corpus consists of 330 utterances (about 2 hours of speech) which have been transcribed and translated into Italian, with annotations for word-level speech-to-transcription and speech-to-translation alignments. The corpus also includes morphosyntactic tags and word-level glosses, as well as pseudo-phones generated with an automatic unit discovery method. We detail how the corpus was collected, cleaned, and processed, and we illustrate its use on zero-resource tasks by presenting baseline results for speech-to-translation alignment and unsupervised word discovery. The dataset will be available online, aiming to encourage replicability and diversity in computational language documentation experiments.
Citations: 12
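To make the annotation layers concrete, the sketch below shows one way such an utterance record could be represented in code; the field names and the example words are illustrative assumptions and do not reflect the corpus's actual release format.

```python
# Minimal sketch of one utterance with the annotation layers listed in the
# abstract. The fields and the example record are illustrative assumptions.
from dataclasses import dataclass, field
from typing import List, Tuple

@dataclass
class GrikoUtterance:
    audio_path: str
    transcription: List[str]                 # Griko words
    translation: List[str]                   # Italian words
    glosses: List[str]                       # word-level glosses
    # (start_sec, end_sec) per Griko word: speech-to-transcription alignment
    word_times: List[Tuple[float, float]]
    # (griko_index, italian_index) pairs: speech-to-translation alignment
    word_alignment: List[Tuple[int, int]] = field(default_factory=list)

utt = GrikoUtterance(
    audio_path="utt_0001.wav",
    transcription=["echo", "ena", "spiti"],
    translation=["ho", "una", "casa"],
    glosses=["have.1SG", "a", "house"],
    word_times=[(0.00, 0.31), (0.31, 0.52), (0.52, 1.04)],
    word_alignment=[(0, 0), (1, 1), (2, 2)],
)
print(len(utt.transcription), "Griko words aligned to", len(utt.translation))
```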
Automatic Speech Recognition for Humanitarian Applications in Somali
Workshop on Spoken Language Technologies for Under-resourced Languages. Pub Date: 2018-07-23. DOI: 10.21437/SLTU.2018-5
Raghav Menon, A. Biswas, A. Saeb, John Quinn, T. Niesler
Abstract: We present our first efforts in building an automatic speech recognition system for Somali, an under-resourced language, using 1.57 hours of annotated speech for acoustic model training. The system is part of an ongoing effort by the United Nations (UN) to implement keyword spotting systems supporting humanitarian relief programmes in parts of Africa where languages are severely under-resourced. We evaluate several types of acoustic model, including recent neural architectures. Language model data augmentation using a combination of recurrent neural networks (RNNs) and long short-term memory networks (LSTMs), as well as perturbation of the acoustic data, are also considered. We find that both types of data augmentation are beneficial to performance, with our best system using a combination of convolutional neural networks (CNNs), time-delay neural networks (TDNNs) and bi-directional long short-term memory networks (BLSTMs) to achieve a word error rate of 53.75%.
Citations: 4
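The acoustic-data perturbation mentioned above can be sketched as conventional three-way speed perturbation; the implementation below (resampling while keeping the nominal sample rate, with librosa and soundfile, and placeholder file names) is an assumption rather than the paper's exact recipe.

```python
# Minimal sketch (assumed) of speed perturbation of training audio: resampling
# while keeping the nominal sample rate changes both tempo and pitch, as in the
# common Kaldi-style three-way recipe.
import librosa
import soundfile as sf

FACTORS = (0.9, 1.0, 1.1)            # conventional perturbation factors

def speed_perturb(wav_in, wav_out_prefix):
    y, sr = librosa.load(wav_in, sr=None)
    for f in FACTORS:
        # resampling to sr/f and playing back at sr speeds the audio up by f
        y_f = librosa.resample(y, orig_sr=sr, target_sr=int(round(sr / f)))
        sf.write(f"{wav_out_prefix}_sp{f}.wav", y_f, sr)

speed_perturb("somali_utt.wav", "somali_utt")   # file names are placeholders
```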