Transfer Learning for Low Resource Spoken Language Understanding without Speech-to-Text
Swapnil Bhosale, I. Sheikh, Sri Harsha Dumpala, S. Kopparapu
2019 IEEE Bombay Section Signature Conference (IBSSC), July 2019. DOI: 10.1109/IBSSC47189.2019.8973067
Spoken Language Understanding (SLU) without speech-to-text conversion is more promising in low-resource scenarios. These could be applications where there is not enough labeled data to train reliable speech recognition and language understanding systems, or where running SLU at the edge is preferred over cloud-based services. In this paper, we present a transfer learning approach for building SLU without speech-to-text conversion in low-resource scenarios. We show that intermediate layer representations from a pre-trained model outperform the typically used Mel filter bank features. Moreover, the representations extracted from a model pre-trained on one language perform well even for SLU tasks in a different language.
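The abstract does not specify the exact architecture, so the following is only a minimal sketch of the general idea: contrast standard Mel filter bank features with intermediate-layer representations taken from a (hypothetical) pre-trained acoustic encoder, then feed either representation to a small intent classifier that never produces text. The encoder class, layer sizes, and checkpoint path are assumptions for illustration, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torchaudio

SAMPLE_RATE = 16_000

# Baseline features: 40-dimensional Mel filter bank energies.
mel_extractor = torchaudio.transforms.MelSpectrogram(
    sample_rate=SAMPLE_RATE, n_fft=400, hop_length=160, n_mels=40
)

class PretrainedAcousticEncoder(nn.Module):
    """Stand-in for a pre-trained acoustic model (possibly trained on a
    different language); its intermediate hidden states are reused as
    transferred SLU features."""
    def __init__(self, n_mels: int = 40, hidden: int = 256, layers: int = 3):
        super().__init__()
        self.rnn = nn.LSTM(n_mels, hidden, num_layers=layers, batch_first=True)

    def forward(self, mel: torch.Tensor) -> torch.Tensor:
        # mel: (batch, time, n_mels) -> intermediate representations (batch, time, hidden)
        out, _ = self.rnn(mel)
        return out

class IntentClassifier(nn.Module):
    """Small SLU head operating directly on acoustic representations,
    i.e. without any speech-to-text step."""
    def __init__(self, in_dim: int, n_intents: int):
        super().__init__()
        self.fc = nn.Linear(in_dim, n_intents)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        pooled = feats.mean(dim=1)  # mean-pool over time
        return self.fc(pooled)

if __name__ == "__main__":
    waveform = torch.randn(1, SAMPLE_RATE)          # 1 second of dummy audio
    mel = mel_extractor(waveform).transpose(1, 2)   # (1, time, 40)

    encoder = PretrainedAcousticEncoder()
    # encoder.load_state_dict(torch.load("pretrained_encoder.pt"))  # hypothetical checkpoint

    with torch.no_grad():
        transferred = encoder(mel)                  # pre-trained intermediate-layer features

    logits_mel = IntentClassifier(in_dim=40, n_intents=5)(mel)
    logits_tl = IntentClassifier(in_dim=256, n_intents=5)(transferred)
    print(logits_mel.shape, logits_tl.shape)        # both (1, 5) intent logits
```

In this sketch the same classifier head is trained twice, once on raw Mel features and once on the transferred representations, which is one simple way to compare the two feature sets under a low-resource label budget.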