低资源下的小占用端到端KWS系统

Gui-Xin Shi, Weiqiang Zhang, Hao Wu, Yao Liu
{"title":"低资源下的小占用端到端KWS系统","authors":"Gui-Xin Shi, Weiqiang Zhang, Hao Wu, Yao Liu","doi":"10.1145/3372806.3372822","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an efficient end-to-end architecture, based on Connectionist Temporal Classification (CTC), for low-resource small-footprint keyword spotting (KWS) system. For a low-resource KWS system, it is difficult for the network to thoroughly learn the features of keywords. The intuition behind our new model is that a priori information of the keyword is available. In contrast to the conventional KWS system, we modify the label set by adding the preset keyword(s) to the original label set to enhance the learning performance and optimize the final detection task of the system. Besides, CTC is applied to address the sequential alignment problem. We employ GRU as the encoding layer in our system because of the dataset small. Experiments using the WSJ0 dataset show that the proposed KWS system is significantly more accurate than the baseline system. Compared to the character-level-only KWS system, the proposed system can obviously improve the performance. Furthermore, the improved system works well in terms of low resource conditions, especially for long words.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Small-Footprint End-to-End KWS System in Low Resources\",\"authors\":\"Gui-Xin Shi, Weiqiang Zhang, Hao Wu, Yao Liu\",\"doi\":\"10.1145/3372806.3372822\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose an efficient end-to-end architecture, based on Connectionist Temporal Classification (CTC), for low-resource small-footprint keyword spotting (KWS) system. For a low-resource KWS system, it is difficult for the network to thoroughly learn the features of keywords. The intuition behind our new model is that a priori information of the keyword is available. In contrast to the conventional KWS system, we modify the label set by adding the preset keyword(s) to the original label set to enhance the learning performance and optimize the final detection task of the system. Besides, CTC is applied to address the sequential alignment problem. We employ GRU as the encoding layer in our system because of the dataset small. Experiments using the WSJ0 dataset show that the proposed KWS system is significantly more accurate than the baseline system. Compared to the character-level-only KWS system, the proposed system can obviously improve the performance. Furthermore, the improved system works well in terms of low resource conditions, especially for long words.\",\"PeriodicalId\":340004,\"journal\":{\"name\":\"International Conference on Signal Processing and Machine Learning\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Signal Processing and Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3372806.3372822\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Signal Processing and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3372806.3372822","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在本文中,我们提出了一种基于连接时间分类(CTC)的高效端到端架构,用于低资源、小占用的关键字识别(KWS)系统。对于低资源的KWS系统,网络很难完全学习到关键词的特征。我们新模型背后的直觉是,关键字的先验信息是可用的。与传统的KWS系统相比,我们对标签集进行了修改,将预设的关键字添加到原有的标签集中,从而提高了学习性能,优化了系统的最终检测任务。此外,将CTC应用于解决序列对齐问题。由于数据集较小,我们在系统中采用GRU作为编码层。使用WSJ0数据集的实验表明,所提出的KWS系统比基线系统的精度显著提高。与仅字符级的KWS系统相比,该系统可以明显提高性能。此外,改进后的系统在低资源条件下也能很好地工作,特别是对于长单词。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Small-Footprint End-to-End KWS System in Low Resources
In this paper, we propose an efficient end-to-end architecture, based on Connectionist Temporal Classification (CTC), for low-resource small-footprint keyword spotting (KWS) system. For a low-resource KWS system, it is difficult for the network to thoroughly learn the features of keywords. The intuition behind our new model is that a priori information of the keyword is available. In contrast to the conventional KWS system, we modify the label set by adding the preset keyword(s) to the original label set to enhance the learning performance and optimize the final detection task of the system. Besides, CTC is applied to address the sequential alignment problem. We employ GRU as the encoding layer in our system because of the dataset small. Experiments using the WSJ0 dataset show that the proposed KWS system is significantly more accurate than the baseline system. Compared to the character-level-only KWS system, the proposed system can obviously improve the performance. Furthermore, the improved system works well in terms of low resource conditions, especially for long words.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信