Accelerated Nonparametric Bayesian Double Articulation Analyzer for Unsupervised Word Discovery

Ryo Ozaki, T. Taniguchi
{"title":"Accelerated Nonparametric Bayesian Double Articulation Analyzer for Unsupervised Word Discovery","authors":"Ryo Ozaki, T. Taniguchi","doi":"10.1109/DEVLRN.2018.8761036","DOIUrl":null,"url":null,"abstract":"This paper describes an accelerated nonparametric Bayesian double articulation analyzer (NPB-DAA) for enabling a developmental robot to acquire words and phonemes directly from speech signals without labeled data in more realistic scenario than conventional NPB-DAA. Word discovery and phoneme acquisition are known as important tasks in human child development. Human infants can discover words and phonemes from raw speech signals at eight months without any label data, unlike supervised learning-based speech recognition systems. NPB-DAA was proposed by Taniguchi et al. and shown to be able to perform simultaneous word and phoneme discovery without any label data. However, the computational cost of NPB-DAA was extremely large, and thus could not be applied to large-scale speech data. In this paper, we introduce lookup tables for conventional NPB-DAA to reduce the computational cost and developed an accelerated NPB-DAA. Using the lookup tables, values calculated in each subroutine are memorized and reused in the subsequent calculations. This acceleration does not harm the quality of word and phoneme discovery because the introduction of the lookup tables is theoretically supported. This paper also shows that our accelerated NPB-DAA significantly reduced the computational cost by 90% compared to conventional NPB-DAA.","PeriodicalId":236346,"journal":{"name":"2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEVLRN.2018.8761036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

This paper describes an accelerated nonparametric Bayesian double articulation analyzer (NPB-DAA) for enabling a developmental robot to acquire words and phonemes directly from speech signals without labeled data in more realistic scenario than conventional NPB-DAA. Word discovery and phoneme acquisition are known as important tasks in human child development. Human infants can discover words and phonemes from raw speech signals at eight months without any label data, unlike supervised learning-based speech recognition systems. NPB-DAA was proposed by Taniguchi et al. and shown to be able to perform simultaneous word and phoneme discovery without any label data. However, the computational cost of NPB-DAA was extremely large, and thus could not be applied to large-scale speech data. In this paper, we introduce lookup tables for conventional NPB-DAA to reduce the computational cost and developed an accelerated NPB-DAA. Using the lookup tables, values calculated in each subroutine are memorized and reused in the subsequent calculations. This acceleration does not harm the quality of word and phoneme discovery because the introduction of the lookup tables is theoretically supported. This paper also shows that our accelerated NPB-DAA significantly reduced the computational cost by 90% compared to conventional NPB-DAA.
用于无监督词发现的加速非参数贝叶斯双发音分析器
本文介绍了一种加速的非参数贝叶斯双发音分析仪(NPB-DAA),它使发展中的机器人能够在更现实的场景中直接从语音信号中获取单词和音素,而不是传统的NPB-DAA。单词发现和音素习得是人类儿童发展的重要任务。人类婴儿在8个月大的时候可以在没有任何标签数据的情况下从原始语音信号中发现单词和音素,这与基于监督学习的语音识别系统不同。NPB-DAA由Taniguchi等人提出,并被证明能够在没有任何标签数据的情况下同时进行单词和音素发现。但是,NPB-DAA的计算成本非常大,无法应用于大规模的语音数据。为了降低计算成本,我们在传统的NPB-DAA中引入了查找表,并开发了一种加速的NPB-DAA。使用查找表,在每个子例程中计算的值被记住,并在随后的计算中重用。这种加速不会损害单词和音素发现的质量,因为从理论上支持查找表的引入。本文还表明,与传统的NPB-DAA相比,我们的加速NPB-DAA的计算成本显著降低了90%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信