Accelerated Nonparametric Bayesian Double Articulation Analyzer for Unsupervised Word Discovery

2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) Pub Date : 2018-09-01 DOI:10.1109/DEVLRN.2018.8761036

Ryo Ozaki, T. Taniguchi

{"title":"Accelerated Nonparametric Bayesian Double Articulation Analyzer for Unsupervised Word Discovery","authors":"Ryo Ozaki, T. Taniguchi","doi":"10.1109/DEVLRN.2018.8761036","DOIUrl":null,"url":null,"abstract":"This paper describes an accelerated nonparametric Bayesian double articulation analyzer (NPB-DAA) for enabling a developmental robot to acquire words and phonemes directly from speech signals without labeled data in more realistic scenario than conventional NPB-DAA. Word discovery and phoneme acquisition are known as important tasks in human child development. Human infants can discover words and phonemes from raw speech signals at eight months without any label data, unlike supervised learning-based speech recognition systems. NPB-DAA was proposed by Taniguchi et al. and shown to be able to perform simultaneous word and phoneme discovery without any label data. However, the computational cost of NPB-DAA was extremely large, and thus could not be applied to large-scale speech data. In this paper, we introduce lookup tables for conventional NPB-DAA to reduce the computational cost and developed an accelerated NPB-DAA. Using the lookup tables, values calculated in each subroutine are memorized and reused in the subsequent calculations. This acceleration does not harm the quality of word and phoneme discovery because the introduction of the lookup tables is theoretically supported. This paper also shows that our accelerated NPB-DAA significantly reduced the computational cost by 90% compared to conventional NPB-DAA.","PeriodicalId":236346,"journal":{"name":"2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEVLRN.2018.8761036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

This paper describes an accelerated nonparametric Bayesian double articulation analyzer (NPB-DAA) for enabling a developmental robot to acquire words and phonemes directly from speech signals without labeled data in more realistic scenario than conventional NPB-DAA. Word discovery and phoneme acquisition are known as important tasks in human child development. Human infants can discover words and phonemes from raw speech signals at eight months without any label data, unlike supervised learning-based speech recognition systems. NPB-DAA was proposed by Taniguchi et al. and shown to be able to perform simultaneous word and phoneme discovery without any label data. However, the computational cost of NPB-DAA was extremely large, and thus could not be applied to large-scale speech data. In this paper, we introduce lookup tables for conventional NPB-DAA to reduce the computational cost and developed an accelerated NPB-DAA. Using the lookup tables, values calculated in each subroutine are memorized and reused in the subsequent calculations. This acceleration does not harm the quality of word and phoneme discovery because the introduction of the lookup tables is theoretically supported. This paper also shows that our accelerated NPB-DAA significantly reduced the computational cost by 90% compared to conventional NPB-DAA.

查看原文本刊更多论文

用于无监督词发现的加速非参数贝叶斯双发音分析器

本文介绍了一种加速的非参数贝叶斯双发音分析仪(NPB-DAA)，它使发展中的机器人能够在更现实的场景中直接从语音信号中获取单词和音素，而不是传统的NPB-DAA。单词发现和音素习得是人类儿童发展的重要任务。人类婴儿在8个月大的时候可以在没有任何标签数据的情况下从原始语音信号中发现单词和音素，这与基于监督学习的语音识别系统不同。NPB-DAA由Taniguchi等人提出，并被证明能够在没有任何标签数据的情况下同时进行单词和音素发现。但是，NPB-DAA的计算成本非常大，无法应用于大规模的语音数据。为了降低计算成本，我们在传统的NPB-DAA中引入了查找表，并开发了一种加速的NPB-DAA。使用查找表，在每个子例程中计算的值被记住，并在随后的计算中重用。这种加速不会损害单词和音素发现的质量，因为从理论上支持查找表的引入。本文还表明，与传统的NPB-DAA相比，我们的加速NPB-DAA的计算成本显著降低了90%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)

自引率

0.00%

发文量