{"title":"Accelerated Nonparametric Bayesian Double Articulation Analyzer for Unsupervised Word Discovery","authors":"Ryo Ozaki, T. Taniguchi","doi":"10.1109/DEVLRN.2018.8761036","DOIUrl":null,"url":null,"abstract":"This paper describes an accelerated nonparametric Bayesian double articulation analyzer (NPB-DAA) for enabling a developmental robot to acquire words and phonemes directly from speech signals without labeled data in more realistic scenario than conventional NPB-DAA. Word discovery and phoneme acquisition are known as important tasks in human child development. Human infants can discover words and phonemes from raw speech signals at eight months without any label data, unlike supervised learning-based speech recognition systems. NPB-DAA was proposed by Taniguchi et al. and shown to be able to perform simultaneous word and phoneme discovery without any label data. However, the computational cost of NPB-DAA was extremely large, and thus could not be applied to large-scale speech data. In this paper, we introduce lookup tables for conventional NPB-DAA to reduce the computational cost and developed an accelerated NPB-DAA. Using the lookup tables, values calculated in each subroutine are memorized and reused in the subsequent calculations. This acceleration does not harm the quality of word and phoneme discovery because the introduction of the lookup tables is theoretically supported. This paper also shows that our accelerated NPB-DAA significantly reduced the computational cost by 90% compared to conventional NPB-DAA.","PeriodicalId":236346,"journal":{"name":"2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEVLRN.2018.8761036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This paper describes an accelerated nonparametric Bayesian double articulation analyzer (NPB-DAA) for enabling a developmental robot to acquire words and phonemes directly from speech signals without labeled data in more realistic scenario than conventional NPB-DAA. Word discovery and phoneme acquisition are known as important tasks in human child development. Human infants can discover words and phonemes from raw speech signals at eight months without any label data, unlike supervised learning-based speech recognition systems. NPB-DAA was proposed by Taniguchi et al. and shown to be able to perform simultaneous word and phoneme discovery without any label data. However, the computational cost of NPB-DAA was extremely large, and thus could not be applied to large-scale speech data. In this paper, we introduce lookup tables for conventional NPB-DAA to reduce the computational cost and developed an accelerated NPB-DAA. Using the lookup tables, values calculated in each subroutine are memorized and reused in the subsequent calculations. This acceleration does not harm the quality of word and phoneme discovery because the introduction of the lookup tables is theoretically supported. This paper also shows that our accelerated NPB-DAA significantly reduced the computational cost by 90% compared to conventional NPB-DAA.