Noise-robust and stress-free visualization of pronunciation diversity of World Englishes using a learner's self-centered viewpoint

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) Pub Date : 2015-12-17 DOI:10.1109/ICSDA.2015.7357855

Yuichi Sato, Yosuke Kashiwagi, N. Minematsu, D. Saito, K. Hirose

{"title":"Noise-robust and stress-free visualization of pronunciation diversity of World Englishes using a learner's self-centered viewpoint","authors":"Yuichi Sato, Yosuke Kashiwagi, N. Minematsu, D. Saito, K. Hirose","doi":"10.1109/ICSDA.2015.7357855","DOIUrl":null,"url":null,"abstract":"The term of “World Englishes” describes the current and real state of English and one of their main characteristics is a large diversity of pronunciation, called accents. We have developed two techniques of individual-based clustering of the diversity [1, 2] and educationally-effective visualization of the diversity [3]. Accent clustering requires a technique to quantify the accent gap between any speaker pair and visualization requires a technique of stress-free plotting of the speakers. In the above studies, however, we developed and assessed these two techniques independently and in this paper, we assess our technique of automatic accept gap prediction when it is used for our stress-free visualization. Further, since CALL applications today are not always used in a quiet environment, we introduce a feature enhancement (denoising) technique to improve noise-robustness of accent gap prediction. Results show that our accent gap prediction shows correlation of 0.77 to IPA-based manually-defined accent gaps and that, by applying feature enhancement to noisy input utterances, our technique can predict the accent gap that could be obtained in a clean condition, when the SNR is larger than 10 [dB].","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"128 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSDA.2015.7357855","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

The term of “World Englishes” describes the current and real state of English and one of their main characteristics is a large diversity of pronunciation, called accents. We have developed two techniques of individual-based clustering of the diversity [1, 2] and educationally-effective visualization of the diversity [3]. Accent clustering requires a technique to quantify the accent gap between any speaker pair and visualization requires a technique of stress-free plotting of the speakers. In the above studies, however, we developed and assessed these two techniques independently and in this paper, we assess our technique of automatic accept gap prediction when it is used for our stress-free visualization. Further, since CALL applications today are not always used in a quiet environment, we introduce a feature enhancement (denoising) technique to improve noise-robustness of accent gap prediction. Results show that our accent gap prediction shows correlation of 0.77 to IPA-based manually-defined accent gaps and that, by applying feature enhancement to noisy input utterances, our technique can predict the accent gap that could be obtained in a clean condition, when the SNR is larger than 10 [dB].

查看原文本刊更多论文

以学习者为中心的视角，无噪声、无压力地可视化世界英语的发音多样性

“世界英语”一词描述了当前和真实的英语状态，其主要特征之一是发音的多样性，称为口音。我们已经开发了两种基于个体的多样性聚类技术[1,2]和教育有效的多样性可视化技术[3]。口音聚类需要一种量化任何说话人对之间的口音差距的技术，而可视化需要一种对说话人进行无应力绘图的技术。然而，在上述研究中，我们独立开发和评估了这两种技术，在本文中，我们评估了我们的自动接受间隙预测技术，当它用于我们的无压力可视化时。此外，由于今天的CALL应用并不总是在安静的环境中使用，我们引入了一种特征增强(去噪)技术来提高口音间隙预测的噪声鲁棒性。结果表明，我们的重音间隙预测与基于ipa的手动定义重音间隙的相关性为0.77，并且通过对有噪声的输入话语进行特征增强，我们的技术可以预测在干净条件下，当信噪比大于10 [dB]时可以获得的重音间隙。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)

自引率

0.00%

发文量