Objectivization of Phonological Evaluation of Speech Elements by Means of Audio Parametrization

2018 11th International Conference on Human System Interaction (HSI) Pub Date : 2018-07-01 DOI:10.1109/HSI.2018.8431352

Magdalena Piotrowska, G. Korvel, B. Kostek, A. Rojczyk, A. Czyżewski


Citations: 2

Abstract

This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to apply objective parametrization and phonological classification to the recorded allophones. The allophones were selected because they are particularly difficult for Polish speakers of English; they involve aspiration, final obstruent devoicing, dark lateral /l/, velar nasal, and pre-fortis clipping. A set of audio features based on the mechanism of each phonological process was created. Recordings of phonetic material prepared by a phonology expert were made. First, several speakers were recorded while reading words from a teleprompter. Then, each word was played back from a sample previously recorded by a phonology expert, and each examined speaker repeated the word, trying to imitate the correct pronunciation. Next, the two recorded sets of words were partitioned by editing into allophones, the signals were analyzed, and the audio excerpts were parametrized. The comparison of the two sets of allophones was supported by the phonology expert's assessment of the produced speech sounds. The analyses presented in this paper identified a set of parameters that make it possible to determine whether the target processes were pronounced correctly.
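The parametrize-and-compare step described above can be illustrated with a minimal sketch. The abstract does not name the audio features that were actually used, so the three parameters below (duration, zero-crossing rate, and RMS energy) are assumptions chosen only for illustration, and the synthetic sine waves are hypothetical stand-ins for the recorded allophone excerpts.

```python
import math


def sine(freq_hz, duration_s, sample_rate=16000, amplitude=0.5):
    """Synthetic stand-in for a recorded allophone excerpt (not real speech)."""
    n = round(duration_s * sample_rate)
    return [amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)
            for i in range(n)]


def zero_crossing_rate(samples):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(samples) < 2:
        return 0.0
    crossings = sum(1 for a, b in zip(samples, samples[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(samples) - 1)


def rms_energy(samples):
    """Root-mean-square amplitude of the excerpt."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def parametrize(samples, sample_rate=16000):
    """Map one excerpt to a small, illustrative feature dictionary."""
    return {
        "duration_ms": 1000.0 * len(samples) / sample_rate,
        "zcr": zero_crossing_rate(samples),
        "rms": rms_energy(samples),
    }


def feature_differences(first, second):
    """Per-feature difference between two parametrized excerpts."""
    return {name: second[name] - first[name] for name in first}


# Hypothetical pair: a low-frequency, voiced-like excerpt from the first
# recording and a shorter, high-frequency, noise-like excerpt from the second.
first_reading = parametrize(sine(120, 0.10))
imitation = parametrize(sine(3000, 0.06))
differences = feature_differences(first_reading, imitation)

for name, value in differences.items():
    print(f"{name}: {value:+.4f}")
```

In a real setting, the excerpts would come from the segmented recordings, and the feature set would be tailored to each phonological process, as the abstract indicates. The expert's assessment would then serve as the reference against which such differences are interpreted.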
