Objectivization of Phonological Evaluation of Speech Elements by Means of Audio Parametrization

Magdalena Piotrowska, G. Korvel, B. Kostek, A. Rojczyk, A. Czyżewski
{"title":"Objectivization of Phonological Evaluation of Speech Elements by Means of Audio Parametrization","authors":"Magdalena Piotrowska, G. Korvel, B. Kostek, A. Rojczyk, A. Czyżewski","doi":"10.1109/HSI.2018.8431352","DOIUrl":null,"url":null,"abstract":"This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal and pre-fortis clipping. A set of audio features based on mechanism of each phonological process was created. Recordings of phonetic material prepared by phonology expert were executed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted in partitioning by editing two recorded sets of words into allophones, then signals were analyzed and subsequently audio excerpts were parametrized. The comparison of two sets of allophones was reinforced by the phonology expert's assessment of produced speech sounds. Analyses presented in this paper allowed for discovering a set of parameters, which enable to determine whether the target processes were pronounced correctly.","PeriodicalId":441117,"journal":{"name":"2018 11th International Conference on Human System Interaction (HSI)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 11th International Conference on Human System Interaction (HSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HSI.2018.8431352","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal and pre-fortis clipping. A set of audio features based on mechanism of each phonological process was created. Recordings of phonetic material prepared by phonology expert were executed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted in partitioning by editing two recorded sets of words into allophones, then signals were analyzed and subsequently audio excerpts were parametrized. The comparison of two sets of allophones was reinforced by the phonology expert's assessment of produced speech sounds. Analyses presented in this paper allowed for discovering a set of parameters, which enable to determine whether the target processes were pronounced correctly.
语音参数化对语音要素语音评价的客观化
本研究通过调查与音素产生相关的五种语音现象,解决了与机器和主观语音评估相关的两个问题。其目的是对所记录的音素进行客观的参数化和音系分类。这些音素被选为对波兰英语使用者来说特别困难的音素:送气音、末阻塞音、暗侧音/l/、软鼻音和前fortis削音。基于每个语音过程的机制,建立了一组音频特征。对语音专家准备的语音材料进行录音。首先,几个说话人朗读提词器上的单词时被录下来。然后,语音专家从之前录制的样本中播放每个单词,每个被测试的说话者都重复一个特定的单词,试图模仿正确的发音。下一步是通过将两组录制的单词编辑成音素进行分割,然后对信号进行分析,随后对音频摘录进行参数化。音位学专家对产生的语音的评估加强了两组音素的比较。本文中提出的分析允许发现一组参数,这些参数能够确定目标过程是否正确发音。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信