基于声学元音分析的哥斯达黎加人年龄分类初探

IF 0.1 Q4 MULTIDISCIPLINARY SCIENCES
Victor Yeom-Song, Marvin Coto-Jiménez
{"title":"基于声学元音分析的哥斯达黎加人年龄分类初探","authors":"Victor Yeom-Song, Marvin Coto-Jiménez","doi":"10.18845/tm.v35i8.6466","DOIUrl":null,"url":null,"abstract":"According to several studies, children’s speech is more dynamic and inconsistent compared to an adult’s speech. This aspect can be considered in the task of recognizing the age of the person who speaks and of great importance in many applications, such as humancomputer interaction, security on Internet and education assistants. Those applications have a dependency on language and accent, due to the different sounds and styles that characterize the speakers. This paper presents the initial results on the identification of Costa Rican children’s speech, in a database created for this purpose, consisting of words pronounced by adults and children of several ages. For this first study we chose the most common vowel of the language, and extract a set of common acoustic features to determine its applicability in distinguishing between adults and children of an age range. The outcome results shows promising results in the classification using a single vowel, that improves according to the number of vowels used to extract the acoustic features. This means that an automatic system could be able to improve its capacity to identify age as more speech information is received and transcribed, but cannot be very accurate in short interactions.","PeriodicalId":42957,"journal":{"name":"Tecnologia en Marcha","volume":null,"pages":null},"PeriodicalIF":0.1000,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A first study on age classification of costa rican speakers based on acoustic vowel analysis\",\"authors\":\"Victor Yeom-Song, Marvin Coto-Jiménez\",\"doi\":\"10.18845/tm.v35i8.6466\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"According to several studies, children’s speech is more dynamic and inconsistent compared to an adult’s speech. This aspect can be considered in the task of recognizing the age of the person who speaks and of great importance in many applications, such as humancomputer interaction, security on Internet and education assistants. Those applications have a dependency on language and accent, due to the different sounds and styles that characterize the speakers. This paper presents the initial results on the identification of Costa Rican children’s speech, in a database created for this purpose, consisting of words pronounced by adults and children of several ages. For this first study we chose the most common vowel of the language, and extract a set of common acoustic features to determine its applicability in distinguishing between adults and children of an age range. The outcome results shows promising results in the classification using a single vowel, that improves according to the number of vowels used to extract the acoustic features. This means that an automatic system could be able to improve its capacity to identify age as more speech information is received and transcribed, but cannot be very accurate in short interactions.\",\"PeriodicalId\":42957,\"journal\":{\"name\":\"Tecnologia en Marcha\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.1000,\"publicationDate\":\"2022-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tecnologia en Marcha\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18845/tm.v35i8.6466\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tecnologia en Marcha","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18845/tm.v35i8.6466","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

根据几项研究,与成年人的语言相比,儿童的语言更有活力,也更不一致。这方面可以在识别说话人的年龄的任务中考虑,并且在许多应用中非常重要,例如人机交互,互联网安全和教育助理。这些应用依赖于语言和口音,因为说话者的声音和风格不同。本文介绍了在为此目的而建立的数据库中鉴定哥斯达黎加儿童语言的初步结果,该数据库由成人和不同年龄的儿童所发的单词组成。在第一项研究中,我们选择了语言中最常见的元音,并提取了一组常见的声学特征,以确定其在区分成人和儿童年龄范围中的适用性。结果表明,单元音分类的效果很好,根据提取声学特征的元音数量的不同,分类效果也有所改善。这意味着,随着接收和转录的语音信息越来越多,自动系统识别年龄的能力可能会提高,但在短时间的互动中就不太准确了。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A first study on age classification of costa rican speakers based on acoustic vowel analysis
According to several studies, children’s speech is more dynamic and inconsistent compared to an adult’s speech. This aspect can be considered in the task of recognizing the age of the person who speaks and of great importance in many applications, such as humancomputer interaction, security on Internet and education assistants. Those applications have a dependency on language and accent, due to the different sounds and styles that characterize the speakers. This paper presents the initial results on the identification of Costa Rican children’s speech, in a database created for this purpose, consisting of words pronounced by adults and children of several ages. For this first study we chose the most common vowel of the language, and extract a set of common acoustic features to determine its applicability in distinguishing between adults and children of an age range. The outcome results shows promising results in the classification using a single vowel, that improves according to the number of vowels used to extract the acoustic features. This means that an automatic system could be able to improve its capacity to identify age as more speech information is received and transcribed, but cannot be very accurate in short interactions.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Tecnologia en Marcha
Tecnologia en Marcha MULTIDISCIPLINARY SCIENCES-
自引率
0.00%
发文量
93
审稿时长
28 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信