Age estimation based on speech features and support vector machine

D. Mahmoodi, H. Marvi, M. Taghizadeh, Ali Gholipour Soleimani, F. Razzazi, M. Mahmoodi
{"title":"Age estimation based on speech features and support vector machine","authors":"D. Mahmoodi, H. Marvi, M. Taghizadeh, Ali Gholipour Soleimani, F. Razzazi, M. Mahmoodi","doi":"10.1109/CEEC.2011.5995826","DOIUrl":null,"url":null,"abstract":"Age estimation based on human's speech features is an interesting subject in Automatic Speech Recognition (ASR) systems. There are some works in literature on speaker age estimation but it needs more new works especially for Persian speakers. In age estimation, like other speech processing systems, we encounter with two main challenges: finding an appropriate procedure for feature extraction, and selecting a reliable method for pattern classification. In this paper we propose an automatic age estimation system for classification of 6 age groups of various Persian speaker people. Perceptual Linear Predictive (PLP) and Mel-Frequency Cepstral Coefficients (MFCC) are extracted as speech features and SVM is utilized for classification procedure. Furthermore the effects of variations in parameter of kernel function, time of frame length in sampling process, the number of MFCC coefficients, and the order of PLP on system efficiency has been evaluated, and the results has been compared.","PeriodicalId":409910,"journal":{"name":"2011 3rd Computer Science and Electronic Engineering Conference (CEEC)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"29","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 3rd Computer Science and Electronic Engineering Conference (CEEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CEEC.2011.5995826","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 29

Abstract

Age estimation based on human's speech features is an interesting subject in Automatic Speech Recognition (ASR) systems. There are some works in literature on speaker age estimation but it needs more new works especially for Persian speakers. In age estimation, like other speech processing systems, we encounter with two main challenges: finding an appropriate procedure for feature extraction, and selecting a reliable method for pattern classification. In this paper we propose an automatic age estimation system for classification of 6 age groups of various Persian speaker people. Perceptual Linear Predictive (PLP) and Mel-Frequency Cepstral Coefficients (MFCC) are extracted as speech features and SVM is utilized for classification procedure. Furthermore the effects of variations in parameter of kernel function, time of frame length in sampling process, the number of MFCC coefficients, and the order of PLP on system efficiency has been evaluated, and the results has been compared.
基于语音特征和支持向量机的年龄估计
基于人的语音特征的年龄估计是自动语音识别(ASR)系统中一个有趣的研究课题。文献中对说话人年龄的估计已有一定的研究,但对波斯语说话人年龄的估计还需要更多的研究。在年龄估计中,像其他语音处理系统一样,我们遇到了两个主要的挑战:找到合适的特征提取过程,以及选择可靠的模式分类方法。本文提出了一种自动年龄估计系统,用于对不同波斯语人群的6个年龄组进行分类。提取感知线性预测(PLP)和mel -频率倒谱系数(MFCC)作为语音特征,利用支持向量机进行分类。分析了核函数参数、采样过程中帧长时间、MFCC系数个数和PLP阶数的变化对系统效率的影响,并对结果进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信