Dempster-Shafer Fusion Based Gender Recognition for Speech Analysis Applications

2016 International Conference on Platform Technology and Service (PlatCon) Pub Date : 2016-02-01 DOI:10.1109/PLATCON.2016.7456788

Jamil Ahmad, Khan Muhammad, Soon-il Kwon, S. Baik, Seungmin Rho

{"title":"Dempster-Shafer Fusion Based Gender Recognition for Speech Analysis Applications","authors":"Jamil Ahmad, Khan Muhammad, Soon-il Kwon, S. Baik, Seungmin Rho","doi":"10.1109/PLATCON.2016.7456788","DOIUrl":null,"url":null,"abstract":"Speech signals carry valuable information about the speaker including age, gender, and emotional state. Gender information can act as a vital preprocessing ingredient for enhancing speech analysis applications like adaptive human-machine interfaces, multi-modal security applications, and sophisticated intent and context analysis based forensic systems. In uncontrolled environments like telephone speech applications, the gender recognition system should be adaptive, accurate, and robust to noisy environments. This paper presents a reasoning method governed by Dempster-Shafer theory of evidence for automatic gender recognition from telephone speech. The proposed method uses mel-frequency cepstral coefficients with a support vector machine to generate the initial prediction results for individual speech segments. The reasoning scheme collects and validates results from support vector machine and treats convincing predictions as valid evidence. It is argued that the consideration of valid evidence in the reasoning process improves recognition performance by avoiding unconvincing classification results. Experiments conducted on large speech datasets reveal the superiority of the proposed gender recognition scheme for speech analysis applications.","PeriodicalId":247342,"journal":{"name":"2016 International Conference on Platform Technology and Service (PlatCon)","volume":"7 26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Platform Technology and Service (PlatCon)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PLATCON.2016.7456788","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 14

Abstract

Speech signals carry valuable information about the speaker including age, gender, and emotional state. Gender information can act as a vital preprocessing ingredient for enhancing speech analysis applications like adaptive human-machine interfaces, multi-modal security applications, and sophisticated intent and context analysis based forensic systems. In uncontrolled environments like telephone speech applications, the gender recognition system should be adaptive, accurate, and robust to noisy environments. This paper presents a reasoning method governed by Dempster-Shafer theory of evidence for automatic gender recognition from telephone speech. The proposed method uses mel-frequency cepstral coefficients with a support vector machine to generate the initial prediction results for individual speech segments. The reasoning scheme collects and validates results from support vector machine and treats convincing predictions as valid evidence. It is argued that the consideration of valid evidence in the reasoning process improves recognition performance by avoiding unconvincing classification results. Experiments conducted on large speech datasets reveal the superiority of the proposed gender recognition scheme for speech analysis applications.

查看原文本刊更多论文

基于Dempster-Shafer融合的性别识别语音分析应用

语音信号包含说话人的年龄、性别和情绪状态等有价值的信息。性别信息可以作为重要的预处理成分，用于增强语音分析应用，如自适应人机界面、多模态安全应用以及基于复杂意图和上下文分析的法医系统。在电话语音应用等非受控环境中，性别识别系统应具有自适应性、准确性和对噪声环境的鲁棒性。本文提出了一种基于Dempster-Shafer证据理论的电话语音性别自动识别推理方法。该方法使用mel-frequency倒谱系数和支持向量机对单个语音片段生成初始预测结果。推理方案收集并验证来自支持向量机的结果，并将令人信服的预测作为有效证据。认为在推理过程中考虑有效证据可以避免不令人信服的分类结果，从而提高识别性能。在大型语音数据集上进行的实验显示了所提出的性别识别方案在语音分析应用中的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 International Conference on Platform Technology and Service (PlatCon)

自引率

0.00%

发文量