Speaker Identification System Using CNN Approach
Neelam Nehra, P. Sangwan, Divya Kumar
2021 International Conference on Industrial Electronics Research and Applications (ICIERA), published 2021-12-22
DOI: 10.1109/ICIERA53202.2021.9726767
Citations: 1
Abstract
In this paper, a text-independent Speaker Identification (SI) system based on a convolutional neural network (CNN) is proposed, and the methodology is also tested in a noisy environment. A CNN is adopted in this work for its ability to learn discriminative features for the classification task. At the front end, spectrogram images are used for feature extraction; the CNN then performs classification, yielding promising results. The dataset used in this work comprises 5 speakers, each uttering 4 voice samples. The overall accuracy achieved by the proposed approach is 96.54%.
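The abstract describes a spectrogram-based front end feeding a CNN classifier. As a minimal sketch of the first stage only, the following NumPy code computes a magnitude spectrogram via a Hann-windowed short-time Fourier transform; the frame length, hop size, and the 440 Hz test tone are illustrative assumptions, not parameters taken from the paper.

```python
import numpy as np

def spectrogram(signal, n_fft=256, hop=128):
    """Magnitude spectrogram via a Hann-windowed STFT.

    Frames of length n_fft are taken every `hop` samples; a one-sided
    FFT gives n_fft // 2 + 1 frequency bins per frame.
    (Illustrative parameters, not those of the paper.)
    """
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([signal[i * hop : i * hop + n_fft] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1))

# Hypothetical input: a 1-second, 8 kHz sine tone at 440 Hz
sr = 8000
t = np.arange(sr) / sr
sig = np.sin(2 * np.pi * 440 * t)
spec = spectrogram(sig)
print(spec.shape)  # (number of frames, n_fft // 2 + 1)
```

In a full system, such spectrograms would be rendered as fixed-size images and fed to a CNN; the paper does not detail its network architecture, so that stage is not reproduced here.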