{"title":"基于傅立叶谱、FFNN和ANFIS技术的语音轮廓识别","authors":"I. Balabanova, G. Georgiev","doi":"10.1109/TELECOM53156.2021.9659793","DOIUrl":null,"url":null,"abstract":"This paper presents a combined approach for recognition of speech profiles based on FFT windowing, Feed-Forward Neural Networks (FFNN) and Adaptive Neuro-Fuzzy Interface Systems (ANFIS). By using spectral analysis of the speech of physical entities, there have been carried out feature extraction during application of Hamming, 4 Term B-Harris, Flat Top and Hanning windows. Individual informative sets (data sets) have been specified for the employed mathematical recognition apparatuses. A FFNN model has been synthesized during implementation of Scaled Conjugate Gradient (SCG) training for the purpose of speech profiles recognition with attained accuracy of 93.50 %. There has been selected neuro-fuzzy classifier in accordance with Hybrid learning algorithm and Pi shaped membership function of input variables. During testing of selected ANFIS model there has been established a 100.00 % accuracy in speech profiles recognition.","PeriodicalId":293631,"journal":{"name":"2021 29th National Conference with International Participation (TELECOM)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speech Profile Recognition by Fourier Spectral, FFNN and ANFIS Techniques\",\"authors\":\"I. Balabanova, G. Georgiev\",\"doi\":\"10.1109/TELECOM53156.2021.9659793\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a combined approach for recognition of speech profiles based on FFT windowing, Feed-Forward Neural Networks (FFNN) and Adaptive Neuro-Fuzzy Interface Systems (ANFIS). By using spectral analysis of the speech of physical entities, there have been carried out feature extraction during application of Hamming, 4 Term B-Harris, Flat Top and Hanning windows. Individual informative sets (data sets) have been specified for the employed mathematical recognition apparatuses. A FFNN model has been synthesized during implementation of Scaled Conjugate Gradient (SCG) training for the purpose of speech profiles recognition with attained accuracy of 93.50 %. There has been selected neuro-fuzzy classifier in accordance with Hybrid learning algorithm and Pi shaped membership function of input variables. During testing of selected ANFIS model there has been established a 100.00 % accuracy in speech profiles recognition.\",\"PeriodicalId\":293631,\"journal\":{\"name\":\"2021 29th National Conference with International Participation (TELECOM)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 29th National Conference with International Participation (TELECOM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TELECOM53156.2021.9659793\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 29th National Conference with International Participation (TELECOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TELECOM53156.2021.9659793","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文提出了一种基于FFT加窗、前馈神经网络(FFNN)和自适应神经模糊接口系统(ANFIS)的语音特征识别方法。通过对物理实体的语音进行频谱分析,分别对Hamming、4 Term B-Harris、Flat Top和Hanning窗进行了特征提取。已为所使用的数学识别设备指定了单独的信息集(数据集)。在进行缩放共轭梯度(SCG)训练的过程中,合成了一个FFNN模型,用于语音轮廓识别,准确率达到93.50%。根据混合学习算法和输入变量的Pi形隶属函数选择神经模糊分类器。在选定的ANFIS模型的测试中,已经建立了100.00%的语音轮廓识别准确率。
Speech Profile Recognition by Fourier Spectral, FFNN and ANFIS Techniques
This paper presents a combined approach for recognition of speech profiles based on FFT windowing, Feed-Forward Neural Networks (FFNN) and Adaptive Neuro-Fuzzy Interface Systems (ANFIS). By using spectral analysis of the speech of physical entities, there have been carried out feature extraction during application of Hamming, 4 Term B-Harris, Flat Top and Hanning windows. Individual informative sets (data sets) have been specified for the employed mathematical recognition apparatuses. A FFNN model has been synthesized during implementation of Scaled Conjugate Gradient (SCG) training for the purpose of speech profiles recognition with attained accuracy of 93.50 %. There has been selected neuro-fuzzy classifier in accordance with Hybrid learning algorithm and Pi shaped membership function of input variables. During testing of selected ANFIS model there has been established a 100.00 % accuracy in speech profiles recognition.