基于机器学习的语音情感识别研究进展

International Journal of Innovative Research in Computer Science and Technology Pub Date : 2022-05-29 DOI:10.55524/ijircst.2022.10.3.65

Sk. Mohammed Jubear, D. P. K. Reddy, G. Subramanyam, Sk. Farooq, T. Sreenivasulu, N. S. Rao

{"title":"基于机器学习的语音情感识别研究进展","authors":"Sk. Mohammed Jubear, D. P. K. Reddy, G. Subramanyam, Sk. Farooq, T. Sreenivasulu, N. S. Rao","doi":"10.55524/ijircst.2022.10.3.65","DOIUrl":null,"url":null,"abstract":"This paper focuses on the development of a robust speech emotion recognition system using a combination of different speech features with feature optimization techniques and speech de-noising technique to acquire improved emotion classification accuracy, decreasing the system complexity and obtain noise robustness. Additionally, we create original methods for SER to merge features. We employ feature optimization methods that are based on the feature transformation and feature selection machine learning techniques in order to build SER. The following is a list of the upcoming events. A neural network can use either of these two techniques. As more feelings are taken into account, the feature fusion-acquired SER accuracy falls short of expectations, and the plague of dimensionality starts to spread due to the addition of speech features, which makes the SER system work harder to complete its task. This is due to the SER system becoming more complicated when voice elements are added. Therefore, it is crucial to create a SER system that is more trustworthy, has the most practical features, and uses the least amount of computing power possible. By using strategies that maximize current features, it is possible to streamline the feature selection process by reducing the total number of accessible choices to a more reasonable level. This piece employs a method known as Semi-Non Negative Matrix Factorization to lessen the amount of processing trash that the SER system generates. (Semi-NMF). This approach can be used to change traits that are capable of learning on their own.","PeriodicalId":218345,"journal":{"name":"International Journal of Innovative Research in Computer Science and Technology","volume":"104 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Review on Speech Emotion Recognition Using Machine Learning\",\"authors\":\"Sk. Mohammed Jubear, D. P. K. Reddy, G. Subramanyam, Sk. Farooq, T. Sreenivasulu, N. S. Rao\",\"doi\":\"10.55524/ijircst.2022.10.3.65\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper focuses on the development of a robust speech emotion recognition system using a combination of different speech features with feature optimization techniques and speech de-noising technique to acquire improved emotion classification accuracy, decreasing the system complexity and obtain noise robustness. Additionally, we create original methods for SER to merge features. We employ feature optimization methods that are based on the feature transformation and feature selection machine learning techniques in order to build SER. The following is a list of the upcoming events. A neural network can use either of these two techniques. As more feelings are taken into account, the feature fusion-acquired SER accuracy falls short of expectations, and the plague of dimensionality starts to spread due to the addition of speech features, which makes the SER system work harder to complete its task. This is due to the SER system becoming more complicated when voice elements are added. Therefore, it is crucial to create a SER system that is more trustworthy, has the most practical features, and uses the least amount of computing power possible. By using strategies that maximize current features, it is possible to streamline the feature selection process by reducing the total number of accessible choices to a more reasonable level. This piece employs a method known as Semi-Non Negative Matrix Factorization to lessen the amount of processing trash that the SER system generates. (Semi-NMF). This approach can be used to change traits that are capable of learning on their own.\",\"PeriodicalId\":218345,\"journal\":{\"name\":\"International Journal of Innovative Research in Computer Science and Technology\",\"volume\":\"104 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Innovative Research in Computer Science and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.55524/ijircst.2022.10.3.65\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Innovative Research in Computer Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55524/ijircst.2022.10.3.65","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文重点研究了将不同语音特征结合特征优化技术和语音去噪技术开发鲁棒性语音情感识别系统，以获得更高的情感分类精度，降低系统复杂度，并获得噪声鲁棒性。此外，我们还为SER创建了合并特性的原始方法。我们采用基于特征转换和特征选择机器学习技术的特征优化方法来构建SER。以下是即将举行的活动列表。神经网络可以使用这两种技术中的任何一种。由于考虑了更多的感受，特征融合获得的SER精度达不到预期，并且由于语音特征的增加，维度的瘟疫开始蔓延，这使得SER系统更难完成任务。这是因为添加语音元素后，SER系统变得更加复杂。因此，创建一个更值得信赖、具有最实用的功能并使用尽可能少的计算能力的SER系统至关重要。通过使用最大化当前特征的策略，可以通过将可访问选项的总数减少到更合理的水平来简化特征选择过程。本文采用一种称为半非负矩阵分解的方法来减少SER系统生成的处理垃圾的数量。(Semi-NMF)。这种方法可以用来改变能够自主学习的特征。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Review on Speech Emotion Recognition Using Machine Learning

This paper focuses on the development of a robust speech emotion recognition system using a combination of different speech features with feature optimization techniques and speech de-noising technique to acquire improved emotion classification accuracy, decreasing the system complexity and obtain noise robustness. Additionally, we create original methods for SER to merge features. We employ feature optimization methods that are based on the feature transformation and feature selection machine learning techniques in order to build SER. The following is a list of the upcoming events. A neural network can use either of these two techniques. As more feelings are taken into account, the feature fusion-acquired SER accuracy falls short of expectations, and the plague of dimensionality starts to spread due to the addition of speech features, which makes the SER system work harder to complete its task. This is due to the SER system becoming more complicated when voice elements are added. Therefore, it is crucial to create a SER system that is more trustworthy, has the most practical features, and uses the least amount of computing power possible. By using strategies that maximize current features, it is possible to streamline the feature selection process by reducing the total number of accessible choices to a more reasonable level. This piece employs a method known as Semi-Non Negative Matrix Factorization to lessen the amount of processing trash that the SER system generates. (Semi-NMF). This approach can be used to change traits that are capable of learning on their own.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Innovative Research in Computer Science and Technology

自引率

0.00%

发文量