Swarna Kuchibhotla, Siva Sahitya Dogga, N. V. S. L. G. Vinay Thota, Gopi Puli, Niranjan M S R, H. D. Vankayalapati
{"title":"基于MFCC的递归神经网络语音情绪抑郁检测","authors":"Swarna Kuchibhotla, Siva Sahitya Dogga, N. V. S. L. G. Vinay Thota, Gopi Puli, Niranjan M S R, H. D. Vankayalapati","doi":"10.1109/ViTECoN58111.2023.10157779","DOIUrl":null,"url":null,"abstract":"Depression is a mental health disorder that affects millions of people worldwide. While there are many effective treatments for depression, the first step is often detecting the condition. In recent years, researchers have explored the use of machine learning algorithms to detect depression in speech patterns. Recurrent neural networks (RNNs) are a popular type of deep learning algorithm that can be used for this task. In this study, depression detection in speech using RNNs is used. The proposed system uses Mel-frequency cepstral coefficients (MFCCs) as input features to an RNN model. The RNN model is trained on an emotion dataset of speech recordings from individuals. The model is then used to classify new speech recordings based on stress/depression in each emotion. The experimental results show that RNNs are prominent for depression detection in speech. Future work could focus on improving the accuracy of the system by incorporating additional features in RNN architecture.","PeriodicalId":407488,"journal":{"name":"2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Depression Detection from Speech Emotions using MFCC based Recurrent Neural Network\",\"authors\":\"Swarna Kuchibhotla, Siva Sahitya Dogga, N. V. S. L. G. Vinay Thota, Gopi Puli, Niranjan M S R, H. D. Vankayalapati\",\"doi\":\"10.1109/ViTECoN58111.2023.10157779\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Depression is a mental health disorder that affects millions of people worldwide. While there are many effective treatments for depression, the first step is often detecting the condition. In recent years, researchers have explored the use of machine learning algorithms to detect depression in speech patterns. Recurrent neural networks (RNNs) are a popular type of deep learning algorithm that can be used for this task. In this study, depression detection in speech using RNNs is used. The proposed system uses Mel-frequency cepstral coefficients (MFCCs) as input features to an RNN model. The RNN model is trained on an emotion dataset of speech recordings from individuals. The model is then used to classify new speech recordings based on stress/depression in each emotion. The experimental results show that RNNs are prominent for depression detection in speech. Future work could focus on improving the accuracy of the system by incorporating additional features in RNN architecture.\",\"PeriodicalId\":407488,\"journal\":{\"name\":\"2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ViTECoN58111.2023.10157779\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ViTECoN58111.2023.10157779","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Depression Detection from Speech Emotions using MFCC based Recurrent Neural Network
Depression is a mental health disorder that affects millions of people worldwide. While there are many effective treatments for depression, the first step is often detecting the condition. In recent years, researchers have explored the use of machine learning algorithms to detect depression in speech patterns. Recurrent neural networks (RNNs) are a popular type of deep learning algorithm that can be used for this task. In this study, depression detection in speech using RNNs is used. The proposed system uses Mel-frequency cepstral coefficients (MFCCs) as input features to an RNN model. The RNN model is trained on an emotion dataset of speech recordings from individuals. The model is then used to classify new speech recordings based on stress/depression in each emotion. The experimental results show that RNNs are prominent for depression detection in speech. Future work could focus on improving the accuracy of the system by incorporating additional features in RNN architecture.