N. Shanthi, Albert Alexander Stonier, Anli Sherine, T. Devaraju, S. Abinash, R. Ajay, V. Arul Prasath, Vivekananda Ganji
{"title":"An integrated approach for mental health assessment using emotion analysis and scales","authors":"N. Shanthi, Albert Alexander Stonier, Anli Sherine, T. Devaraju, S. Abinash, R. Ajay, V. Arul Prasath, Vivekananda Ganji","doi":"10.1049/htl2.12040","DOIUrl":null,"url":null,"abstract":"<p>Depression is a prominent cause of mental illness, which could primarily increase early death. It is possible that this is the root of suicidal ideation, and it causes severe impairment in daily life. By detecting human face traits, artificial intelligence (AI) has cleared the road for predicting human emotions. This predictive technique will be used to conduct a preliminary assessment of depression. Prediction is accomplished using a mixture of four modules namely Facial Emotion Recognition (FER), Scales Questionnaire, Speech Emotion Recognition (SER), and Doctor Chat. FER2013 dataset is used for the FER module, while for speech-based recognition, RAVDESS, TESS, SAVEE, and CREMA-D are collectively used. To improve the accuracy of the FER, the people in the given image will be fed into a Face API created with TensorFlow JS, which will eventually be given to the proposed model that will recognize human faces in the image. For SER, a python library known as Librosa is used for extracting audio features and it will be fed to the proposed model. The scales module of the app has questionnaires that can be answered, and the result can be generated based on the scores obtained using established scales used in modern psychology such as the HAM-D, YMRS etc., Though deep learning can predict emotions, the user may choose to speak with a real doctor about the issues to clear up any doubts. The application has a Doctor Chat module, which is essentially a chat bot for interacting with a doctor. Using this module, the users can talk, exchange files, and have their questions answered. The accuracy of FER is 91% whereas for SER, it is 82% on the test sets. The proposed approach produces the highest accuracy for the benchmark dataset. These four modules will work together to produce a homogenous depression report.</p>","PeriodicalId":37474,"journal":{"name":"Healthcare Technology Letters","volume":"12 1","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/htl2.12040","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Technology Letters","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/htl2.12040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Depression is a prominent cause of mental illness, which could primarily increase early death. It is possible that this is the root of suicidal ideation, and it causes severe impairment in daily life. By detecting human face traits, artificial intelligence (AI) has cleared the road for predicting human emotions. This predictive technique will be used to conduct a preliminary assessment of depression. Prediction is accomplished using a mixture of four modules namely Facial Emotion Recognition (FER), Scales Questionnaire, Speech Emotion Recognition (SER), and Doctor Chat. FER2013 dataset is used for the FER module, while for speech-based recognition, RAVDESS, TESS, SAVEE, and CREMA-D are collectively used. To improve the accuracy of the FER, the people in the given image will be fed into a Face API created with TensorFlow JS, which will eventually be given to the proposed model that will recognize human faces in the image. For SER, a python library known as Librosa is used for extracting audio features and it will be fed to the proposed model. The scales module of the app has questionnaires that can be answered, and the result can be generated based on the scores obtained using established scales used in modern psychology such as the HAM-D, YMRS etc., Though deep learning can predict emotions, the user may choose to speak with a real doctor about the issues to clear up any doubts. The application has a Doctor Chat module, which is essentially a chat bot for interacting with a doctor. Using this module, the users can talk, exchange files, and have their questions answered. The accuracy of FER is 91% whereas for SER, it is 82% on the test sets. The proposed approach produces the highest accuracy for the benchmark dataset. These four modules will work together to produce a homogenous depression report.
期刊介绍:
Healthcare Technology Letters aims to bring together an audience of biomedical and electrical engineers, physical and computer scientists, and mathematicians to enable the exchange of the latest ideas and advances through rapid online publication of original healthcare technology research. Major themes of the journal include (but are not limited to): Major technological/methodological areas: Biomedical signal processing Biomedical imaging and image processing Bioinstrumentation (sensors, wearable technologies, etc) Biomedical informatics Major application areas: Cardiovascular and respiratory systems engineering Neural engineering, neuromuscular systems Rehabilitation engineering Bio-robotics, surgical planning and biomechanics Therapeutic and diagnostic systems, devices and technologies Clinical engineering Healthcare information systems, telemedicine, mHealth.