Ankit Vishwakarma, Sahil Sawant, Prerana Sawant, R. Shankarmani
{"title":"一个有情感意识的朋友:走向人工通用智能","authors":"Ankit Vishwakarma, Sahil Sawant, Prerana Sawant, R. Shankarmani","doi":"10.1109/ICIRCA51532.2021.9544616","DOIUrl":null,"url":null,"abstract":"Mental health is a leading cause of deaths, affecting over 450 million people globally. There are existing emotion recognition models to help and understand the state of a person but mainly via text. The proposed model in the paper is developed in a personalized multi-modal architecture to incorporate all the necessary aspects to predict the cumulative emotional status of a person by his/her text context, speech features, and facial expressions. There are mainly 3 different models: Bidirectional Encoder Representations from Transformers, Multi-layer Perceptron Classifier and Convolutional Neural Network working together in synchronization to cater to the need. Along with it, the advancement implemented includes General Adversarial Networks (GAN), to generate a human entity and help the human to cope up with their emotional state and practically save them from any kind of grave danger. The model with the help of GAN and lip-synced model manages to converse with the user after analyzing and considering their mental state, helping them to find a solution accordingly.","PeriodicalId":245244,"journal":{"name":"2021 Third International Conference on Inventive Research in Computing Applications (ICIRCA)","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An Emotionally Aware Friend: Moving Towards Artificial General Intelligence\",\"authors\":\"Ankit Vishwakarma, Sahil Sawant, Prerana Sawant, R. Shankarmani\",\"doi\":\"10.1109/ICIRCA51532.2021.9544616\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Mental health is a leading cause of deaths, affecting over 450 million people globally. There are existing emotion recognition models to help and understand the state of a person but mainly via text. The proposed model in the paper is developed in a personalized multi-modal architecture to incorporate all the necessary aspects to predict the cumulative emotional status of a person by his/her text context, speech features, and facial expressions. There are mainly 3 different models: Bidirectional Encoder Representations from Transformers, Multi-layer Perceptron Classifier and Convolutional Neural Network working together in synchronization to cater to the need. Along with it, the advancement implemented includes General Adversarial Networks (GAN), to generate a human entity and help the human to cope up with their emotional state and practically save them from any kind of grave danger. The model with the help of GAN and lip-synced model manages to converse with the user after analyzing and considering their mental state, helping them to find a solution accordingly.\",\"PeriodicalId\":245244,\"journal\":{\"name\":\"2021 Third International Conference on Inventive Research in Computing Applications (ICIRCA)\",\"volume\":\"107 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Third International Conference on Inventive Research in Computing Applications (ICIRCA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIRCA51532.2021.9544616\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Third International Conference on Inventive Research in Computing Applications (ICIRCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIRCA51532.2021.9544616","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Emotionally Aware Friend: Moving Towards Artificial General Intelligence
Mental health is a leading cause of deaths, affecting over 450 million people globally. There are existing emotion recognition models to help and understand the state of a person but mainly via text. The proposed model in the paper is developed in a personalized multi-modal architecture to incorporate all the necessary aspects to predict the cumulative emotional status of a person by his/her text context, speech features, and facial expressions. There are mainly 3 different models: Bidirectional Encoder Representations from Transformers, Multi-layer Perceptron Classifier and Convolutional Neural Network working together in synchronization to cater to the need. Along with it, the advancement implemented includes General Adversarial Networks (GAN), to generate a human entity and help the human to cope up with their emotional state and practically save them from any kind of grave danger. The model with the help of GAN and lip-synced model manages to converse with the user after analyzing and considering their mental state, helping them to find a solution accordingly.