基于情绪一致性的多模态会话图会话健康检测

Proceedings of the 2023 6th International Conference on Machine Vision and Applications Pub Date : 2023-03-10 DOI:10.1145/3589572.3589588

Kruthika Suresh, Mayuri D Patil, Shrikar Madhu, Yousha Mahamuni, Bhaskarjyoti Das

{"title":"基于情绪一致性的多模态会话图会话健康检测","authors":"Kruthika Suresh, Mayuri D Patil, Shrikar Madhu, Yousha Mahamuni, Bhaskarjyoti Das","doi":"10.1145/3589572.3589588","DOIUrl":null,"url":null,"abstract":"With the advent of social media and technology, the increased connections between individuals and organizations have led to a similar increase in the number of conversations. These conversations, in most cases are bimodal in nature, consisting of both images and text. Existing work in multimodal conversation typically focuses on individual utterances rather than the overall dialogue. The aspect of conversational health is important in many real world conversational uses cases including the emerging world of Metaverse. The work described in this paper investigates conversational health from the viewpoint of emotional concordance in bimodal conversations modelled as graphs. Using this framework, an existing multimodal dialogue dataset has been reformatted as a graph dataset that is labelled with the emotional concordance score. In this work, determination of conversational health has been framed as a graph classification problem. A graph neural network based model using algorithms such as Graph Convolution Network and Graph Attention Network is then used to detect the emotional concordance or discordance based upon the multimodal conversation that is provided. The model proposed in this paper achieves an overall F1 Score of 0.71 for equally sized class training and testing size, which offers improved results compared to previous models using the same benchmark dataset.","PeriodicalId":296325,"journal":{"name":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detection of Conversational Health in a Multimodal Conversation Graph by Measuring Emotional Concordance\",\"authors\":\"Kruthika Suresh, Mayuri D Patil, Shrikar Madhu, Yousha Mahamuni, Bhaskarjyoti Das\",\"doi\":\"10.1145/3589572.3589588\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advent of social media and technology, the increased connections between individuals and organizations have led to a similar increase in the number of conversations. These conversations, in most cases are bimodal in nature, consisting of both images and text. Existing work in multimodal conversation typically focuses on individual utterances rather than the overall dialogue. The aspect of conversational health is important in many real world conversational uses cases including the emerging world of Metaverse. The work described in this paper investigates conversational health from the viewpoint of emotional concordance in bimodal conversations modelled as graphs. Using this framework, an existing multimodal dialogue dataset has been reformatted as a graph dataset that is labelled with the emotional concordance score. In this work, determination of conversational health has been framed as a graph classification problem. A graph neural network based model using algorithms such as Graph Convolution Network and Graph Attention Network is then used to detect the emotional concordance or discordance based upon the multimodal conversation that is provided. The model proposed in this paper achieves an overall F1 Score of 0.71 for equally sized class training and testing size, which offers improved results compared to previous models using the same benchmark dataset.\",\"PeriodicalId\":296325,\"journal\":{\"name\":\"Proceedings of the 2023 6th International Conference on Machine Vision and Applications\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 6th International Conference on Machine Vision and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3589572.3589588\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 6th International Conference on Machine Vision and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3589572.3589588","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着社交媒体和技术的出现，个人和组织之间联系的增加导致了对话数量的增加。在大多数情况下，这些对话本质上是双峰的，由图像和文本组成。现有的多模态会话研究通常侧重于单个话语，而不是整体对话。会话运行状况在许多真实世界的会话用例中都很重要，包括新兴的Metaverse世界。本文所描述的工作是从情感和谐的角度来研究会话健康的双峰对话模型。使用该框架，现有的多模态对话数据集被重新格式化为带有情感一致性分数标记的图形数据集。在这项工作中，会话健康的确定被框架为一个图分类问题。然后使用基于图卷积网络和图注意网络等算法的基于图神经网络的模型来检测基于所提供的多模态对话的情感一致性或不一致性。本文提出的模型在相同大小的类训练和测试规模下获得了0.71的F1总分，与使用相同基准数据集的先前模型相比，结果有所改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Detection of Conversational Health in a Multimodal Conversation Graph by Measuring Emotional Concordance

With the advent of social media and technology, the increased connections between individuals and organizations have led to a similar increase in the number of conversations. These conversations, in most cases are bimodal in nature, consisting of both images and text. Existing work in multimodal conversation typically focuses on individual utterances rather than the overall dialogue. The aspect of conversational health is important in many real world conversational uses cases including the emerging world of Metaverse. The work described in this paper investigates conversational health from the viewpoint of emotional concordance in bimodal conversations modelled as graphs. Using this framework, an existing multimodal dialogue dataset has been reformatted as a graph dataset that is labelled with the emotional concordance score. In this work, determination of conversational health has been framed as a graph classification problem. A graph neural network based model using algorithms such as Graph Convolution Network and Graph Attention Network is then used to detect the emotional concordance or discordance based upon the multimodal conversation that is provided. The model proposed in this paper achieves an overall F1 Score of 0.71 for equally sized class training and testing size, which offers improved results compared to previous models using the same benchmark dataset.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2023 6th International Conference on Machine Vision and Applications

自引率

0.00%

发文量