Explainable Causal Analysis of Mental Health on Social Media Data

Chandni Saxena, Muskan Garg, G. Saxena
{"title":"社交媒体数据对心理健康的可解释因果分析","authors":"Chandni Saxena, Muskan Garg, G. Saxena","doi":"10.48550/arXiv.2210.08430","DOIUrl":null,"url":null,"abstract":"With recent developments in Social Computing, Natural Language Processing and Clinical Psychology, the social NLP research community addresses the challenge of automation in mental illness on social media. A recent extension to the problem of multi-class classification of mental health issues is to identify the cause behind the user's intention. However, multi-class causal categorization for mental health issues on social media has a major challenge of wrong prediction due to the overlapping problem of causal explanations. There are two possible mitigation techniques to solve this problem: (i) Inconsistency among causal explanations/ inappropriate human-annotated inferences in the dataset, (ii) in-depth analysis of arguments and stances in self-reported text using discourse analysis. In this research work, we hypothesise that if there exists the inconsistency among F1 scores of different classes, there must be inconsistency among corresponding causal explanations as well. In this task, we fine tune the classifiers and find explanations for multi-class causal categorization of mental illness on social media with LIME and Integrated Gradient (IG) methods. We test our methods with CAMS dataset and validate with annotated interpretations. A key contribution of this research work is to find the reason behind inconsistency in accuracy of multi-class causal categorization. The effectiveness of our methods is evident with the results obtained having category-wise average scores of $81.29 \\%$ and $0.906$ using cosine similarity and word mover's distance, respectively.","PeriodicalId":281152,"journal":{"name":"International Conference on Neural Information Processing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Explainable Causal Analysis of Mental Health on Social Media Data\",\"authors\":\"Chandni Saxena, Muskan Garg, G. Saxena\",\"doi\":\"10.48550/arXiv.2210.08430\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With recent developments in Social Computing, Natural Language Processing and Clinical Psychology, the social NLP research community addresses the challenge of automation in mental illness on social media. A recent extension to the problem of multi-class classification of mental health issues is to identify the cause behind the user's intention. However, multi-class causal categorization for mental health issues on social media has a major challenge of wrong prediction due to the overlapping problem of causal explanations. There are two possible mitigation techniques to solve this problem: (i) Inconsistency among causal explanations/ inappropriate human-annotated inferences in the dataset, (ii) in-depth analysis of arguments and stances in self-reported text using discourse analysis. In this research work, we hypothesise that if there exists the inconsistency among F1 scores of different classes, there must be inconsistency among corresponding causal explanations as well. In this task, we fine tune the classifiers and find explanations for multi-class causal categorization of mental illness on social media with LIME and Integrated Gradient (IG) methods. We test our methods with CAMS dataset and validate with annotated interpretations. 
A key contribution of this research work is to find the reason behind inconsistency in accuracy of multi-class causal categorization. The effectiveness of our methods is evident with the results obtained having category-wise average scores of $81.29 \\\\%$ and $0.906$ using cosine similarity and word mover's distance, respectively.\",\"PeriodicalId\":281152,\"journal\":{\"name\":\"International Conference on Neural Information Processing\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Neural Information Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2210.08430\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Neural Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2210.08430","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 4

Abstract

With recent developments in social computing, natural language processing, and clinical psychology, the social NLP research community has taken up the challenge of automating mental-illness analysis on social media. A recent extension of the multi-class classification of mental health issues is identifying the cause behind a user's intention. However, multi-class causal categorization of mental health issues on social media suffers from wrong predictions because causal explanations overlap across classes. Two factors can be examined to mitigate this problem: (i) inconsistency among causal explanations and inappropriate human-annotated inferences in the dataset, and (ii) in-depth analysis of the arguments and stances in self-reported text using discourse analysis. In this work, we hypothesise that if the F1 scores of different classes are inconsistent, the corresponding causal explanations must be inconsistent as well. We fine-tune classifiers and extract explanations for multi-class causal categorization of mental illness on social media using the LIME and Integrated Gradients (IG) methods. We test our methods on the CAMS dataset and validate the explanations against its annotated interpretations. A key contribution of this work is identifying the reason behind the inconsistent accuracy of multi-class causal categorization. The effectiveness of our methods is evident in category-wise average scores of 81.29% and 0.906 using cosine similarity and Word Mover's Distance, respectively.
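The pipeline described above has three moving parts: a fine-tuned multi-class classifier, post-hoc explainers (LIME and Integrated Gradients), and agreement metrics computed against human-annotated causes. The Python sketches below illustrate each step under stated assumptions: the checkpoint name, label strings, and example post are hypothetical placeholders rather than the authors' released artifacts, and the paper's exact preprocessing and scoring may differ.

# Sketch 1: LIME over a fine-tuned Hugging Face classifier. LIME only
# needs a function mapping raw strings to class probabilities.
import torch
from lime.lime_text import LimeTextExplainer
from transformers import AutoModelForSequenceClassification, AutoTokenizer

MODEL_NAME = "my-org/cams-causal-classifier"  # hypothetical checkpoint
CLASS_NAMES = ["no reason", "bias or abuse", "jobs and careers",
               "medication", "relationship", "alienation"]  # paraphrased CAMS-style labels

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME).eval()

def predict_proba(texts):
    # Return an (n_texts, n_classes) probability array, as LIME expects.
    enc = tokenizer(list(texts), padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        return torch.softmax(model(**enc).logits, dim=-1).numpy()

post = "I lost my job last month and now I feel like a burden to everyone."
explainer = LimeTextExplainer(class_names=CLASS_NAMES)
exp = explainer.explain_instance(post, predict_proba, num_features=8, top_labels=1)
top = exp.available_labels()[0]
print(CLASS_NAMES[top], exp.as_list(label=top))  # (token, weight) pairs

Integrated Gradients works on the embedding layer rather than raw strings; Captum's LayerIntegratedGradients is one common route. The model.bert.embeddings path below assumes a BERT-style encoder.

# Sketch 2: token attributions with Integrated Gradients via Captum.
from captum.attr import LayerIntegratedGradients

def forward_logits(input_ids, attention_mask):
    return model(input_ids=input_ids, attention_mask=attention_mask).logits

enc = tokenizer(post, return_tensors="pt")
pred = int(model(**enc).logits.argmax(dim=-1))
# Baseline: the same sequence with every token replaced by [PAD].
baseline = torch.full_like(enc["input_ids"], tokenizer.pad_token_id)
lig = LayerIntegratedGradients(forward_logits, model.bert.embeddings)
attrs = lig.attribute(enc["input_ids"], baselines=baseline,
                      additional_forward_args=(enc["attention_mask"],),
                      target=pred)
scores = attrs.sum(dim=-1).squeeze(0)  # one attribution score per token
tokens = tokenizer.convert_ids_to_tokens(enc["input_ids"][0])
print(sorted(zip(tokens, scores.tolist()), key=lambda t: -abs(t[1]))[:8])

The extracted tokens can then be compared with the dataset's annotated interpretations using the two metrics named in the abstract. Bag-of-words cosine similarity and gensim's WMD over pretrained vectors are one plausible instantiation; the paper may vectorize the texts differently.

# Sketch 3: agreement between machine and human explanations.
import gensim.downloader as api
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.metrics.pairwise import cosine_similarity

predicted = "lost job burden month"                       # top LIME/IG tokens
annotated = "losing his job made him feel like a burden"  # human rationale

bow = CountVectorizer().fit_transform([predicted, annotated])
print("cosine similarity:", cosine_similarity(bow[0], bow[1])[0, 0])

wv = api.load("glove-wiki-gigaword-100")  # pretrained word vectors
print("word mover's distance:", wv.wmdistance(predicted.split(), annotated.split()))

Note that cosine similarity is a similarity (higher means closer agreement) while raw WMD is a distance (lower means closer), so the 81.29% and 0.906 reported in the abstract are not on the same scale; how the paper normalizes the WMD score is not specified here.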