{"title":"XAI Transformer based Approach for Interpreting Depressed and Suicidal User Behavior on Online Social Networks","authors":"Anshu Malhotra, Rajni Jindal","doi":"10.1016/j.cogsys.2023.101186","DOIUrl":null,"url":null,"abstract":"<div><p><span><span><span><span><span><span>Online social networks can be used for mental healthcare monitoring using </span>Artificial Intelligence<span> and Machine Learning techniques for detecting various </span></span>mental health disorders and corresponding risk assessment. Recent research in this domain has primarily been focused on leveraging deep </span>neural networks<span> and various Transformer based Large Language Models, which have now become state-of-the-art for most </span></span>natural language processing<span><span> and computational linguistic<span><span><span> tasks due to their unmatched prediction accuracy. Unlike conventional machine learning algorithms, these deep neural networks are black box architectures, where it is difficult to interpret and explain their predicted outcome. However, a black box classification outcome is insufficient for healthcare applications. Such systems will not be widely adopted and trusted by healthcare practitioners if they are not able to understand and explain the reasoning behind the predicted decisions made by an AI and ML based healthcare diagnostic system. The key objective of our research is to demonstrate the applications of model agnostic, post hoc surrogate </span>XAI techniques for providing explainability to classification decisions of pretrained LLMs (Transformers) based mental healthcare diagnostic systems fine-tuned (or trained) to detect depressive and suicidal </span>behavior using UGC from online social networks. For this, we have used the two most recent and popular techniques, SHAP and LIME. We have conducted extensive and in-depth experiments with four datasets and six pretrained LLMs, three of which have already been domain-adapted using mental health related datasets. We have also performed </span></span>Few Shot Learning experiments with these three pretrained mental health domain-adapted LLMs. The results of qualitative and </span></span>descriptive data analysis in this paper demonstrate that in order to build a comprehensive understanding of a person’s psychological state, emotion, and behavior and to discover the causes, symptoms, and triggers of mental health issues, it is essential to utilize e</span><em>X</em>pl<em>AI</em>nable <em>(XAI)</em> techniques with Transformer based LLMs (supervised). Alternatively, Transformer based unsupervised topic modeling technique BERTopic may be used for mental health risk monitoring and cause or symptom extraction when supervised training of LLMs is not feasible due to dataset annotation or availability challenges.</p></div>","PeriodicalId":55242,"journal":{"name":"Cognitive Systems Research","volume":"84 ","pages":"Article 101186"},"PeriodicalIF":2.1000,"publicationDate":"2023-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Systems Research","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1389041723001201","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Online social networks can be used for mental healthcare monitoring using Artificial Intelligence and Machine Learning techniques for detecting various mental health disorders and corresponding risk assessment. Recent research in this domain has primarily been focused on leveraging deep neural networks and various Transformer based Large Language Models, which have now become state-of-the-art for most natural language processing and computational linguistic tasks due to their unmatched prediction accuracy. Unlike conventional machine learning algorithms, these deep neural networks are black box architectures, where it is difficult to interpret and explain their predicted outcome. However, a black box classification outcome is insufficient for healthcare applications. Such systems will not be widely adopted and trusted by healthcare practitioners if they are not able to understand and explain the reasoning behind the predicted decisions made by an AI and ML based healthcare diagnostic system. The key objective of our research is to demonstrate the applications of model agnostic, post hoc surrogate XAI techniques for providing explainability to classification decisions of pretrained LLMs (Transformers) based mental healthcare diagnostic systems fine-tuned (or trained) to detect depressive and suicidal behavior using UGC from online social networks. For this, we have used the two most recent and popular techniques, SHAP and LIME. We have conducted extensive and in-depth experiments with four datasets and six pretrained LLMs, three of which have already been domain-adapted using mental health related datasets. We have also performed Few Shot Learning experiments with these three pretrained mental health domain-adapted LLMs. The results of qualitative and descriptive data analysis in this paper demonstrate that in order to build a comprehensive understanding of a person’s psychological state, emotion, and behavior and to discover the causes, symptoms, and triggers of mental health issues, it is essential to utilize eXplAInable (XAI) techniques with Transformer based LLMs (supervised). Alternatively, Transformer based unsupervised topic modeling technique BERTopic may be used for mental health risk monitoring and cause or symptom extraction when supervised training of LLMs is not feasible due to dataset annotation or availability challenges.
期刊介绍:
Cognitive Systems Research is dedicated to the study of human-level cognition. As such, it welcomes papers which advance the understanding, design and applications of cognitive and intelligent systems, both natural and artificial.
The journal brings together a broad community studying cognition in its many facets in vivo and in silico, across the developmental spectrum, focusing on individual capacities or on entire architectures. It aims to foster debate and integrate ideas, concepts, constructs, theories, models and techniques from across different disciplines and different perspectives on human-level cognition. The scope of interest includes the study of cognitive capacities and architectures - both brain-inspired and non-brain-inspired - and the application of cognitive systems to real-world problems as far as it offers insights relevant for the understanding of cognition.
Cognitive Systems Research therefore welcomes mature and cutting-edge research approaching cognition from a systems-oriented perspective, both theoretical and empirically-informed, in the form of original manuscripts, short communications, opinion articles, systematic reviews, and topical survey articles from the fields of Cognitive Science (including Philosophy of Cognitive Science), Artificial Intelligence/Computer Science, Cognitive Robotics, Developmental Science, Psychology, and Neuroscience and Neuromorphic Engineering. Empirical studies will be considered if they are supplemented by theoretical analyses and contributions to theory development and/or computational modelling studies.