Explainable machine learning analysis of longitudinal mental health trajectories after breast cancer diagnosis

E. Mylona, Konstantina Kourou, Georgios C. Manikis, H. Kondylakis, E. Karademas, K. Marias, K. Mazzocco, P. Poikonen-Saksela, R. Pat-Horenczyk, B. Sousa, P. Simos, D. Fotiadis
{"title":"Explainable machine learning analysis of longitudinal mental health trajectories after breast cancer diagnosis","authors":"E. Mylona, Konstantina Kourou, Georgios C. Manikis, H. Kondylakis, E. Karademas, K. Marias, K. Mazzocco, P. Poikonen-Saksela, R. Pat-Horenczyk, B. Sousa, P. Simos, D. Fotiadis","doi":"10.1109/BHI56158.2022.9926952","DOIUrl":null,"url":null,"abstract":"Mental health impairment after breast cancer diagnosis may persist for months or years. The present work leverages on novel machine learning techniques to identify distinct trajectories of mental health progression in an 18-month period following BC diagnosis and develop an explainable predictive model of mental health progression using a large list of clinical, sociodemographic and psychological variables. The modelling process was conducted in two phases. The first modeling step included an unsupervised clustering to define the number of trajectory clusters, by means of a longitudinal K-means algorithm. In the second modeling step an explainable ML framework was developed, on the basis of Extreme Gradient Boosting (XGBoost) model and SHAP values, in order to identify the most prominent variables that can discriminate between good and unfavorable mental health progression and to explain how they contribute to model's decisions. The trajectory analysis revealed 5 distinct trajectory groups with the majority of patients following stable good (56%) or improving (21%) trends, while for others mental health levels either deteriorated (12%) or remained at unsatisfactory levels (11%). The model's performance for classifying patient mental health into good and unfavorable progression achieved an AUC of $0.82\\pm 0.04$. The top ranking predictors driving the classification task were the higher number of sick leave days, aggressive cancer type (triple-negative) and higher levels of negative affect, anxious preoccupation, helplessness, arm and breast symptoms, as well as lower values of optimism, social and emotional support and lower age.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BHI56158.2022.9926952","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Mental health impairment after breast cancer diagnosis may persist for months or years. The present work leverages on novel machine learning techniques to identify distinct trajectories of mental health progression in an 18-month period following BC diagnosis and develop an explainable predictive model of mental health progression using a large list of clinical, sociodemographic and psychological variables. The modelling process was conducted in two phases. The first modeling step included an unsupervised clustering to define the number of trajectory clusters, by means of a longitudinal K-means algorithm. In the second modeling step an explainable ML framework was developed, on the basis of Extreme Gradient Boosting (XGBoost) model and SHAP values, in order to identify the most prominent variables that can discriminate between good and unfavorable mental health progression and to explain how they contribute to model's decisions. The trajectory analysis revealed 5 distinct trajectory groups with the majority of patients following stable good (56%) or improving (21%) trends, while for others mental health levels either deteriorated (12%) or remained at unsatisfactory levels (11%). The model's performance for classifying patient mental health into good and unfavorable progression achieved an AUC of $0.82\pm 0.04$. The top ranking predictors driving the classification task were the higher number of sick leave days, aggressive cancer type (triple-negative) and higher levels of negative affect, anxious preoccupation, helplessness, arm and breast symptoms, as well as lower values of optimism, social and emotional support and lower age.
乳腺癌诊断后纵向心理健康轨迹的可解释机器学习分析
乳腺癌诊断后的心理健康损害可能持续数月或数年。目前的工作利用新颖的机器学习技术来识别BC诊断后18个月期间心理健康进展的不同轨迹,并使用大量临床,社会人口统计学和心理变量开发可解释的心理健康进展预测模型。建模过程分两个阶段进行。第一步建模包括一个无监督聚类,通过纵向K-means算法来定义轨迹聚类的数量。在第二个建模步骤中,基于极端梯度增强(XGBoost)模型和SHAP值,开发了一个可解释的ML框架,以确定可以区分良好和不利心理健康进展的最突出变量,并解释它们如何对模型决策做出贡献。轨迹分析显示了5个不同的轨迹组,大多数患者遵循稳定的良好(56%)或改善(21%)趋势,而其他患者的心理健康水平要么恶化(12%),要么保持在不满意的水平(11%)。该模型将患者心理健康分为良好和不良进展的AUC为0.82\pm 0.04$。推动分类任务的排名最高的预测因素是病假天数较多、侵略性癌症类型(三阴性)和较高水平的负面情绪、焦虑、无助、手臂和乳房症状,以及乐观、社会和情感支持的较低值和较低的年龄。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信