Predicting anxiety treatment outcomes with machine learning

Marija Stanojevic, Lesley A. Norris, P. Kendall, Z. Obradovic
{"title":"Predicting anxiety treatment outcomes with machine learning","authors":"Marija Stanojevic, Lesley A. Norris, P. Kendall, Z. Obradovic","doi":"10.1109/ICMLA55696.2022.00160","DOIUrl":null,"url":null,"abstract":"Youth anxiety disorders are highly prevalent and associated with considerable concurrent functional impairments. According to the State of the World’s Children report, 13% of youth between 10 and 19 years old have a diagnosed mental health disorder, 40% of which are anxious and depressive disorders. In a typical longitudinal anxiety clinical study, many explanatory variables are observed in a few patients. As patients drop or miss appointments, collected data has a high missing rate in explanatory and predicted variables. We suggest using machine learning methods to improve understanding of treatments and prediction of outcomes in such studies. We propose machine learning-based imputation for understanding youth anxiety data containing features with high missing rates. In the dataset used, the missing rate of features is up to 80%, making them impossible to use in traditional analysis. Our results show that the proposed iterative imputation with a bag of elastic net regressions imputes missing data better than traditional imputation methods and allow for the best prediction result. We investigate imputation and prediction performance change when using jointly data from multiple studies, where each study has a different bias and missing rate. Leveraging joint dataset allows for predicting the therapy outcome in longitudinal studies with few patients. Additionally, we can now impute or predict features and diagnoses not reported by the clinical study. In conducted experiments, pooling data from nine different studies resulted in 9.3% smaller imputation and 33% lower prediction errors, respectively. Results have higher confidence than when studies are considered separately. We also explored the performance of imputation and prediction in the domain adaptation case of withdrawn patients, in which 50% improvement is obtained when data from all studies are used to impute and train the model.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA55696.2022.00160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Youth anxiety disorders are highly prevalent and associated with considerable concurrent functional impairments. According to the State of the World’s Children report, 13% of youth between 10 and 19 years old have a diagnosed mental health disorder, 40% of which are anxious and depressive disorders. In a typical longitudinal anxiety clinical study, many explanatory variables are observed in a few patients. As patients drop or miss appointments, collected data has a high missing rate in explanatory and predicted variables. We suggest using machine learning methods to improve understanding of treatments and prediction of outcomes in such studies. We propose machine learning-based imputation for understanding youth anxiety data containing features with high missing rates. In the dataset used, the missing rate of features is up to 80%, making them impossible to use in traditional analysis. Our results show that the proposed iterative imputation with a bag of elastic net regressions imputes missing data better than traditional imputation methods and allow for the best prediction result. We investigate imputation and prediction performance change when using jointly data from multiple studies, where each study has a different bias and missing rate. Leveraging joint dataset allows for predicting the therapy outcome in longitudinal studies with few patients. Additionally, we can now impute or predict features and diagnoses not reported by the clinical study. In conducted experiments, pooling data from nine different studies resulted in 9.3% smaller imputation and 33% lower prediction errors, respectively. Results have higher confidence than when studies are considered separately. We also explored the performance of imputation and prediction in the domain adaptation case of withdrawn patients, in which 50% improvement is obtained when data from all studies are used to impute and train the model.
用机器学习预测焦虑治疗结果
青少年焦虑症非常普遍,并伴有相当多的并发功能障碍。根据《世界儿童状况报告》,10至19岁的青少年中有13%被诊断患有精神健康障碍,其中40%为焦虑和抑郁障碍。在一项典型的纵向焦虑临床研究中,在少数患者中观察到许多解释变量。由于患者放弃或错过预约,收集的数据在解释和预测变量中有很高的缺失率。我们建议在这类研究中使用机器学习方法来提高对治疗方法的理解和对结果的预测。我们提出基于机器学习的imputation来理解包含高缺失率特征的青少年焦虑数据。在所使用的数据集中,特征的缺失率高达80%,使得它们无法用于传统的分析。研究结果表明,该方法对缺失数据的补全效果优于传统的补全方法,预测结果最好。当使用来自多个研究的联合数据时,我们研究了imputation和预测性能的变化,其中每个研究都有不同的偏差和缺失率。利用联合数据集可以预测少量患者的纵向研究的治疗结果。此外,我们现在可以推测或预测临床研究未报告的特征和诊断。在已进行的实验中,汇集了来自9个不同研究的数据,分别使估算误差降低了9.3%和33%。结果比单独考虑研究时具有更高的可信度。我们还探讨了在退缩患者的领域适应情况下的输入和预测性能,当使用所有研究的数据来输入和训练模型时,获得了50%的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信