An Interpretable Trend Analysis Neural Networks for Longitudinal Data Analysis

Zhenjie Yao, Yixin Chen, Jinwei Wang, Junjuan Li, Shuohua Chen, Shouling Wu, Yanhui Tu, Ming-Hui Zhao, Luxia Zhang
{"title":"An Interpretable Trend Analysis Neural Networks for Longitudinal Data Analysis","authors":"Zhenjie Yao, Yixin Chen, Jinwei Wang, Junjuan Li, Shuohua Chen, Shouling Wu, Yanhui Tu, Ming-Hui Zhao, Luxia Zhang","doi":"10.1145/3648105","DOIUrl":null,"url":null,"abstract":"Cohort study is one of the most commonly used study methods in medical and public health researches, which result in longitudinal data. Conventional statistical models and machine learning methods are not capable of modeling the evolution trend of the variables in longitudinal data. In this paper, we propose a Trend Analysis Neural Networks (TANN), which models the evolution trend of the variables by adaptive feature learning. TANN was tested on dataset of Kaiuan research. The task was to predict occurrence of cardiovascular events within 2 and 5 years, with 3 repeated medical examinations during 2008 and 2013. For 2-year prediction, The AUC of the TANN is 0.7378, which is a significant improvement than that of conventional methods, while that of TRNS, RNN, DNN, GBDT, RF, and LR are 0.7222, 0.7034, 0.7054, 0.7136, 0.7160 and 0.7024, respectively. For 5-year prediction, TANN also shows improvement. The experimental results show that the proposed TANN achieves better prediction performance on cardiovascular events prediction than conventional models. Furthermore, by analyzing the weights of TANN, we could find out important trends of the indicators, which are ignored by conventional machine learning models. The trend discovery mechanism interprets the model well. TANN is an appropriate balance between high performance and interpretability.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":"22 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM transactions on computing for healthcare","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3648105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Cohort study is one of the most commonly used study methods in medical and public health researches, which result in longitudinal data. Conventional statistical models and machine learning methods are not capable of modeling the evolution trend of the variables in longitudinal data. In this paper, we propose a Trend Analysis Neural Networks (TANN), which models the evolution trend of the variables by adaptive feature learning. TANN was tested on dataset of Kaiuan research. The task was to predict occurrence of cardiovascular events within 2 and 5 years, with 3 repeated medical examinations during 2008 and 2013. For 2-year prediction, The AUC of the TANN is 0.7378, which is a significant improvement than that of conventional methods, while that of TRNS, RNN, DNN, GBDT, RF, and LR are 0.7222, 0.7034, 0.7054, 0.7136, 0.7160 and 0.7024, respectively. For 5-year prediction, TANN also shows improvement. The experimental results show that the proposed TANN achieves better prediction performance on cardiovascular events prediction than conventional models. Furthermore, by analyzing the weights of TANN, we could find out important trends of the indicators, which are ignored by conventional machine learning models. The trend discovery mechanism interprets the model well. TANN is an appropriate balance between high performance and interpretability.
用于纵向数据分析的可解释趋势分析神经网络
队列研究是医学和公共卫生研究中最常用的研究方法之一,其结果是纵向数据。传统的统计模型和机器学习方法无法对纵向数据中变量的演变趋势进行建模。本文提出了一种趋势分析神经网络(TANN),它通过自适应特征学习对变量的演变趋势进行建模。我们在开元研究的数据集上对 TANN 进行了测试。任务是通过 2008 年和 2013 年期间的 3 次重复体检预测 2 年和 5 年内心血管事件的发生率。在 2 年预测中,TANN 的 AUC 为 0.7378,比传统方法显著提高,而 TRNS、RNN、DNN、GBDT、RF 和 LR 的 AUC 分别为 0.7222、0.7034、0.7054、0.7136、0.7160 和 0.7024。在 5 年期预测方面,TANN 也有所改进。实验结果表明,与传统模型相比,所提出的 TANN 在心血管事件预测方面取得了更好的预测效果。此外,通过分析 TANN 的权重,我们可以发现指标的重要趋势,而传统的机器学习模型会忽略这些趋势。趋势发现机制很好地解释了模型。TANN 在高性能和可解释性之间取得了适当的平衡。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.30
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信