CD-Surv:一个基于对比的动态生存分析模型。

IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS
Health Information Science and Systems Pub Date : 2022-04-12 eCollection Date: 2022-12-01 DOI:10.1007/s13755-022-00173-z
Caogen Hong, Jinbiao Chen, Fan Yi, Yuzhe Hao, Fanwen Meng, Zhanghuiya Dong, Hui Lin, Zhengxing Huang
{"title":"CD-Surv:一个基于对比的动态生存分析模型。","authors":"Caogen Hong, Jinbiao Chen, Fan Yi, Yuzhe Hao, Fanwen Meng, Zhanghuiya Dong, Hui Lin, Zhengxing Huang","doi":"10.1007/s13755-022-00173-z","DOIUrl":null,"url":null,"abstract":"<p><p>Survival analysis, aimed at investigating the relationships between covariates and event time, has exhibited profound effects on health service management. Longitudinal data with sequential patterns, such as electronic health records (EHRs), contain a large volume of patient treatment trajectories, and therefore, provide great potential for survival analysis. However, most existing studies address the survival analysis problem in a static manner, that is, they only utilize a fraction of longitudinal data, ignore the correlations between multiple visits, and usually may not be able to capture the latent representations of patient treatment trajectories. This inevitably deteriorates the performance of the survival analysis. To address this challenge, we propose an end-to-end contrastive-based model <i>CD-Surv</i> to better understand the patient treatment trajectories and dynamically predict the survival probability of a target patient. Specifically, two data augmentation strategies, namely, <i>mask generation</i> and <i>shuffle generation</i>, are adopted to augment the real treatment trajectories documented in the EHR. Based on this, the hidden representations of the real trajectories can be improved by utilizing contrastive learning between augmented and real trajectories. We evaluated our proposed CD-Surv on two real-world datasets, and the experimental results indicated that our proposed model could outperform state-of-the-art baselines on various evaluation metrics.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"10 1","pages":"5"},"PeriodicalIF":3.4000,"publicationDate":"2022-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9005562/pdf/","citationCount":"0","resultStr":"{\"title\":\"CD-Surv: a contrastive-based model for dynamic survival analysis.\",\"authors\":\"Caogen Hong, Jinbiao Chen, Fan Yi, Yuzhe Hao, Fanwen Meng, Zhanghuiya Dong, Hui Lin, Zhengxing Huang\",\"doi\":\"10.1007/s13755-022-00173-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Survival analysis, aimed at investigating the relationships between covariates and event time, has exhibited profound effects on health service management. Longitudinal data with sequential patterns, such as electronic health records (EHRs), contain a large volume of patient treatment trajectories, and therefore, provide great potential for survival analysis. However, most existing studies address the survival analysis problem in a static manner, that is, they only utilize a fraction of longitudinal data, ignore the correlations between multiple visits, and usually may not be able to capture the latent representations of patient treatment trajectories. This inevitably deteriorates the performance of the survival analysis. To address this challenge, we propose an end-to-end contrastive-based model <i>CD-Surv</i> to better understand the patient treatment trajectories and dynamically predict the survival probability of a target patient. Specifically, two data augmentation strategies, namely, <i>mask generation</i> and <i>shuffle generation</i>, are adopted to augment the real treatment trajectories documented in the EHR. Based on this, the hidden representations of the real trajectories can be improved by utilizing contrastive learning between augmented and real trajectories. We evaluated our proposed CD-Surv on two real-world datasets, and the experimental results indicated that our proposed model could outperform state-of-the-art baselines on various evaluation metrics.</p>\",\"PeriodicalId\":46312,\"journal\":{\"name\":\"Health Information Science and Systems\",\"volume\":\"10 1\",\"pages\":\"5\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2022-04-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9005562/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health Information Science and Systems\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s13755-022-00173-z\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2022/12/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Information Science and Systems","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13755-022-00173-z","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/12/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

摘要

生存分析旨在调查协变量与事件时间之间的关系,对卫生服务管理产生了深远的影响。具有顺序模式的纵向数据,如电子健康记录(EHRs),包含大量患者治疗轨迹,因此为生存分析提供了巨大的潜力。然而,大多数现有研究以静态方式解决生存分析问题,也就是说,它们只利用了一小部分纵向数据,忽略了多次就诊之间的相关性,并且通常可能无法捕获患者治疗轨迹的潜在表征。这不可避免地降低了生存分析的性能。为了应对这一挑战,我们提出了一种基于端到端对比的CD-Surv模型,以更好地了解患者的治疗轨迹并动态预测目标患者的生存概率。具体来说,采用了两种数据增强策略,即掩码生成和洗牌生成,来增强EHR中记录的真实治疗轨迹。在此基础上,利用增强轨迹和真实轨迹之间的对比学习,可以改进真实轨迹的隐藏表示。我们在两个真实世界的数据集上评估了我们提出的CD-Surv,实验结果表明,我们提出的模型在各种评估指标上都优于最先进的基线。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
CD-Surv: a contrastive-based model for dynamic survival analysis.

Survival analysis, aimed at investigating the relationships between covariates and event time, has exhibited profound effects on health service management. Longitudinal data with sequential patterns, such as electronic health records (EHRs), contain a large volume of patient treatment trajectories, and therefore, provide great potential for survival analysis. However, most existing studies address the survival analysis problem in a static manner, that is, they only utilize a fraction of longitudinal data, ignore the correlations between multiple visits, and usually may not be able to capture the latent representations of patient treatment trajectories. This inevitably deteriorates the performance of the survival analysis. To address this challenge, we propose an end-to-end contrastive-based model CD-Surv to better understand the patient treatment trajectories and dynamically predict the survival probability of a target patient. Specifically, two data augmentation strategies, namely, mask generation and shuffle generation, are adopted to augment the real treatment trajectories documented in the EHR. Based on this, the hidden representations of the real trajectories can be improved by utilizing contrastive learning between augmented and real trajectories. We evaluated our proposed CD-Surv on two real-world datasets, and the experimental results indicated that our proposed model could outperform state-of-the-art baselines on various evaluation metrics.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
11.30
自引率
5.00%
发文量
30
期刊介绍: Health Information Science and Systems is a multidisciplinary journal that integrates artificial intelligence/computer science/information technology with health science and services, embracing information science research coupled with topics related to the modeling, design, development, integration and management of health information systems, smart health, artificial intelligence in medicine, and computer aided diagnosis, medical expert systems. The scope includes: i.) smart health, artificial Intelligence in medicine, computer aided diagnosis, medical image processing, medical expert systems ii.) medical big data, medical/health/biomedicine information resources such as patient medical records, devices and equipments, software and tools to capture, store, retrieve, process, analyze, optimize the use of information in the health domain, iii.) data management, data mining, and knowledge discovery, all of which play a key role in decision making, management of public health, examination of standards, privacy and security issues, iv.) development of new architectures and applications for health information systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信