Junghwan Lee, Simin Ma, Nicoleta Serban, Shihao Yang
{"title":"基于深度学习的治疗加权逆概率的准确治疗效果估计。","authors":"Junghwan Lee, Simin Ma, Nicoleta Serban, Shihao Yang","doi":"10.1093/jamiaopen/ooaf032","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Observational data have been actively used to estimate treatment effect, driven by the growing availability of electronic health records (EHRs). However, EHRs typically consist of longitudinal records, often introducing time-dependent confounding that hinder the unbiased estimation of treatment effect. Inverse probability of treatment weighting (IPTW) is a widely used propensity score method since it provides unbiased treatment effect estimation and its derivation is straightforward. In this study, we aim to utilize IPTW to estimate treatment effect in the presence of time-dependent confounding using claims records.</p><p><strong>Materials and methods: </strong>Previous studies have utilized propensity score methods with features derived from claims records through feature processing, which generally requires domain knowledge and additional resources to extract information to accurately estimate propensity scores. Deep learning, particularly using deep sequence models such as recurrent neural networks and Transformer, has demonstrated good performance in modeling EHRs for various downstream tasks. We propose that these deep sequence models can provide accurate IPTW estimation of treatment effect by directly estimating the propensity scores from claims records without the need for feature processing.</p><p><strong>Results: </strong>Comprehensive evaluations on synthetic and semi-synthetic datasets demonstrate that IPTW treatment effect estimation using deep sequence models consistently outperforms baseline approaches, including logistic regression and multilayer perceptrons, combined with feature processing.</p><p><strong>Discussion: </strong>Our findings demonstrate that deep sequence models consistently outperform traditional approaches in estimating treatment effects, particularly under time-dependent confounding. Moreover, Transformer-based models offer interpretability by assigning higher attention weights to relevant confounders, even when prior domain knowledge is limited.</p><p><strong>Conclusion: </strong>Deep sequence models enable accurate treatment effect estimation through IPTW without the need for feature processing.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 2","pages":"ooaf032"},"PeriodicalIF":2.5000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12033031/pdf/","citationCount":"0","resultStr":"{\"title\":\"Accurate treatment effect estimation using inverse probability of treatment weighting with deep learning.\",\"authors\":\"Junghwan Lee, Simin Ma, Nicoleta Serban, Shihao Yang\",\"doi\":\"10.1093/jamiaopen/ooaf032\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>Observational data have been actively used to estimate treatment effect, driven by the growing availability of electronic health records (EHRs). However, EHRs typically consist of longitudinal records, often introducing time-dependent confounding that hinder the unbiased estimation of treatment effect. Inverse probability of treatment weighting (IPTW) is a widely used propensity score method since it provides unbiased treatment effect estimation and its derivation is straightforward. In this study, we aim to utilize IPTW to estimate treatment effect in the presence of time-dependent confounding using claims records.</p><p><strong>Materials and methods: </strong>Previous studies have utilized propensity score methods with features derived from claims records through feature processing, which generally requires domain knowledge and additional resources to extract information to accurately estimate propensity scores. Deep learning, particularly using deep sequence models such as recurrent neural networks and Transformer, has demonstrated good performance in modeling EHRs for various downstream tasks. We propose that these deep sequence models can provide accurate IPTW estimation of treatment effect by directly estimating the propensity scores from claims records without the need for feature processing.</p><p><strong>Results: </strong>Comprehensive evaluations on synthetic and semi-synthetic datasets demonstrate that IPTW treatment effect estimation using deep sequence models consistently outperforms baseline approaches, including logistic regression and multilayer perceptrons, combined with feature processing.</p><p><strong>Discussion: </strong>Our findings demonstrate that deep sequence models consistently outperform traditional approaches in estimating treatment effects, particularly under time-dependent confounding. Moreover, Transformer-based models offer interpretability by assigning higher attention weights to relevant confounders, even when prior domain knowledge is limited.</p><p><strong>Conclusion: </strong>Deep sequence models enable accurate treatment effect estimation through IPTW without the need for feature processing.</p>\",\"PeriodicalId\":36278,\"journal\":{\"name\":\"JAMIA Open\",\"volume\":\"8 2\",\"pages\":\"ooaf032\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-04-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12033031/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JAMIA Open\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/jamiaopen/ooaf032\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/4/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMIA Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jamiaopen/ooaf032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
摘要
目的:在电子健康记录(EHRs)日益普及的推动下,观察性数据已被积极用于评估治疗效果。然而,电子病历通常由纵向记录组成,经常引入时间相关的混淆,妨碍对治疗效果的无偏估计。治疗加权逆概率法(Inverse probability of treatment weighting, IPTW)是一种被广泛使用的倾向评分方法,因为它能提供无偏的治疗效果估计,而且推导简单。在这项研究中,我们的目的是利用IPTW来估计治疗效果的存在时间依赖的混淆使用索赔记录。材料和方法:以往的研究使用倾向得分方法,通过特征处理从索赔记录中提取特征,通常需要领域知识和额外的资源来提取信息,以准确估计倾向得分。深度学习,特别是使用深度序列模型,如循环神经网络和Transformer,在为各种下游任务建模电子病历方面表现良好。我们提出这些深度序列模型可以通过直接估计索赔记录的倾向得分而不需要特征处理来提供准确的治疗效果IPTW估计。结果:对合成和半合成数据集的综合评估表明,使用深度序列模型估计IPTW治疗效果始终优于基线方法,包括逻辑回归和多层感知器,并结合特征处理。讨论:我们的研究结果表明,深度序列模型在估计治疗效果方面始终优于传统方法,特别是在时间相关的混杂情况下。此外,基于transformer的模型通过为相关的混杂因素分配更高的关注权重来提供可解释性,即使在先前的领域知识有限的情况下也是如此。结论:深度序列模型可以在不需要特征处理的情况下,通过IPTW准确估计治疗效果。
Accurate treatment effect estimation using inverse probability of treatment weighting with deep learning.
Objectives: Observational data have been actively used to estimate treatment effect, driven by the growing availability of electronic health records (EHRs). However, EHRs typically consist of longitudinal records, often introducing time-dependent confounding that hinder the unbiased estimation of treatment effect. Inverse probability of treatment weighting (IPTW) is a widely used propensity score method since it provides unbiased treatment effect estimation and its derivation is straightforward. In this study, we aim to utilize IPTW to estimate treatment effect in the presence of time-dependent confounding using claims records.
Materials and methods: Previous studies have utilized propensity score methods with features derived from claims records through feature processing, which generally requires domain knowledge and additional resources to extract information to accurately estimate propensity scores. Deep learning, particularly using deep sequence models such as recurrent neural networks and Transformer, has demonstrated good performance in modeling EHRs for various downstream tasks. We propose that these deep sequence models can provide accurate IPTW estimation of treatment effect by directly estimating the propensity scores from claims records without the need for feature processing.
Results: Comprehensive evaluations on synthetic and semi-synthetic datasets demonstrate that IPTW treatment effect estimation using deep sequence models consistently outperforms baseline approaches, including logistic regression and multilayer perceptrons, combined with feature processing.
Discussion: Our findings demonstrate that deep sequence models consistently outperform traditional approaches in estimating treatment effects, particularly under time-dependent confounding. Moreover, Transformer-based models offer interpretability by assigning higher attention weights to relevant confounders, even when prior domain knowledge is limited.
Conclusion: Deep sequence models enable accurate treatment effect estimation through IPTW without the need for feature processing.