Xiaoyang Ruan, Sunyang Fu, Heling Jia, Kellie L Mathis, Cornelius A Thiels, Schaeferle M Gavin, Patrick M Wilson, Curtis B Storlie, Hongfang Liu
{"title":"Gated recurrent unit with decay has real-time capability for postoperative ileus surveillance and offers cross-hospital transferability.","authors":"Xiaoyang Ruan, Sunyang Fu, Heling Jia, Kellie L Mathis, Cornelius A Thiels, Schaeferle M Gavin, Patrick M Wilson, Curtis B Storlie, Hongfang Liu","doi":"10.1038/s43856-025-01053-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Ileus, a postoperative complication after colorectal surgery, increases morbidity, costs, and hospital stays. Assessing risk of ileus is crucial, especially with the trend towards early discharge. Prior studies assessed risk of ileus with regression models, the role of deep learning remains unexplored.</p><p><strong>Methods: </strong>We evaluated the Gated Recurrent Unit with Decay (GRU-D) for real-time ileus risk assessment in 7349 colorectal surgeries across three Mayo Clinic sites with two Electronic Health Record (EHR) systems. The results were compared with atemporal models on a panel of benchmark metrics.</p><p><strong>Results: </strong>Here we show that despite extreme data sparsity (e.g., 72.2% of labs, 26.9% of vitals lack measurements within 24 h post-surgery), GRU-D demonstrates improved performance by integrating new measurements and exhibits robust transferability. In brute-force transfer, AUROC decreases by no more than 5%, while multi-source instance transfer yields up to a 2.6% improvement in AUROC and an 86% narrower confidence interval. Although atemporal models perform better at certain pre-surgical time points, their performance fluctuates considerably and generally falls short of GRU-D in post-surgical hours.</p><p><strong>Conclusions: </strong>GRU-D's dynamic risk assessment capability is crucial in scenarios where clinical follow-up is essential, warranting further research on built-in explainability for clinical integration.</p>","PeriodicalId":72646,"journal":{"name":"Communications medicine","volume":"5 1","pages":"331"},"PeriodicalIF":5.4000,"publicationDate":"2025-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12322181/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s43856-025-01053-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Ileus, a postoperative complication after colorectal surgery, increases morbidity, costs, and hospital stays. Assessing risk of ileus is crucial, especially with the trend towards early discharge. Prior studies assessed risk of ileus with regression models, the role of deep learning remains unexplored.
Methods: We evaluated the Gated Recurrent Unit with Decay (GRU-D) for real-time ileus risk assessment in 7349 colorectal surgeries across three Mayo Clinic sites with two Electronic Health Record (EHR) systems. The results were compared with atemporal models on a panel of benchmark metrics.
Results: Here we show that despite extreme data sparsity (e.g., 72.2% of labs, 26.9% of vitals lack measurements within 24 h post-surgery), GRU-D demonstrates improved performance by integrating new measurements and exhibits robust transferability. In brute-force transfer, AUROC decreases by no more than 5%, while multi-source instance transfer yields up to a 2.6% improvement in AUROC and an 86% narrower confidence interval. Although atemporal models perform better at certain pre-surgical time points, their performance fluctuates considerably and generally falls short of GRU-D in post-surgical hours.
Conclusions: GRU-D's dynamic risk assessment capability is crucial in scenarios where clinical follow-up is essential, warranting further research on built-in explainability for clinical integration.
背景:肠梗阻是结直肠手术后的一种并发症,可增加发病率、费用和住院时间。评估肠梗阻的风险是至关重要的,特别是在早期出院的趋势下。先前的研究用回归模型评估肠梗阻的风险,深度学习的作用仍未被探索。方法:我们评估了gate - Recurrent Unit with Decay (GRU-D)用于实时肠梗阻风险评估的三个Mayo诊所站点的7349例结直肠手术,使用两个电子健康记录(EHR)系统。结果与基准指标面板上的非时间模型进行了比较。结果:在这里,我们表明,尽管极端的数据稀疏(例如,72.2%的实验室,26.9%的生命体征在术后24小时内缺乏测量),GRU-D通过整合新的测量结果显示出改进的性能,并表现出强大的可移植性。在暴力迁移中,AUROC降低不超过5%,而多源实例迁移使AUROC提高了2.6%,置信区间缩小了86%。虽然非时态模型在手术前的某些时间点表现较好,但其性能波动较大,在手术后的时间点通常达不到GRU-D。结论:GRU-D的动态风险评估能力在需要临床随访的情况下至关重要,需要进一步研究临床整合的内在可解释性。