基于变压器的慢性肾脏病三期预测模型

IF 1.5 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Concurrency and Computation-Practice & Experience Pub Date : 2025-09-30 DOI:10.1002/cpe.70322

Yifeng Lu, Wenxiu Chang, Deyao Yang, Yuxuan Jiang

{"title":"基于变压器的慢性肾脏病三期预测模型","authors":"Yifeng Lu, Wenxiu Chang, Deyao Yang, Yuxuan Jiang","doi":"10.1002/cpe.70322","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Chronic kidney disease (CKD) is a serious global health threat. At the terminal stage, kidney function is nearly completely lost. Therefore, predicting the development of CKD based on a patient's visits can enable doctors to intervene early and delay the disease's progression. In this paper, we propose a three-stage prediction model named Imputation-Capture-Prediction (ICP) and based on the Transformer architecture, for chronic kidney disease (CKD) using electronic health records (EHRs). The first stage is to address the missing data problem in EHR, and ICP employs a two-stage imputation method, using the deep learning method SAITS module after recent padding. The second stage is designed to better capture this temporal dependency and the relationships between features, where ICP incorporates a two-branch architecture and introduces two modules: Time-Aware Convolution (TC) and Dynamic-Static-Medical Graph Attention Network (DSMGAT), to extract diverse feature information. The TC module is designed to capture the relationships within visit records, accounting for the unequal lengths of visit intervals while emphasizing the importance of recent records. The DSMGAT module, on the other hand, considers various categories of record features, using a Graph Attention Network (GAT) with learnable weights to model the relationships among them. Then we use a Feed-Forward Network to predict the estimated glomerular filtration rate (eGFR). To evaluate the effectiveness of our method, we compared it with several advanced approaches using a real EHR dataset, TFHCKD. The Mean Absolute Error (MAE) and Mean Squared Error (MSE) were 0.0344 and 0.0028, respectively, demonstrating a significant improvement over existing methods.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 25-26","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Three-Stage Prediction Model Based on Transformer for Chronic Kidney Disease\",\"authors\":\"Yifeng Lu, Wenxiu Chang, Deyao Yang, Yuxuan Jiang\",\"doi\":\"10.1002/cpe.70322\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Chronic kidney disease (CKD) is a serious global health threat. At the terminal stage, kidney function is nearly completely lost. Therefore, predicting the development of CKD based on a patient's visits can enable doctors to intervene early and delay the disease's progression. In this paper, we propose a three-stage prediction model named Imputation-Capture-Prediction (ICP) and based on the Transformer architecture, for chronic kidney disease (CKD) using electronic health records (EHRs). The first stage is to address the missing data problem in EHR, and ICP employs a two-stage imputation method, using the deep learning method SAITS module after recent padding. The second stage is designed to better capture this temporal dependency and the relationships between features, where ICP incorporates a two-branch architecture and introduces two modules: Time-Aware Convolution (TC) and Dynamic-Static-Medical Graph Attention Network (DSMGAT), to extract diverse feature information. The TC module is designed to capture the relationships within visit records, accounting for the unequal lengths of visit intervals while emphasizing the importance of recent records. The DSMGAT module, on the other hand, considers various categories of record features, using a Graph Attention Network (GAT) with learnable weights to model the relationships among them. Then we use a Feed-Forward Network to predict the estimated glomerular filtration rate (eGFR). To evaluate the effectiveness of our method, we compared it with several advanced approaches using a real EHR dataset, TFHCKD. The Mean Absolute Error (MAE) and Mean Squared Error (MSE) were 0.0344 and 0.0028, respectively, demonstrating a significant improvement over existing methods.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"37 25-26\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70322\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70322","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

摘要

慢性肾脏疾病（CKD）是严重的全球健康威胁。在晚期，肾功能几乎完全丧失。因此，根据患者的就诊预测CKD的发展可以使医生尽早干预并延缓疾病的进展。在本文中，我们提出了一个基于Transformer架构的三阶段预测模型，命名为输入-捕获-预测（ICP），用于使用电子健康记录（EHRs）的慢性肾脏疾病（CKD）。第一阶段是解决EHR中的数据缺失问题，ICP采用两阶段的插值方法，在填充后使用深度学习方法SAITS模块。第二阶段旨在更好地捕获这种时间依赖性和特征之间的关系，其中ICP采用了两分支架构并引入了两个模块：时间感知卷积（TC）和动态-静态-医学图注意网络（DSMGAT），以提取各种特征信息。TC模块旨在捕捉访问记录之间的关系，考虑访问间隔的不等长度，同时强调最近记录的重要性。另一方面，DSMGAT模块考虑了各种类别的记录特征，使用具有可学习权重的图注意网络（GAT）来建模它们之间的关系。然后我们使用前馈网络来预测估计的肾小球滤过率（eGFR）。为了评估我们的方法的有效性，我们将其与使用真实EHR数据集TFHCKD的几种先进方法进行了比较。平均绝对误差（MAE）和均方误差（MSE）分别为0.0344和0.0028，与现有方法相比有显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

A Three-Stage Prediction Model Based on Transformer for Chronic Kidney Disease

查看原文本刊更多论文

A Three-Stage Prediction Model Based on Transformer for Chronic Kidney Disease

Chronic kidney disease (CKD) is a serious global health threat. At the terminal stage, kidney function is nearly completely lost. Therefore, predicting the development of CKD based on a patient's visits can enable doctors to intervene early and delay the disease's progression. In this paper, we propose a three-stage prediction model named Imputation-Capture-Prediction (ICP) and based on the Transformer architecture, for chronic kidney disease (CKD) using electronic health records (EHRs). The first stage is to address the missing data problem in EHR, and ICP employs a two-stage imputation method, using the deep learning method SAITS module after recent padding. The second stage is designed to better capture this temporal dependency and the relationships between features, where ICP incorporates a two-branch architecture and introduces two modules: Time-Aware Convolution (TC) and Dynamic-Static-Medical Graph Attention Network (DSMGAT), to extract diverse feature information. The TC module is designed to capture the relationships within visit records, accounting for the unequal lengths of visit intervals while emphasizing the importance of recent records. The DSMGAT module, on the other hand, considers various categories of record features, using a Graph Attention Network (GAT) with learnable weights to model the relationships among them. Then we use a Feed-Forward Network to predict the estimated glomerular filtration rate (eGFR). To evaluate the effectiveness of our method, we compared it with several advanced approaches using a real EHR dataset, TFHCKD. The Mean Absolute Error (MAE) and Mean Squared Error (MSE) were 0.0344 and 0.0028, respectively, demonstrating a significant improvement over existing methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Concurrency and Computation-Practice & Experience 工程技术-计算机：理论方法

CiteScore

5.00

自引率

10.00%

发文量

664

审稿时长

9.6 months

期刊介绍： Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of: Parallel and distributed computing; High-performance computing; Computational and data science; Artificial intelligence and machine learning; Big data applications, algorithms, and systems; Network science; Ontologies and semantics; Security and privacy; Cloud/edge/fog computing; Green computing; and Quantum computing.