{"title":"基于变压器的慢性肾脏病三期预测模型","authors":"Yifeng Lu, Wenxiu Chang, Deyao Yang, Yuxuan Jiang","doi":"10.1002/cpe.70322","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Chronic kidney disease (CKD) is a serious global health threat. At the terminal stage, kidney function is nearly completely lost. Therefore, predicting the development of CKD based on a patient's visits can enable doctors to intervene early and delay the disease's progression. In this paper, we propose a three-stage prediction model named Imputation-Capture-Prediction (ICP) and based on the Transformer architecture, for chronic kidney disease (CKD) using electronic health records (EHRs). The first stage is to address the missing data problem in EHR, and ICP employs a two-stage imputation method, using the deep learning method SAITS module after recent padding. The second stage is designed to better capture this temporal dependency and the relationships between features, where ICP incorporates a two-branch architecture and introduces two modules: Time-Aware Convolution (TC) and Dynamic-Static-Medical Graph Attention Network (DSMGAT), to extract diverse feature information. The TC module is designed to capture the relationships within visit records, accounting for the unequal lengths of visit intervals while emphasizing the importance of recent records. The DSMGAT module, on the other hand, considers various categories of record features, using a Graph Attention Network (GAT) with learnable weights to model the relationships among them. Then we use a Feed-Forward Network to predict the estimated glomerular filtration rate (eGFR). To evaluate the effectiveness of our method, we compared it with several advanced approaches using a real EHR dataset, TFHCKD. The Mean Absolute Error (MAE) and Mean Squared Error (MSE) were 0.0344 and 0.0028, respectively, demonstrating a significant improvement over existing methods.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 25-26","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Three-Stage Prediction Model Based on Transformer for Chronic Kidney Disease\",\"authors\":\"Yifeng Lu, Wenxiu Chang, Deyao Yang, Yuxuan Jiang\",\"doi\":\"10.1002/cpe.70322\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Chronic kidney disease (CKD) is a serious global health threat. At the terminal stage, kidney function is nearly completely lost. Therefore, predicting the development of CKD based on a patient's visits can enable doctors to intervene early and delay the disease's progression. In this paper, we propose a three-stage prediction model named Imputation-Capture-Prediction (ICP) and based on the Transformer architecture, for chronic kidney disease (CKD) using electronic health records (EHRs). The first stage is to address the missing data problem in EHR, and ICP employs a two-stage imputation method, using the deep learning method SAITS module after recent padding. The second stage is designed to better capture this temporal dependency and the relationships between features, where ICP incorporates a two-branch architecture and introduces two modules: Time-Aware Convolution (TC) and Dynamic-Static-Medical Graph Attention Network (DSMGAT), to extract diverse feature information. The TC module is designed to capture the relationships within visit records, accounting for the unequal lengths of visit intervals while emphasizing the importance of recent records. The DSMGAT module, on the other hand, considers various categories of record features, using a Graph Attention Network (GAT) with learnable weights to model the relationships among them. Then we use a Feed-Forward Network to predict the estimated glomerular filtration rate (eGFR). To evaluate the effectiveness of our method, we compared it with several advanced approaches using a real EHR dataset, TFHCKD. The Mean Absolute Error (MAE) and Mean Squared Error (MSE) were 0.0344 and 0.0028, respectively, demonstrating a significant improvement over existing methods.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"37 25-26\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70322\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70322","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
A Three-Stage Prediction Model Based on Transformer for Chronic Kidney Disease
Chronic kidney disease (CKD) is a serious global health threat. At the terminal stage, kidney function is nearly completely lost. Therefore, predicting the development of CKD based on a patient's visits can enable doctors to intervene early and delay the disease's progression. In this paper, we propose a three-stage prediction model named Imputation-Capture-Prediction (ICP) and based on the Transformer architecture, for chronic kidney disease (CKD) using electronic health records (EHRs). The first stage is to address the missing data problem in EHR, and ICP employs a two-stage imputation method, using the deep learning method SAITS module after recent padding. The second stage is designed to better capture this temporal dependency and the relationships between features, where ICP incorporates a two-branch architecture and introduces two modules: Time-Aware Convolution (TC) and Dynamic-Static-Medical Graph Attention Network (DSMGAT), to extract diverse feature information. The TC module is designed to capture the relationships within visit records, accounting for the unequal lengths of visit intervals while emphasizing the importance of recent records. The DSMGAT module, on the other hand, considers various categories of record features, using a Graph Attention Network (GAT) with learnable weights to model the relationships among them. Then we use a Feed-Forward Network to predict the estimated glomerular filtration rate (eGFR). To evaluate the effectiveness of our method, we compared it with several advanced approaches using a real EHR dataset, TFHCKD. The Mean Absolute Error (MAE) and Mean Squared Error (MSE) were 0.0344 and 0.0028, respectively, demonstrating a significant improvement over existing methods.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.