A machine learning approach towards assessing consistency and reproducibility: an application to graft survival across three kidney transplantation eras.

IF 3.2 Q1 HEALTH CARE SCIENCES & SERVICES
Frontiers in digital health Pub Date : 2024-09-03 eCollection Date: 2024-01-01 DOI:10.3389/fdgth.2024.1427845
Okechinyere Achilonu, George Obaido, Blessing Ogbuokiri, Kehinde Aruleba, Eustasius Musenge, June Fabian
{"title":"A machine learning approach towards assessing consistency and reproducibility: an application to graft survival across three kidney transplantation eras.","authors":"Okechinyere Achilonu, George Obaido, Blessing Ogbuokiri, Kehinde Aruleba, Eustasius Musenge, June Fabian","doi":"10.3389/fdgth.2024.1427845","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>In South Africa, between 1966 and 2014, there were three kidney transplant eras defined by evolving access to certain immunosuppressive therapies defined as <i>Pre-CYA</i> (before availability of cyclosporine), <i>CYA</i> (when cyclosporine became available), and <i>New-Gen</i> (availability of tacrolimus and mycophenolic acid). As such, factors influencing kidney graft failure may vary across these eras. Therefore, evaluating the consistency and reproducibility of models developed to study these variations using machine learning (ML) algorithms could enhance our understanding of post-transplant graft survival dynamics across these three eras.</p><p><strong>Methods: </strong>This study explored the effectiveness of nine ML algorithms in predicting 10-year graft survival across the three eras. We developed and internally validated these algorithms using data spanning the specified eras. The predictive performance of these algorithms was assessed using the area under the curve (AUC) of the receiver operating characteristics curve (ROC), supported by other evaluation metrics. We employed local interpretable model-agnostic explanations to provide detailed interpretations of individual model predictions and used permutation importance to assess global feature importance across each era.</p><p><strong>Results: </strong>Overall, the proportion of graft failure decreased from 41.5% in the <i>Pre-CYA</i> era to 15.1% in the <i>New-Gen</i> era. Our best-performing model across the three eras demonstrated high predictive accuracy. Notably, the ensemble models, particularly the Extra Trees model, emerged as standout performers, consistently achieving high AUC scores of 0.95, 0.95, and 0.97 across the eras. This indicates that the models achieved high consistency and reproducibility in predicting graft survival outcomes. Among the features evaluated, recipient age and donor age were the only features consistently influencing graft failure throughout these eras, while features such as glomerular filtration rate and recipient ethnicity showed high importance in specific eras, resulting in relatively poor historical transportability of the best model.</p><p><strong>Conclusions: </strong>Our study emphasises the significance of analysing post-kidney transplant outcomes and identifying era-specific factors mitigating graft failure. The proposed framework can serve as a foundation for future research and assist physicians in identifying patients at risk of graft failure.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11405382/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fdgth.2024.1427845","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: In South Africa, between 1966 and 2014, there were three kidney transplant eras defined by evolving access to certain immunosuppressive therapies defined as Pre-CYA (before availability of cyclosporine), CYA (when cyclosporine became available), and New-Gen (availability of tacrolimus and mycophenolic acid). As such, factors influencing kidney graft failure may vary across these eras. Therefore, evaluating the consistency and reproducibility of models developed to study these variations using machine learning (ML) algorithms could enhance our understanding of post-transplant graft survival dynamics across these three eras.

Methods: This study explored the effectiveness of nine ML algorithms in predicting 10-year graft survival across the three eras. We developed and internally validated these algorithms using data spanning the specified eras. The predictive performance of these algorithms was assessed using the area under the curve (AUC) of the receiver operating characteristics curve (ROC), supported by other evaluation metrics. We employed local interpretable model-agnostic explanations to provide detailed interpretations of individual model predictions and used permutation importance to assess global feature importance across each era.

Results: Overall, the proportion of graft failure decreased from 41.5% in the Pre-CYA era to 15.1% in the New-Gen era. Our best-performing model across the three eras demonstrated high predictive accuracy. Notably, the ensemble models, particularly the Extra Trees model, emerged as standout performers, consistently achieving high AUC scores of 0.95, 0.95, and 0.97 across the eras. This indicates that the models achieved high consistency and reproducibility in predicting graft survival outcomes. Among the features evaluated, recipient age and donor age were the only features consistently influencing graft failure throughout these eras, while features such as glomerular filtration rate and recipient ethnicity showed high importance in specific eras, resulting in relatively poor historical transportability of the best model.

Conclusions: Our study emphasises the significance of analysing post-kidney transplant outcomes and identifying era-specific factors mitigating graft failure. The proposed framework can serve as a foundation for future research and assist physicians in identifying patients at risk of graft failure.

背景:从 1966 年到 2014 年,南非经历了三个肾移植时代,这三个时代的定义是:Pre-CYA(环孢素上市之前)、CYA(环孢素上市之后)和 New-Gen(他克莫司和霉酚酸上市之后)。因此,影响肾移植失败的因素在不同时期可能有所不同。因此,利用机器学习(ML)算法评估为研究这些变化而开发的模型的一致性和可重复性,可以加深我们对这三个时代移植后存活动态的了解:本研究探讨了九种 ML 算法在预测这三个时代的 10 年移植物存活率方面的有效性。我们使用跨越特定年代的数据开发并在内部验证了这些算法。我们使用接收者操作特征曲线(ROC)的曲线下面积(AUC)来评估这些算法的预测性能,并辅以其他评估指标。我们采用了局部可解释的模型失衡解释来提供单个模型预测的详细解释,并使用置换重要性来评估每个时代的全局特征重要性:总体而言,移植物失败的比例从前 CYA 时代的 41.5% 降至新基因时代的 15.1%。我们在三个时代中表现最好的模型显示出很高的预测准确性。值得注意的是,集合模型,尤其是 Extra Trees 模型,表现突出,在各个时代的 AUC 分数一直高达 0.95、0.95 和 0.97。这表明这些模型在预测移植物存活结果方面具有很高的一致性和可重复性。在评估的特征中,受体年龄和供体年龄是唯一在这些年代中始终影响移植物失败的特征,而肾小球滤过率和受体种族等特征在特定年代显示出较高的重要性,导致最佳模型的历史可移植性相对较差:我们的研究强调了分析肾移植后结果和确定减轻移植失败的特定时代因素的重要性。提出的框架可作为未来研究的基础,并帮助医生识别有移植失败风险的患者。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.20
自引率
0.00%
发文量
0
审稿时长
13 weeks
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信