Niels F. Cleymans;Mark Van De Casteele;Julie Vandewalle;Aster K. Desouter;Frans K. Gorus;Kurt Barbé
{"title":"Analyzing Random Forest’s Predictive Capability for Type 1 Diabetes Progression","authors":"Niels F. Cleymans;Mark Van De Casteele;Julie Vandewalle;Aster K. Desouter;Frans K. Gorus;Kurt Barbé","doi":"10.1109/OJIM.2025.3551837","DOIUrl":null,"url":null,"abstract":"Type 1 diabetes (T1Ds) is a chronic, for now, incurable multifactorial disease caused by the immune-mediated destruction of insulin-producing pancreatic <inline-formula> <tex-math>$\\beta $ </tex-math></inline-formula>-cells, causing devastating and costly acute and chronic complications, despite lifelong insulin treatment. Abrupt clinical onset is preceded by an asymptomatic disease phase of highly variable duration which is marked by the sequential appearance of various types of <inline-formula> <tex-math>$\\beta $ </tex-math></inline-formula>-cell autoantibodies (AAbs). Optimized predictions of time to clinical onset facilitate early diagnosis which is also key to reducing the incidence of inaugural life-threatening diabetic ketoacidosis and planning novel prevention trials in the asymptomatic stage. Research in first-degree relatives of known T1D patients has shown that disease progression can be predicted by genetic and immune biomarkers, but these predictions are limited by using the traditional statistical approaches such as Cox regression models. This explorative study aims to uncover the potential of random forest machine learning algorithms as survival models within the biomedical context of T1D. Two random forest survival models were constructed in R. The first constructed model predicts how long it will take for individuals to go from single to multiple AAb positivity (AAb+), a crucial step in T1D development. The second model predicts the transition from multiple AAb+ to the onset of T1D. This article demonstrates that our random forest survival models outperform traditional Cox regression methods; we conduct a detailed analysis of variable importance to uncover novel biomarker interactions; and we establish a refined framework for precise measurement and risk stratification of T1D, paving the way for earlier and more targeted intervention.","PeriodicalId":100630,"journal":{"name":"IEEE Open Journal of Instrumentation and Measurement","volume":"4 ","pages":"1-11"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10929762","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of Instrumentation and Measurement","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10929762/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Type 1 diabetes (T1Ds) is a chronic, for now, incurable multifactorial disease caused by the immune-mediated destruction of insulin-producing pancreatic $\beta $ -cells, causing devastating and costly acute and chronic complications, despite lifelong insulin treatment. Abrupt clinical onset is preceded by an asymptomatic disease phase of highly variable duration which is marked by the sequential appearance of various types of $\beta $ -cell autoantibodies (AAbs). Optimized predictions of time to clinical onset facilitate early diagnosis which is also key to reducing the incidence of inaugural life-threatening diabetic ketoacidosis and planning novel prevention trials in the asymptomatic stage. Research in first-degree relatives of known T1D patients has shown that disease progression can be predicted by genetic and immune biomarkers, but these predictions are limited by using the traditional statistical approaches such as Cox regression models. This explorative study aims to uncover the potential of random forest machine learning algorithms as survival models within the biomedical context of T1D. Two random forest survival models were constructed in R. The first constructed model predicts how long it will take for individuals to go from single to multiple AAb positivity (AAb+), a crucial step in T1D development. The second model predicts the transition from multiple AAb+ to the onset of T1D. This article demonstrates that our random forest survival models outperform traditional Cox regression methods; we conduct a detailed analysis of variable importance to uncover novel biomarker interactions; and we establish a refined framework for precise measurement and risk stratification of T1D, paving the way for earlier and more targeted intervention.