{"title":"Genetic biomarkers and machine learning techniques for predicting diabetes: systematic review","authors":"Sulaiman Khan, Farida Mohsen, Zubair Shah","doi":"10.1007/s10462-024-11020-w","DOIUrl":null,"url":null,"abstract":"<div><p>Diabetes mellitus is a long-term metabolic condition marked by high blood sugar levels due to issues with insulin production, insulin effectiveness, or a combination of both. It stands as one of the fastest-growing diseases worldwide, projected to afflict 693 million adults by 2045. The escalating prevalence of diabetes and associated health complications (kidney disease, retinopathy, and neuropathy) underscore the imperative to devise predictive models for early diagnosis and intervention. These complications contribute to increased mortality rates, blindness, kidney failure, and an overall diminished quality of life in individuals living with diabetes. While clinical risk factors and glycemic control provide valuable insights, they alone cannot reliably predict the onset of vascular complications. Genetic biomarkers and machine learning techniques have emerged as promising tools for predicting diabetes development risk and associated complications. Despite the emergence of numerous smart AI models for diabetes prediction, there is still a need for a thorough review outlining their progress and challenges. To address this gap, this paper offers a systematic review of the literature on AI-based models for diabetes identification, following the PRISMA extension for scoping reviews guidelines. Our review revealed that multimodal diabetes prediction models outperformed unimodal models. Most studies focused on classical machine learning models, with SNPs being the most used data type, followed by gene expression profiles, while lipidomic and metabolomic data were the least utilized. Moreover, some studies focused on identifying genetic determinants of diabetes complications relied on familial linkage analysis, tailored for robust effect loci. However, these approaches had limitations, including susceptibility to false positives in candidate gene studies and underpowered AI models capabilities due to sample size constraints. The landscape shifted dramatically with the proliferation of genomic datasets, fueled by the emergence of biobanks and the amalgamation of global cohorts. This surge has led to a more than twofold increase in genetic discoveries related to both diabetes and its complications using AI. Our focus here is on these genetic breakthroughs, particularly those empowered by AI models. However, we also highlight the existing gaps in research and underscore the need for further advancements to propel genomic discovery to the next level.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 2","pages":""},"PeriodicalIF":10.7000,"publicationDate":"2024-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-11020-w.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-024-11020-w","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Diabetes mellitus is a long-term metabolic condition marked by high blood sugar levels due to issues with insulin production, insulin effectiveness, or a combination of both. It stands as one of the fastest-growing diseases worldwide, projected to afflict 693 million adults by 2045. The escalating prevalence of diabetes and associated health complications (kidney disease, retinopathy, and neuropathy) underscore the imperative to devise predictive models for early diagnosis and intervention. These complications contribute to increased mortality rates, blindness, kidney failure, and an overall diminished quality of life in individuals living with diabetes. While clinical risk factors and glycemic control provide valuable insights, they alone cannot reliably predict the onset of vascular complications. Genetic biomarkers and machine learning techniques have emerged as promising tools for predicting diabetes development risk and associated complications. Despite the emergence of numerous smart AI models for diabetes prediction, there is still a need for a thorough review outlining their progress and challenges. To address this gap, this paper offers a systematic review of the literature on AI-based models for diabetes identification, following the PRISMA extension for scoping reviews guidelines. Our review revealed that multimodal diabetes prediction models outperformed unimodal models. Most studies focused on classical machine learning models, with SNPs being the most used data type, followed by gene expression profiles, while lipidomic and metabolomic data were the least utilized. Moreover, some studies focused on identifying genetic determinants of diabetes complications relied on familial linkage analysis, tailored for robust effect loci. However, these approaches had limitations, including susceptibility to false positives in candidate gene studies and underpowered AI models capabilities due to sample size constraints. The landscape shifted dramatically with the proliferation of genomic datasets, fueled by the emergence of biobanks and the amalgamation of global cohorts. This surge has led to a more than twofold increase in genetic discoveries related to both diabetes and its complications using AI. Our focus here is on these genetic breakthroughs, particularly those empowered by AI models. However, we also highlight the existing gaps in research and underscore the need for further advancements to propel genomic discovery to the next level.
期刊介绍:
Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.