Clustering-based risk stratification of prediabetes populations: Insights from the Taiwan and UK Biobanks.

IF 3.2 3区 医学
Djeane Debora Onthoni, Ying-Erh Chen, Yi-Hsuan Lai, Guo-Hung Li, Yong-Sheng Zhuang, Hong-Ming Lin, Yu-Ping Hsiao, Ade Indra Onthoni, Hung-Yi Chiou, Ren-Hua Chung
{"title":"Clustering-based risk stratification of prediabetes populations: Insights from the Taiwan and UK Biobanks.","authors":"Djeane Debora Onthoni, Ying-Erh Chen, Yi-Hsuan Lai, Guo-Hung Li, Yong-Sheng Zhuang, Hong-Ming Lin, Yu-Ping Hsiao, Ade Indra Onthoni, Hung-Yi Chiou, Ren-Hua Chung","doi":"10.1111/jdi.14328","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims/introduction: </strong>This study aimed to identify low- and high-risk diabetes groups within prediabetes populations using data from the Taiwan Biobank (TWB) and UK Biobank (UKB) through a clustering-based Unsupervised Learning (UL) approach, to inform targeted type 2 diabetes (T2D) interventions.</p><p><strong>Materials and methods: </strong>Data from TWB and UKB, comprising clinical and genetic information, were analyzed. Prediabetes was defined by glucose thresholds, and incident T2D was identified through follow-up data. K-means clustering was performed on prediabetes participants using significant features determined through logistic regression and LASSO. Cluster stability was assessed using mean Jaccard similarity, silhouette score, and the elbow method.</p><p><strong>Results: </strong>We identified two stable clusters representing high- and low-risk diabetes groups in both biobanks. The high-risk clusters showed higher diabetes incidence, with 15.7% in TWB and 13.0% in UKB, compared to 7.3% and 9.1% in the low-risk clusters, respectively. Notably, males were predominant in the high-risk groups, constituting 76.6% in TWB and 52.7% in UKB. In TWB, the high-risk group also exhibited significantly higher BMI, fasting glucose, and triglycerides, while UKB showed marginal significance in BMI and other metabolic indicators. Current smoking was significantly associated with increased diabetes risk in the TWB high-risk group (P < 0.001). Kaplan-Meier curves indicated significant differences in diabetes complication incidences between clusters.</p><p><strong>Conclusions: </strong>UL effectively identified risk-specific groups within prediabetes populations, with high-risk groups strongly associated male gender, higher BMI, smoking, and metabolic markers. Tailored preventive strategies, particularly for young males in Taiwan, are crucial to reducing T2D risk.</p>","PeriodicalId":190,"journal":{"name":"Journal of Diabetes Investigation","volume":" ","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Diabetes Investigation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/jdi.14328","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Aims/introduction: This study aimed to identify low- and high-risk diabetes groups within prediabetes populations using data from the Taiwan Biobank (TWB) and UK Biobank (UKB) through a clustering-based Unsupervised Learning (UL) approach, to inform targeted type 2 diabetes (T2D) interventions.

Materials and methods: Data from TWB and UKB, comprising clinical and genetic information, were analyzed. Prediabetes was defined by glucose thresholds, and incident T2D was identified through follow-up data. K-means clustering was performed on prediabetes participants using significant features determined through logistic regression and LASSO. Cluster stability was assessed using mean Jaccard similarity, silhouette score, and the elbow method.

Results: We identified two stable clusters representing high- and low-risk diabetes groups in both biobanks. The high-risk clusters showed higher diabetes incidence, with 15.7% in TWB and 13.0% in UKB, compared to 7.3% and 9.1% in the low-risk clusters, respectively. Notably, males were predominant in the high-risk groups, constituting 76.6% in TWB and 52.7% in UKB. In TWB, the high-risk group also exhibited significantly higher BMI, fasting glucose, and triglycerides, while UKB showed marginal significance in BMI and other metabolic indicators. Current smoking was significantly associated with increased diabetes risk in the TWB high-risk group (P < 0.001). Kaplan-Meier curves indicated significant differences in diabetes complication incidences between clusters.

Conclusions: UL effectively identified risk-specific groups within prediabetes populations, with high-risk groups strongly associated male gender, higher BMI, smoking, and metabolic markers. Tailored preventive strategies, particularly for young males in Taiwan, are crucial to reducing T2D risk.

基于聚类的糖尿病前期人群风险分层:来自台湾和英国生物库的启示。
目的/简介:本研究旨在通过基于聚类的无监督学习(UL)方法,利用台湾生物样本库(TWB)和英国生物样本库(UKB)的数据,识别糖尿病前期人群中的低风险和高风险糖尿病群体,为有针对性的2型糖尿病(T2D)干预措施提供依据:对来自 TWB 和 UKB 的数据(包括临床和遗传信息)进行了分析。糖尿病前期由血糖阈值定义,T2D事件则通过随访数据确定。利用逻辑回归和LASSO确定的重要特征对糖尿病前期参与者进行K均值聚类。使用平均 Jaccard 相似度、剪影得分和肘法评估聚类的稳定性:结果:我们在两个生物库中发现了代表高危和低危糖尿病群体的两个稳定聚类。高风险群组的糖尿病发病率较高,在 TWB 中为 15.7%,在 UKB 中为 13.0%,而在低风险群组中分别为 7.3%和 9.1%。值得注意的是,男性在高危人群中占主导地位,在 TWB 中占 76.6%,在 UKB 中占 52.7%。在 TWB 中,高风险组的体重指数、空腹血糖和甘油三酯也明显较高,而在 UKB 中,体重指数和其他代谢指标的差异不大。在 TWB 高危人群中,当前吸烟与糖尿病风险的增加有明显相关性(P 结论:UL 能有效识别 TWB 高危人群中的特定风险组:UL 能有效识别糖尿病前期人群中的特定风险组,高风险组与男性性别、较高的体重指数、吸烟和代谢指标密切相关。量身定制的预防策略,尤其是针对台湾年轻男性的策略,对于降低 T2D 风险至关重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Diabetes Investigation
Journal of Diabetes Investigation Medicine-Internal Medicine
自引率
9.40%
发文量
218
期刊介绍: Journal of Diabetes Investigation is your core diabetes journal from Asia; the official journal of the Asian Association for the Study of Diabetes (AASD). The journal publishes original research, country reports, commentaries, reviews, mini-reviews, case reports, letters, as well as editorials and news. Embracing clinical and experimental research in diabetes and related areas, the Journal of Diabetes Investigation includes aspects of prevention, treatment, as well as molecular aspects and pathophysiology. Translational research focused on the exchange of ideas between clinicians and researchers is also welcome. Journal of Diabetes Investigation is indexed by Science Citation Index Expanded (SCIE).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信