Machine Learning Reveals Serum Glycopatterns as Potential Biomarkers for the Diagnosis of Nonalcoholic Fatty Liver Disease (NAFLD)

IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Xiaocheng Li, Yaqing Xiao, Xinhuan Chen, Yayun Zhu, Haoqi Du, Jian Shu, Hanjie Yu, Xiameng Ren, Fan Zhang, Jing Dang, Chen Zhang, Shi Su* and Zheng Li*, 
{"title":"Machine Learning Reveals Serum Glycopatterns as Potential Biomarkers for the Diagnosis of Nonalcoholic Fatty Liver Disease (NAFLD)","authors":"Xiaocheng Li,&nbsp;Yaqing Xiao,&nbsp;Xinhuan Chen,&nbsp;Yayun Zhu,&nbsp;Haoqi Du,&nbsp;Jian Shu,&nbsp;Hanjie Yu,&nbsp;Xiameng Ren,&nbsp;Fan Zhang,&nbsp;Jing Dang,&nbsp;Chen Zhang,&nbsp;Shi Su* and Zheng Li*,&nbsp;","doi":"10.1021/acs.jproteome.4c00204","DOIUrl":null,"url":null,"abstract":"<p >Nonalcoholic fatty liver disease (NAFLD) has emerged as the predominant chronic liver condition globally, and underdiagnosis is common, particularly in mild cases, attributed to the asymptomatic nature and traditional ultrasonography’s limited sensitivity to detect early-stage steatosis. Consequently, patients may experience progressive liver pathology. The objective of this research is to ascertain the efficacy of serum glycan glycopatterns as a potential diagnostic biomarker, with a particular focus on the disease’s early stages. We collected a total of 170 serum samples from volunteers with mild-NAFLD (Mild), severe-NAFLD (Severe), and non-NAFLD (None). Examination via lectin microarrays has uncovered pronounced disparities in serum glycopatterns identified by 19 distinct lectins. Following this, we employed four distinct machine learning algorithms to categorize the None, Mild, and Severe groups, drawing on the alterations observed in serum glycopatterns. The gradient boosting decision tree (GBDT) algorithm outperformed other models in diagnostic accuracy within the validation set, achieving an accuracy rate of 95% in differentiating the None group from the Mild group. Our research indicates that employing lectin microarrays to identify alterations in serum glycopatterns, when integrated with advanced machine learning algorithms, could constitute a promising approach for the diagnosis of NAFLD, with a special emphasis on its early detection.</p>","PeriodicalId":48,"journal":{"name":"Journal of Proteome Research","volume":"23 6","pages":"2253–2264"},"PeriodicalIF":3.6000,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Proteome Research","FirstCategoryId":"99","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jproteome.4c00204","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Nonalcoholic fatty liver disease (NAFLD) has emerged as the predominant chronic liver condition globally, and underdiagnosis is common, particularly in mild cases, attributed to the asymptomatic nature and traditional ultrasonography’s limited sensitivity to detect early-stage steatosis. Consequently, patients may experience progressive liver pathology. The objective of this research is to ascertain the efficacy of serum glycan glycopatterns as a potential diagnostic biomarker, with a particular focus on the disease’s early stages. We collected a total of 170 serum samples from volunteers with mild-NAFLD (Mild), severe-NAFLD (Severe), and non-NAFLD (None). Examination via lectin microarrays has uncovered pronounced disparities in serum glycopatterns identified by 19 distinct lectins. Following this, we employed four distinct machine learning algorithms to categorize the None, Mild, and Severe groups, drawing on the alterations observed in serum glycopatterns. The gradient boosting decision tree (GBDT) algorithm outperformed other models in diagnostic accuracy within the validation set, achieving an accuracy rate of 95% in differentiating the None group from the Mild group. Our research indicates that employing lectin microarrays to identify alterations in serum glycopatterns, when integrated with advanced machine learning algorithms, could constitute a promising approach for the diagnosis of NAFLD, with a special emphasis on its early detection.

Abstract Image

Abstract Image

机器学习发现血清糖型是诊断非酒精性脂肪肝 (NAFLD) 的潜在生物标记物
非酒精性脂肪肝(NAFLD)已成为全球最主要的慢性肝病,由于无症状和传统超声波检查对早期脂肪变性的检测灵敏度有限,因此诊断不足的情况非常普遍,尤其是轻度病例。因此,患者的肝脏可能会出现进行性病变。本研究的目的是确定血清聚糖糖型作为潜在诊断生物标志物的有效性,尤其关注疾病的早期阶段。我们从患有轻度-NAFLD(轻度)、重度-NAFLD(重度)和非-NAFLD(无)的志愿者中收集了共计 170 份血清样本。通过凝集素微阵列检查发现,由 19 种不同凝集素鉴定出的血清糖型存在明显差异。在此基础上,我们采用了四种不同的机器学习算法,根据在血清糖图中观察到的变化,对无、轻度和重度组进行分类。在验证集中,梯度提升决策树(GBDT)算法的诊断准确率优于其他模型,在区分无组和轻度组方面的准确率达到了 95%。我们的研究表明,利用凝集素微阵列识别血清糖型的改变,并与先进的机器学习算法相结合,可以成为诊断非酒精性脂肪肝的一种很有前途的方法,尤其适用于非酒精性脂肪肝的早期检测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Proteome Research
Journal of Proteome Research 生物-生化研究方法
CiteScore
9.00
自引率
4.50%
发文量
251
审稿时长
3 months
期刊介绍: Journal of Proteome Research publishes content encompassing all aspects of global protein analysis and function, including the dynamic aspects of genomics, spatio-temporal proteomics, metabonomics and metabolomics, clinical and agricultural proteomics, as well as advances in methodology including bioinformatics. The theme and emphasis is on a multidisciplinary approach to the life sciences through the synergy between the different types of "omics".
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信