{"title":"Targeted urinary metabolomics combined with machine learning to identify biomarkers related to central carbon metabolism for IBD.","authors":"Miao-Lin Lei, Guan-Wei Bi, Xiao-Lin Yin, Yue Wang, Zi-Ru Sun, Xin-Rui Guo, Hui-Peng Zhang, Xiao-Han Zhao, Feng Li, Yan-Bo Yu","doi":"10.3389/fmolb.2025.1615047","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Inflammatory bowel disease (IBD), comprising Crohn's disease (CD) and ulcerative colitis (UC), is a chronic and relapsing inflammatory disorder of the gastrointestinal tract. Current diagnostic approaches are invasive, costly, and time-consuming, underscoring the need for non-invasive, accurate diagnostic methods.</p><p><strong>Methods: </strong>We conducted a targeted metabolomic analysis of 49 metabolites related to central carbon metabolism in urinary samples from individuals with IBD and control group. Diagnostic models were constructed using six machine learning algorithms, and their performance was evaluated by cross-validated area under the receiver operating characteristic curve (AUC). The SHAP (SHapley Additive exPlanations) method was used to interpret the models and identify key discriminatory features.</p><p><strong>Results: </strong>Six metabolites-xylose, isocitric acid, fructose, L-fucose, N-acetyl-D-glucosamine (GlcNAc), and glycolic acid-differentiated UC from control group, while three metabolites-xylose, L-fucose, and citric acid-distinguished CD from control group. The optimal diagnostic model achieved a mean AUC of 0.84 for UC and 0.93 for CD. These models retained high diagnostic accuracy even after adjusting for disease activity. SHAP analysis identified L-fucose, xylose, and GlcNAc as important features for UC, and citric acid and xylose for CD.</p><p><strong>Discussion: </strong>Our findings highlight distinct metabolic signatures in central carbon metabolism associated with IBD subtypes. The identified metabolite panels, combined with machine learning models, offer promising non-invasive tools for differentiating UC and CD from healthy individuals.</p>","PeriodicalId":12465,"journal":{"name":"Frontiers in Molecular Biosciences","volume":"12 ","pages":"1615047"},"PeriodicalIF":3.9000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12375463/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Molecular Biosciences","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fmolb.2025.1615047","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Inflammatory bowel disease (IBD), comprising Crohn's disease (CD) and ulcerative colitis (UC), is a chronic and relapsing inflammatory disorder of the gastrointestinal tract. Current diagnostic approaches are invasive, costly, and time-consuming, underscoring the need for non-invasive, accurate diagnostic methods.
Methods: We conducted a targeted metabolomic analysis of 49 metabolites related to central carbon metabolism in urinary samples from individuals with IBD and control group. Diagnostic models were constructed using six machine learning algorithms, and their performance was evaluated by cross-validated area under the receiver operating characteristic curve (AUC). The SHAP (SHapley Additive exPlanations) method was used to interpret the models and identify key discriminatory features.
Results: Six metabolites-xylose, isocitric acid, fructose, L-fucose, N-acetyl-D-glucosamine (GlcNAc), and glycolic acid-differentiated UC from control group, while three metabolites-xylose, L-fucose, and citric acid-distinguished CD from control group. The optimal diagnostic model achieved a mean AUC of 0.84 for UC and 0.93 for CD. These models retained high diagnostic accuracy even after adjusting for disease activity. SHAP analysis identified L-fucose, xylose, and GlcNAc as important features for UC, and citric acid and xylose for CD.
Discussion: Our findings highlight distinct metabolic signatures in central carbon metabolism associated with IBD subtypes. The identified metabolite panels, combined with machine learning models, offer promising non-invasive tools for differentiating UC and CD from healthy individuals.
期刊介绍:
Much of contemporary investigation in the life sciences is devoted to the molecular-scale understanding of the relationships between genes and the environment — in particular, dynamic alterations in the levels, modifications, and interactions of cellular effectors, including proteins. Frontiers in Molecular Biosciences offers an international publication platform for basic as well as applied research; we encourage contributions spanning both established and emerging areas of biology. To this end, the journal draws from empirical disciplines such as structural biology, enzymology, biochemistry, and biophysics, capitalizing as well on the technological advancements that have enabled metabolomics and proteomics measurements in massively parallel throughput, and the development of robust and innovative computational biology strategies. We also recognize influences from medicine and technology, welcoming studies in molecular genetics, molecular diagnostics and therapeutics, and nanotechnology.
Our ultimate objective is the comprehensive illustration of the molecular mechanisms regulating proteins, nucleic acids, carbohydrates, lipids, and small metabolites in organisms across all branches of life.
In addition to interesting new findings, techniques, and applications, Frontiers in Molecular Biosciences will consider new testable hypotheses to inspire different perspectives and stimulate scientific dialogue. The integration of in silico, in vitro, and in vivo approaches will benefit endeavors across all domains of the life sciences.