Thejus T. Jayakrishnan, Naseer Sangwan, Shimoli V. Barot, Nicole Farha, Arshiya Mariam, Shao Xiang, Federico Aucejo, Madison Conces, Kanika G. Nair, Smitha S. Krishnamurthi, Stephanie L. Schmit, David Liska, Daniel M. Rotroff, Alok A. Khorana, Suneel D. Kamath
{"title":"多组学机器学习研究早期结直肠癌中宿主与微生物组的相互作用","authors":"Thejus T. Jayakrishnan, Naseer Sangwan, Shimoli V. Barot, Nicole Farha, Arshiya Mariam, Shao Xiang, Federico Aucejo, Madison Conces, Kanika G. Nair, Smitha S. Krishnamurthi, Stephanie L. Schmit, David Liska, Daniel M. Rotroff, Alok A. Khorana, Suneel D. Kamath","doi":"10.1038/s41698-024-00647-1","DOIUrl":null,"url":null,"abstract":"The incidence of early-onset colorectal cancer (eoCRC) is rising, and its pathogenesis is not completely understood. We hypothesized that machine learning utilizing paired tissue microbiome and plasma metabolome features could uncover distinct host-microbiome associations between eoCRC and average-onset CRC (aoCRC). Individuals with stages I–IV CRC (n = 64) were categorized as eoCRC (age ≤ 50, n = 20) or aoCRC (age ≥ 60, n = 44). Untargeted plasma metabolomics and 16S rRNA amplicon sequencing (microbiome analysis) of tumor tissue were performed. We fit DIABLO (Data Integration Analysis for Biomarker Discovery using Latent variable approaches for Omics studies) to construct a supervised machine-learning classifier using paired multi-omics (microbiome and metabolomics) data and identify associations unique to eoCRC. A differential association network analysis was also performed. Distinct clustering patterns emerged in multi-omic dimension reduction analysis. The metabolomics classifier achieved an AUC of 0.98, compared to AUC 0.61 for microbiome-based classifier. Circular correlation technique highlighted several key associations. Metabolites glycerol and pseudouridine (higher abundance in individuals with aoCRC) had negative correlations with Parasutterella, and Ruminococcaceae (higher abundance in individuals with eoCRC). Cholesterol and xylitol correlated negatively with Erysipelatoclostridium and Eubacterium, and showed a positive correlation with Acidovorax with higher abundance in individuals with eoCRC. Network analysis revealed different clustering patterns and associations for several metabolites e.g.: urea cycle metabolites and microbes such as Akkermansia. We show that multi-omics analysis can be utilized to study host-microbiome correlations in eoCRC and demonstrates promising biomarker potential of a metabolomics classifier. The distinct host-microbiome correlations for urea cycle in eoCRC may offer opportunities for therapeutic interventions.","PeriodicalId":19433,"journal":{"name":"NPJ Precision Oncology","volume":null,"pages":null},"PeriodicalIF":6.8000,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11255257/pdf/","citationCount":"0","resultStr":"{\"title\":\"Multi-omics machine learning to study host-microbiome interactions in early-onset colorectal cancer\",\"authors\":\"Thejus T. Jayakrishnan, Naseer Sangwan, Shimoli V. Barot, Nicole Farha, Arshiya Mariam, Shao Xiang, Federico Aucejo, Madison Conces, Kanika G. Nair, Smitha S. Krishnamurthi, Stephanie L. Schmit, David Liska, Daniel M. Rotroff, Alok A. Khorana, Suneel D. Kamath\",\"doi\":\"10.1038/s41698-024-00647-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The incidence of early-onset colorectal cancer (eoCRC) is rising, and its pathogenesis is not completely understood. We hypothesized that machine learning utilizing paired tissue microbiome and plasma metabolome features could uncover distinct host-microbiome associations between eoCRC and average-onset CRC (aoCRC). Individuals with stages I–IV CRC (n = 64) were categorized as eoCRC (age ≤ 50, n = 20) or aoCRC (age ≥ 60, n = 44). Untargeted plasma metabolomics and 16S rRNA amplicon sequencing (microbiome analysis) of tumor tissue were performed. We fit DIABLO (Data Integration Analysis for Biomarker Discovery using Latent variable approaches for Omics studies) to construct a supervised machine-learning classifier using paired multi-omics (microbiome and metabolomics) data and identify associations unique to eoCRC. A differential association network analysis was also performed. Distinct clustering patterns emerged in multi-omic dimension reduction analysis. The metabolomics classifier achieved an AUC of 0.98, compared to AUC 0.61 for microbiome-based classifier. Circular correlation technique highlighted several key associations. Metabolites glycerol and pseudouridine (higher abundance in individuals with aoCRC) had negative correlations with Parasutterella, and Ruminococcaceae (higher abundance in individuals with eoCRC). Cholesterol and xylitol correlated negatively with Erysipelatoclostridium and Eubacterium, and showed a positive correlation with Acidovorax with higher abundance in individuals with eoCRC. Network analysis revealed different clustering patterns and associations for several metabolites e.g.: urea cycle metabolites and microbes such as Akkermansia. We show that multi-omics analysis can be utilized to study host-microbiome correlations in eoCRC and demonstrates promising biomarker potential of a metabolomics classifier. The distinct host-microbiome correlations for urea cycle in eoCRC may offer opportunities for therapeutic interventions.\",\"PeriodicalId\":19433,\"journal\":{\"name\":\"NPJ Precision Oncology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2024-07-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11255257/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NPJ Precision Oncology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.nature.com/articles/s41698-024-00647-1\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Precision Oncology","FirstCategoryId":"3","ListUrlMain":"https://www.nature.com/articles/s41698-024-00647-1","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
Multi-omics machine learning to study host-microbiome interactions in early-onset colorectal cancer
The incidence of early-onset colorectal cancer (eoCRC) is rising, and its pathogenesis is not completely understood. We hypothesized that machine learning utilizing paired tissue microbiome and plasma metabolome features could uncover distinct host-microbiome associations between eoCRC and average-onset CRC (aoCRC). Individuals with stages I–IV CRC (n = 64) were categorized as eoCRC (age ≤ 50, n = 20) or aoCRC (age ≥ 60, n = 44). Untargeted plasma metabolomics and 16S rRNA amplicon sequencing (microbiome analysis) of tumor tissue were performed. We fit DIABLO (Data Integration Analysis for Biomarker Discovery using Latent variable approaches for Omics studies) to construct a supervised machine-learning classifier using paired multi-omics (microbiome and metabolomics) data and identify associations unique to eoCRC. A differential association network analysis was also performed. Distinct clustering patterns emerged in multi-omic dimension reduction analysis. The metabolomics classifier achieved an AUC of 0.98, compared to AUC 0.61 for microbiome-based classifier. Circular correlation technique highlighted several key associations. Metabolites glycerol and pseudouridine (higher abundance in individuals with aoCRC) had negative correlations with Parasutterella, and Ruminococcaceae (higher abundance in individuals with eoCRC). Cholesterol and xylitol correlated negatively with Erysipelatoclostridium and Eubacterium, and showed a positive correlation with Acidovorax with higher abundance in individuals with eoCRC. Network analysis revealed different clustering patterns and associations for several metabolites e.g.: urea cycle metabolites and microbes such as Akkermansia. We show that multi-omics analysis can be utilized to study host-microbiome correlations in eoCRC and demonstrates promising biomarker potential of a metabolomics classifier. The distinct host-microbiome correlations for urea cycle in eoCRC may offer opportunities for therapeutic interventions.
期刊介绍:
Online-only and open access, npj Precision Oncology is an international, peer-reviewed journal dedicated to showcasing cutting-edge scientific research in all facets of precision oncology, spanning from fundamental science to translational applications and clinical medicine.