Qiulong Yan, Liansha Huang, Shenghui Li, Yue Zhang, Ruochun Guo, Pan Zhang, Zhixin Lei, Qingbo Lv, Fang Chen, Zhiming Li, Jinxin Meng, Jing Li, Guangyang Wang, Changming Chen, Hayan Ullah, Lin Cheng, Shao Fan, Wei You, Yan Zhang, Jie Ma, Shanshan Sha, Wen Sun
{"title":"The Chinese gut virus catalogue reveals gut virome diversity and disease-related viral signatures.","authors":"Qiulong Yan, Liansha Huang, Shenghui Li, Yue Zhang, Ruochun Guo, Pan Zhang, Zhixin Lei, Qingbo Lv, Fang Chen, Zhiming Li, Jinxin Meng, Jing Li, Guangyang Wang, Changming Chen, Hayan Ullah, Lin Cheng, Shao Fan, Wei You, Yan Zhang, Jie Ma, Shanshan Sha, Wen Sun","doi":"10.1186/s13073-025-01460-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The gut viral community has been increasingly recognized for its role in human physiology and health; however, our understanding of its genetic makeup, functional potential, and disease associations remains incomplete.</p><p><strong>Methods: </strong>In this study, we collected 11,286 bulk or viral metagenomes from fecal samples across large-scale Chinese populations to establish a Chinese Gut Virus Catalogue (cnGVC) using a de novo virus identification approach. We then examined the diversity and compositional patterns of the gut virome in relation to common diseases by analyzing 6311 bulk metagenomes representing 28 disease or unhealthy states.</p><p><strong>Results: </strong>The cnGVC contains 93,462 nonredundant viral genomes, with over 70% of these being novel viruses not included in existing gut viral databases. This resource enabled us to characterize the functional diversity and specificity of the gut virome. Using cnGVC, we profiled the gut virome in large-scale populations, assessed sex- and age-related variations, and identified 4238 universal viral signatures of diseases. A random forest classifier based on these signatures achieved high accuracy in distinguishing diseased individuals from controls (AUC = 0.698) and high-risk patients from controls (AUC = 0.761), and its predictive ability was also validated in external cohorts.</p><p><strong>Conclusions: </strong>Our resources and findings significantly expand the current understanding of the human gut virome and provide a comprehensive view of the associations between gut viruses and common diseases. This will pave the way for novel strategies in the treatment and prevention of these diseases.</p>","PeriodicalId":12645,"journal":{"name":"Genome Medicine","volume":"17 1","pages":"30"},"PeriodicalIF":10.4000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11938785/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome Medicine","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13073-025-01460-6","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The gut viral community has been increasingly recognized for its role in human physiology and health; however, our understanding of its genetic makeup, functional potential, and disease associations remains incomplete.
Methods: In this study, we collected 11,286 bulk or viral metagenomes from fecal samples across large-scale Chinese populations to establish a Chinese Gut Virus Catalogue (cnGVC) using a de novo virus identification approach. We then examined the diversity and compositional patterns of the gut virome in relation to common diseases by analyzing 6311 bulk metagenomes representing 28 disease or unhealthy states.
Results: The cnGVC contains 93,462 nonredundant viral genomes, with over 70% of these being novel viruses not included in existing gut viral databases. This resource enabled us to characterize the functional diversity and specificity of the gut virome. Using cnGVC, we profiled the gut virome in large-scale populations, assessed sex- and age-related variations, and identified 4238 universal viral signatures of diseases. A random forest classifier based on these signatures achieved high accuracy in distinguishing diseased individuals from controls (AUC = 0.698) and high-risk patients from controls (AUC = 0.761), and its predictive ability was also validated in external cohorts.
Conclusions: Our resources and findings significantly expand the current understanding of the human gut virome and provide a comprehensive view of the associations between gut viruses and common diseases. This will pave the way for novel strategies in the treatment and prevention of these diseases.
期刊介绍:
Genome Medicine is an open access journal that publishes outstanding research applying genetics, genomics, and multi-omics to understand, diagnose, and treat disease. Bridging basic science and clinical research, it covers areas such as cancer genomics, immuno-oncology, immunogenomics, infectious disease, microbiome, neurogenomics, systems medicine, clinical genomics, gene therapies, precision medicine, and clinical trials. The journal publishes original research, methods, software, and reviews to serve authors and promote broad interest and importance in the field.