Yin-Cheng Chen, Yin-Yuan Su, Tzu-Yu Chu, Ming-Fong Wu, Chieh-Chun Huang, Chen-Ching Lin
PreLect: Prevalence leveraged consistent feature selection decodes microbial signatures across cohorts.
npj Biofilms and Microbiomes 11(1):3. Journal Article. Published 2025-01-03.
DOI: https://doi.org/10.1038/s41522-024-00598-2
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11698977/pdf/
The intricate nature of microbiota sequencing data, namely its high dimensionality and sparsity, presents a challenge in identifying informative and reproducible microbial features for both research and clinical applications. Addressing this, we introduce PreLect, an innovative feature selection framework that harnesses microbes' prevalence to facilitate consistent selection in sparse microbiota data. Upon rigorous benchmarking against established feature selection methodologies across 42 microbiome datasets, PreLect demonstrated superior classification capabilities compared to statistical methods and outperformed machine learning-based methods by selecting features with greater prevalence and abundance. A significant strength of PreLect lies in its ability to reliably identify reproducible microbial features across varied cohorts. Applied to colorectal cancer, PreLect identifies key microbes and highlights crucial pathways, such as lipopolysaccharide and glycerophospholipid biosynthesis, in cancer progression. This case study exemplifies PreLect's utility in discerning clinically relevant microbial signatures. In summary, PreLect's accuracy and robustness make it a significant advancement in the analysis of complex microbiota data.
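To make the notion of prevalence used in the abstract concrete, here is a minimal sketch. It is not PreLect's algorithm, which the abstract does not specify; it only computes each feature's prevalence (the fraction of samples in which the feature is detected) from a toy count matrix and lists the rarely detected features. The `prevalence` helper, the count values, and the 0.5 cutoff are all illustrative assumptions.

```python
def prevalence(counts):
    """Return, per feature (column), the fraction of samples (rows) with a nonzero count."""
    n_samples = len(counts)
    n_features = len(counts[0])
    return [
        sum(1 for row in counts if row[j] > 0) / n_samples
        for j in range(n_features)
    ]


# Toy count matrix: 4 samples (rows) x 3 microbial features (columns).
# The values are made up for illustration.
counts = [
    [10, 0, 3],
    [5, 0, 0],
    [8, 2, 0],
    [12, 0, 0],
]

prev = prevalence(counts)

# Hypothetical cutoff: flag features detected in fewer than half of the samples.
rare = [j for j, p in enumerate(prev) if p < 0.5]
```

In sparse data such as this, a feature that is nonzero in only one or two samples can look discriminative by chance, which is one plausible reason a selection method might favor more prevalent features.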
About the journal:
npj Biofilms and Microbiomes is a comprehensive platform that promotes research on biofilms and microbiomes across various scientific disciplines. The journal facilitates cross-disciplinary discussions to enhance our understanding of the biology, ecology, and communal functions of biofilms, populations, and communities. It also focuses on applications in the medical, environmental, and engineering domains. The scope of the journal encompasses all aspects of the field, ranging from cell-cell communication and single cell interactions to the microbiomes of humans, animals, plants, and natural and built environments. The journal also welcomes research on the virome, phageome, mycome, and fungome. It publishes both applied science and theoretical work. As an open access and interdisciplinary journal, its primary goal is to publish significant scientific advancements in microbial biofilms and microbiomes. The journal enables discussions that span multiple disciplines and contributes to our understanding of the social behavior of microbial biofilm populations and communities, and their impact on life, human health, and the environment.