Setegn Worku Alemu, Timothy P. Bilton, Patricia L. Johnson, Benjamin J. Perry, Hannah Henry, Ken G. Dodds, John C. McEwan, Suzanne J. Rowe
Genetics Selection Evolution, journal article, published 2025-07-17. DOI: https://doi.org/10.1186/s12711-025-00987-x
Improving genomic prediction accuracy for methane emission and feed efficiency in sheep: integrating rumen microbial PCA with host genomic variation using neural network GBLUP (NN-GBLUP)
Methane emissions from ruminant livestock pose a significant challenge to mitigating climate change. Genomic selection offers a promising approach to reduce methane emissions, but prediction accuracy remains low due to the high cost of measuring methane emissions. Integrating rumen microbiome composition (RMC) data may improve genomic prediction accuracy, yet the high dimensionality of RMC data presents computational challenges. This study aimed to (1) evaluate the effectiveness of principal component analysis (PCA) for reducing RMC data dimensionality while retaining essential information, and (2) assess whether incorporating PCA-reduced RMC data as intermediate traits in a Neural Network Genomic Best Linear Unbiased Prediction (NN-GBLUP) model improves genomic prediction accuracy for methane emissions and feed efficiency traits in sheep.

For the first objective, principal components (PCs) explaining 100% of the variation effectively captured RMC information, with microbiability estimates closely matching those from the full dataset. For the second objective, the NN-GBLUP model incorporating PCA-reduced RMC data improved prediction accuracy compared to standard GBLUP methods. Using PCs explaining 25% of total RMC variation, prediction accuracy for methane emissions increased from 0.09 to 0.30 in train-test validation and from 0.15 to 0.27 in five-fold cross-validation. For residual feed intake, accuracy improved from 0.25 to 0.37 in train-test validation and from 0.25 to 0.34 in cross-validation. The best-performing set of PCs varied by trait, with PCs explaining 25% or 50% of the variation giving the best results. Prediction accuracy did not improve for carbon dioxide emissions, live weight, and mid-intake, indicating trait-dependent microbiome influence.

PCA reduced the dimensionality of rumen microbiome data while preserving essential biological information. Integrating these PCA-reduced data with host genomic information through an NN-GBLUP model substantially improved genomic prediction accuracy for methane emissions and feed efficiency in sheep. PCs explaining 25% and 50% of the variation yielded the highest accuracy, whereas PCs explaining larger shares of the variation (75% and 95%) reduced accuracy for methane traits. This approach shows promise for implementing genomic selection strategies to reduce methane emissions and improve feed efficiency in ruminant livestock in a computationally efficient manner, thereby contributing to climate change mitigation efforts in agriculture.
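
One common way to read the variance thresholds in the abstract (25%, 50%, 75%, 95%) is as the smallest number of leading PCs whose cumulative explained variance reaches the target. The sketch below illustrates only that selection step with NumPy on simulated data. The function name `pcs_for_variance`, the toy matrix dimensions, and the omission of any transformation of the microbiome counts are assumptions made for illustration; this is not the authors' pipeline.

```python
import numpy as np


def pcs_for_variance(X, threshold):
    """Return (scores, reached) for the fewest leading PCs whose cumulative
    explained variance reaches `threshold`, a fraction in (0, 1]."""
    Xc = X - X.mean(axis=0)                       # centre each feature (column)
    U, s, _ = np.linalg.svd(Xc, full_matrices=False)
    ratio = s**2 / np.sum(s**2)                   # variance share of each PC
    cum = np.cumsum(ratio)
    # First position where the cumulative share reaches the threshold
    # (small tolerance guards against floating-point round-off near 1.0).
    k = int(np.searchsorted(cum, threshold - 1e-12)) + 1
    k = min(k, len(s))
    scores = U[:, :k] * s[:k]                     # PC scores, one row per animal
    return scores, float(cum[k - 1])


# Hypothetical stand-in for an RMC matrix: 50 animals x 200 microbial features.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 200)) * np.linspace(3.0, 0.1, 200)

for t in (0.25, 0.50, 0.75, 0.95):
    scores, reached = pcs_for_variance(X, t)
    print(f"threshold {t:.2f}: {scores.shape[1]} PCs, "
          f"cumulative variance {reached:.3f}")
```

The returned score matrix has one column per retained PC, which is the form in which reduced RMC data could be passed on as intermediate traits. How many PCs each threshold yields depends entirely on the data, so the counts printed for the simulated matrix say nothing about the study's dataset.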
About the journal:
Genetics Selection Evolution invites basic, applied and methodological content that will aid the current understanding and the utilization of genetic variability in domestic animal species. Although the focus is on domestic animal species, research on other species is invited if it contributes to the understanding of the use of genetic variability in domestic animals. Genetics Selection Evolution publishes results from all levels of study, from the gene to the quantitative trait, from the individual to the population, the breed or the species. Contributions concerning both the biological approach, from molecular genetics to quantitative genetics, as well as the mathematical approach, from population genetics to statistics, are welcome. Specific areas of interest include but are not limited to: gene and QTL identification, mapping and characterization, analysis of new phenotypes, high-throughput SNP data analysis, functional genomics, cytogenetics, genetic diversity of populations and breeds, genetic evaluation, applied and experimental selection, genomic selection, selection efficiency, and statistical methodology for the genetic analysis of phenotypes with quantitative and mixed inheritance.