利用生物注释神经网络框架提高奶牛基因组预测的准确性。

IF 6.5 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE

Journal of Animal Science and Biotechnology Pub Date : 2024-07-01 DOI:10.1186/s40104-024-01044-1

Xue Wang, Shaolei Shi, Md Yousuf Ali Khan, Zhe Zhang, Yi Zhang

{"title":"利用生物注释神经网络框架提高奶牛基因组预测的准确性。","authors":"Xue Wang, Shaolei Shi, Md Yousuf Ali Khan, Zhe Zhang, Yi Zhang","doi":"10.1186/s40104-024-01044-1","DOIUrl":null,"url":null,"abstract":"Background: Biologically annotated neural networks (BANNs) are feedforward Bayesian neural network models that utilize partially connected architectures based on SNP-set annotations. As an interpretable neural network, BANNs model SNP and SNP-set effects in their input and hidden layers, respectively. Furthermore, the weights and connections of the network are regarded as random variables with prior distributions reflecting the manifestation of genetic effects at various genomic scales. However, its application in genomic prediction has yet to be explored.Results: This study extended the BANNs framework to the area of genomic selection and explored the optimal SNP-set partitioning strategies by using dairy cattle datasets. The SNP-sets were partitioned based on two strategies-gene annotations and 100 kb windows, denoted as BANN_gene and BANN_100kb, respectively. The BANNs model was compared with GBLUP, random forest (RF), BayesB and BayesCπ through five replicates of five-fold cross-validation using genotypic and phenotypic data on milk production traits, type traits, and one health trait of 6,558, 6,210 and 5,962 Chinese Holsteins, respectively. Results showed that the BANNs framework achieves higher genomic prediction accuracy compared to GBLUP, RF and Bayesian methods. Specifically, the BANN_100kb demonstrated superior accuracy and the BANN_gene exhibited generally suboptimal accuracy compared to GBLUP, RF, BayesB and BayesCπ across all traits. The average accuracy improvements of BANN_100kb over GBLUP, RF, BayesB and BayesCπ were 4.86%, 3.95%, 3.84% and 1.92%, and the accuracy of BANN_gene was improved by 3.75%, 2.86%, 2.73% and 0.85% compared to GBLUP, RF, BayesB and BayesCπ, respectively across all seven traits. Meanwhile, both BANN_100kb and BANN_gene yielded lower overall mean square error values than GBLUP, RF and Bayesian methods.Conclusion: Our findings demonstrated that the BANNs framework performed better than traditional genomic prediction methods in our tested scenarios, and might serve as a promising alternative approach for genomic prediction in dairy cattle.","PeriodicalId":64067,"journal":{"name":"Journal of Animal Science and Biotechnology","volume":"15 1","pages":"87"},"PeriodicalIF":6.5000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11215832/pdf/","citationCount":"0","resultStr":"{\"title\":\"Improving the accuracy of genomic prediction in dairy cattle using the biologically annotated neural networks framework.\",\"authors\":\"Xue Wang, Shaolei Shi, Md Yousuf Ali Khan, Zhe Zhang, Yi Zhang\",\"doi\":\"10.1186/s40104-024-01044-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Biologically annotated neural networks (BANNs) are feedforward Bayesian neural network models that utilize partially connected architectures based on SNP-set annotations. As an interpretable neural network, BANNs model SNP and SNP-set effects in their input and hidden layers, respectively. Furthermore, the weights and connections of the network are regarded as random variables with prior distributions reflecting the manifestation of genetic effects at various genomic scales. However, its application in genomic prediction has yet to be explored.Results: This study extended the BANNs framework to the area of genomic selection and explored the optimal SNP-set partitioning strategies by using dairy cattle datasets. The SNP-sets were partitioned based on two strategies-gene annotations and 100 kb windows, denoted as BANN_gene and BANN_100kb, respectively. The BANNs model was compared with GBLUP, random forest (RF), BayesB and BayesCπ through five replicates of five-fold cross-validation using genotypic and phenotypic data on milk production traits, type traits, and one health trait of 6,558, 6,210 and 5,962 Chinese Holsteins, respectively. Results showed that the BANNs framework achieves higher genomic prediction accuracy compared to GBLUP, RF and Bayesian methods. Specifically, the BANN_100kb demonstrated superior accuracy and the BANN_gene exhibited generally suboptimal accuracy compared to GBLUP, RF, BayesB and BayesCπ across all traits. The average accuracy improvements of BANN_100kb over GBLUP, RF, BayesB and BayesCπ were 4.86%, 3.95%, 3.84% and 1.92%, and the accuracy of BANN_gene was improved by 3.75%, 2.86%, 2.73% and 0.85% compared to GBLUP, RF, BayesB and BayesCπ, respectively across all seven traits. Meanwhile, both BANN_100kb and BANN_gene yielded lower overall mean square error values than GBLUP, RF and Bayesian methods.Conclusion: Our findings demonstrated that the BANNs framework performed better than traditional genomic prediction methods in our tested scenarios, and might serve as a promising alternative approach for genomic prediction in dairy cattle.\",\"PeriodicalId\":64067,\"journal\":{\"name\":\"Journal of Animal Science and Biotechnology\",\"volume\":\"15 1\",\"pages\":\"87\"},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2024-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11215832/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Animal Science and Biotechnology\",\"FirstCategoryId\":\"1089\",\"ListUrlMain\":\"https://doi.org/10.1186/s40104-024-01044-1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, DAIRY & ANIMAL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Animal Science and Biotechnology","FirstCategoryId":"1089","ListUrlMain":"https://doi.org/10.1186/s40104-024-01044-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}

引用次数: 0

摘要

背景：生物注释神经网络（BANNs）是一种前馈贝叶斯神经网络模型，它利用基于SNP集注释的部分连接架构。作为一种可解释的神经网络，BANNs 在其输入层和隐藏层分别模拟 SNP 和 SNP 集效应。此外，网络的权重和连接被视为随机变量，其先验分布反映了遗传效应在不同基因组尺度上的表现。然而，其在基因组预测中的应用还有待探索：本研究将 BANNs 框架扩展到基因组选择领域，并利用奶牛数据集探索最佳 SNP 集划分策略。SNP集的划分基于两种策略--基因注释和100 kb窗口，分别称为BANN_gene和BANN_100kb。利用 6558 头、6210 头和 5962 头中国荷斯坦牛的产奶性状、类型性状和一个健康性状的基因型和表型数据，通过五次重复的五倍交叉验证，将 BANNs 模型与 GBLUP、随机森林（RF）、BayesB 和 BayesCπ 进行了比较。结果表明，与 GBLUP、RF 和贝叶斯方法相比，BANNs 框架实现了更高的基因组预测精度。具体来说，在所有性状上，与 GBLUP、RF、BayesB 和 BayesCπ 相比，BANN_100kb 表现出更高的准确性，而 BANN_gene 一般表现出次优的准确性。与 GBLUP、RF、BayesB 和 BayesCπ 相比，BANN_100kb 在所有七个性状上的平均准确率分别提高了 4.86%、3.95%、3.84% 和 1.92%，BANN_gene 在所有七个性状上的准确率分别提高了 3.75%、2.86%、2.73% 和 0.85%。同时，与 GBLUP、RF 和贝叶斯方法相比，BANN_100kb 和 BANN_gene 的总体均方误差值更低：我们的研究结果表明，在我们测试的情况下，BANNs 框架比传统的基因组预测方法表现更好，可以作为奶牛基因组预测的一种有前途的替代方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Improving the accuracy of genomic prediction in dairy cattle using the biologically annotated neural networks framework.

查看原文本刊更多论文

Improving the accuracy of genomic prediction in dairy cattle using the biologically annotated neural networks framework.

Background: Biologically annotated neural networks (BANNs) are feedforward Bayesian neural network models that utilize partially connected architectures based on SNP-set annotations. As an interpretable neural network, BANNs model SNP and SNP-set effects in their input and hidden layers, respectively. Furthermore, the weights and connections of the network are regarded as random variables with prior distributions reflecting the manifestation of genetic effects at various genomic scales. However, its application in genomic prediction has yet to be explored.

Results: This study extended the BANNs framework to the area of genomic selection and explored the optimal SNP-set partitioning strategies by using dairy cattle datasets. The SNP-sets were partitioned based on two strategies-gene annotations and 100 kb windows, denoted as BANN_gene and BANN_100kb, respectively. The BANNs model was compared with GBLUP, random forest (RF), BayesB and BayesCπ through five replicates of five-fold cross-validation using genotypic and phenotypic data on milk production traits, type traits, and one health trait of 6,558, 6,210 and 5,962 Chinese Holsteins, respectively. Results showed that the BANNs framework achieves higher genomic prediction accuracy compared to GBLUP, RF and Bayesian methods. Specifically, the BANN_100kb demonstrated superior accuracy and the BANN_gene exhibited generally suboptimal accuracy compared to GBLUP, RF, BayesB and BayesCπ across all traits. The average accuracy improvements of BANN_100kb over GBLUP, RF, BayesB and BayesCπ were 4.86%, 3.95%, 3.84% and 1.92%, and the accuracy of BANN_gene was improved by 3.75%, 2.86%, 2.73% and 0.85% compared to GBLUP, RF, BayesB and BayesCπ, respectively across all seven traits. Meanwhile, both BANN_100kb and BANN_gene yielded lower overall mean square error values than GBLUP, RF and Bayesian methods.

Conclusion: Our findings demonstrated that the BANNs framework performed better than traditional genomic prediction methods in our tested scenarios, and might serve as a promising alternative approach for genomic prediction in dairy cattle.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Animal Science and Biotechnology

CiteScore

10.30

自引率

0.00%

发文量

822