An investigation of machine learning methods applied to genomic prediction in yellow-feathered broilers

IF 3.8 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE
Bogong Liu , Huichao Liu , Junhao Tu , Jian Xiao , Jie Yang , Xi He , Haihan Zhang
{"title":"An investigation of machine learning methods applied to genomic prediction in yellow-feathered broilers","authors":"Bogong Liu ,&nbsp;Huichao Liu ,&nbsp;Junhao Tu ,&nbsp;Jian Xiao ,&nbsp;Jie Yang ,&nbsp;Xi He ,&nbsp;Haihan Zhang","doi":"10.1016/j.psj.2024.104489","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning (ML) methods have rapidly developed in various theoretical and practical research areas, including predicting genomic breeding values for large livestock animals. However, few studies have investigated the application of ML in broiler breeding. In this study, seven different ML methods—support vector regression (SVR), random forest (RF), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), kernel ridge regression (KRR) and multilayer perceptron (MLP) were employed to predict the genomic breeding values of laying traits, growth and carcass traits in a yellow-feathered broiler breeding population. The results indicated that classic methods, such as GBLUP and Bayesian, achieved superior prediction accuracy compared to ML methods in five of the eight traits. For half-eviscerated weight (HEW), ML methods showed an average improvement of 54.4% over GBLUP and Bayesian methods. Among the ML methods, SVR, RF, GBDT, and XGBoost exhibited improvements exceeding 60%, with respective values of 61.3%, 61.0%, 60.4%, and 60.7%; while MLP improved by 54.4% and LightGBM by 53.7%, KRR had the lowest improvement at 29.4%. For eviscerated weight (EW), ML methods still outperformed GBLUP and Bayesian methods. MLP gained the largest improvement at 19.0%, while SVR, RF, GBDT, XGBoost, LightGBM, and KRR improved by 15.0%, 16.5%, 9.5%, 7.0%, 1.6%, and 15.9%, respectively. Compared to default hyperparameters, the average improvement of ML methods with tuned hyperparameters was 34.0%, 32.9%, 27.0%, 19.3%, 26.8%, 13.2%, 18.9%, and 46.3%, respectively. The prediction accuracy of above algorithms could be optimized using genome-wide association study (GWAS) to select subsets of significant SNPs. This work provides valuable insights into genomic prediction, aiding genetic breeding in broilers.</div></div>","PeriodicalId":20459,"journal":{"name":"Poultry Science","volume":"104 1","pages":"Article 104489"},"PeriodicalIF":3.8000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Poultry Science","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0032579124010678","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Machine learning (ML) methods have rapidly developed in various theoretical and practical research areas, including predicting genomic breeding values for large livestock animals. However, few studies have investigated the application of ML in broiler breeding. In this study, seven different ML methods—support vector regression (SVR), random forest (RF), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), kernel ridge regression (KRR) and multilayer perceptron (MLP) were employed to predict the genomic breeding values of laying traits, growth and carcass traits in a yellow-feathered broiler breeding population. The results indicated that classic methods, such as GBLUP and Bayesian, achieved superior prediction accuracy compared to ML methods in five of the eight traits. For half-eviscerated weight (HEW), ML methods showed an average improvement of 54.4% over GBLUP and Bayesian methods. Among the ML methods, SVR, RF, GBDT, and XGBoost exhibited improvements exceeding 60%, with respective values of 61.3%, 61.0%, 60.4%, and 60.7%; while MLP improved by 54.4% and LightGBM by 53.7%, KRR had the lowest improvement at 29.4%. For eviscerated weight (EW), ML methods still outperformed GBLUP and Bayesian methods. MLP gained the largest improvement at 19.0%, while SVR, RF, GBDT, XGBoost, LightGBM, and KRR improved by 15.0%, 16.5%, 9.5%, 7.0%, 1.6%, and 15.9%, respectively. Compared to default hyperparameters, the average improvement of ML methods with tuned hyperparameters was 34.0%, 32.9%, 27.0%, 19.3%, 26.8%, 13.2%, 18.9%, and 46.3%, respectively. The prediction accuracy of above algorithms could be optimized using genome-wide association study (GWAS) to select subsets of significant SNPs. This work provides valuable insights into genomic prediction, aiding genetic breeding in broilers.
将机器学习方法应用于黄羽肉鸡基因组预测的研究。
机器学习(ML)方法在各种理论和实践研究领域得到了迅速发展,包括预测大型家畜的基因组育种价值。然而,很少有研究调查 ML 在肉鸡育种中的应用。本研究采用支持向量回归(SVR)、随机森林(RF)、梯度提升决策树(GBDT)、极端梯度提升(XGBoost)、轻梯度提升机(LightGBM)、核岭回归(KRR)和多层感知器(MLP)等七种不同的 ML 方法预测黄羽肉鸡育种群体的产蛋性状、生长和胴体性状的基因组育种值。结果表明,与 MLP 方法相比,GBLUP 和贝叶斯等传统方法在 8 个性状中的 5 个性状的预测准确率更高。在半裂重(HEW)方面,ML 方法比 GBLUP 和贝叶斯方法平均提高了 54.4%。在 ML 方法中,SVR、RF、GBDT 和 XGBoost 的改进率超过了 60%,分别为 61.3%、61.0%、60.4% 和 60.7%;而 MLP 的改进率为 54.4%,LightGBM 为 53.7%,KRR 的改进率最低,仅为 29.4%。就蒸发重量(EW)而言,ML 方法仍然优于 GBLUP 和贝叶斯方法。MLP 的改进幅度最大,为 19.0%,而 SVR、RF、GBDT、XGBoost、LightGBM 和 KRR 的改进幅度分别为 15.0%、16.5%、9.5%、7.0%、1.6% 和 15.9%。与默认超参数相比,采用调整超参数的 ML 方法的平均改进率分别为 34.0%、32.9%、27.0%、19.3%、26.8%、13.2%、18.9% 和 46.3%。利用全基因组关联研究(GWAS)选择重要的 SNPs 子集,可以优化上述算法的预测准确性。这项工作为基因组预测提供了宝贵的见解,有助于肉鸡的遗传育种。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Poultry Science
Poultry Science 农林科学-奶制品与动物科学
CiteScore
7.60
自引率
15.90%
发文量
0
审稿时长
94 days
期刊介绍: First self-published in 1921, Poultry Science is an internationally renowned monthly journal, known as the authoritative source for a broad range of poultry information and high-caliber research. The journal plays a pivotal role in the dissemination of preeminent poultry-related knowledge across all disciplines. As of January 2020, Poultry Science will become an Open Access journal with no subscription charges, meaning authors who publish here can make their research immediately, permanently, and freely accessible worldwide while retaining copyright to their work. Papers submitted for publication after October 1, 2019 will be published as Open Access papers. An international journal, Poultry Science publishes original papers, research notes, symposium papers, and reviews of basic science as applied to poultry. This authoritative source of poultry information is consistently ranked by ISI Impact Factor as one of the top 10 agriculture, dairy and animal science journals to deliver high-caliber research. Currently it is the highest-ranked (by Impact Factor and Eigenfactor) journal dedicated to publishing poultry research. Subject areas include breeding, genetics, education, production, management, environment, health, behavior, welfare, immunology, molecular biology, metabolism, nutrition, physiology, reproduction, processing, and products.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信