An Extended Application of the Fast Multi-Locus Ridge Regression Algorithm in Genome-Wide Association Studies of Categorical Phenotypes

Plants Pub Date : 2024-09-07 DOI:10.3390/plants13172520
Jin Zhang, Bolin Shen, Ziyang Zhou, Mingzhi Cai, Xinyi Wu, Le Han, Yangjun Wen
{"title":"An Extended Application of the Fast Multi-Locus Ridge Regression Algorithm in Genome-Wide Association Studies of Categorical Phenotypes","authors":"Jin Zhang, Bolin Shen, Ziyang Zhou, Mingzhi Cai, Xinyi Wu, Le Han, Yangjun Wen","doi":"10.3390/plants13172520","DOIUrl":null,"url":null,"abstract":"Categorical (either binary or ordinal) quantitative traits are widely observed to measure count and resistance in plants. Unlike continuous traits, categorical traits often provide less detailed insights into genetic variation and possess a more complex underlying genetic architecture, which presents additional challenges for their genome-wide association studies. Meanwhile, methods designed for binary or continuous phenotypes are commonly used to inappropriately analyze ordinal traits, which leads to the loss of original phenotype information and the detection power of quantitative trait nucleotides (QTN). To address these issues, fast multi-locus ridge regression (FastRR), which was originally designed for continuous traits, is used to directly analyze binary or ordinal traits in this study. FastRR includes three stages of continuous transformation, variable reduction, and parameter estimation, and it can computationally handle categorical phenotype data instead of link functions introduced or methods inappropriately used. A series of simulation studies demonstrate that, compared with four other continuous or binary or ordinal approaches, including logistic regression, FarmCPU, FaST-LMM, and POLMM, the FastRR method outperforms in the detection of small-effect QTN, accuracy of estimated effect, and computation speed. We applied FastRR to 14 binary or ordinal phenotypes in the Arabidopsis real dataset and identified 479 significant loci and 76 known genes, at least seven times as many as detected by other algorithms. These findings underscore the potential of FastRR as a very useful tool for genome-wide association studies and novel gene mining of binary and ordinal traits.","PeriodicalId":20103,"journal":{"name":"Plants","volume":"18 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plants","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/plants13172520","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Categorical (either binary or ordinal) quantitative traits are widely observed to measure count and resistance in plants. Unlike continuous traits, categorical traits often provide less detailed insights into genetic variation and possess a more complex underlying genetic architecture, which presents additional challenges for their genome-wide association studies. Meanwhile, methods designed for binary or continuous phenotypes are commonly used to inappropriately analyze ordinal traits, which leads to the loss of original phenotype information and the detection power of quantitative trait nucleotides (QTN). To address these issues, fast multi-locus ridge regression (FastRR), which was originally designed for continuous traits, is used to directly analyze binary or ordinal traits in this study. FastRR includes three stages of continuous transformation, variable reduction, and parameter estimation, and it can computationally handle categorical phenotype data instead of link functions introduced or methods inappropriately used. A series of simulation studies demonstrate that, compared with four other continuous or binary or ordinal approaches, including logistic regression, FarmCPU, FaST-LMM, and POLMM, the FastRR method outperforms in the detection of small-effect QTN, accuracy of estimated effect, and computation speed. We applied FastRR to 14 binary or ordinal phenotypes in the Arabidopsis real dataset and identified 479 significant loci and 76 known genes, at least seven times as many as detected by other algorithms. These findings underscore the potential of FastRR as a very useful tool for genome-wide association studies and novel gene mining of binary and ordinal traits.
快速多焦点岭回归算法在分类表型全基因组关联研究中的扩展应用
分类(二元或序数)数量性状被广泛用于测量植物的数量和抗性。与连续性性状不同的是,分类性状通常无法提供详细的遗传变异信息,其潜在的遗传结构也更为复杂,这给全基因组关联研究带来了更多挑战。同时,为二元或连续表型设计的方法通常被不恰当地用于分析顺序性状,从而导致原始表型信息和数量性状核苷酸(QTN)检测能力的损失。为了解决这些问题,本研究采用了最初为连续性性状设计的快速多焦点脊回归(FastRR)来直接分析二元或序数性状。FastRR 包括连续变换、变量减少和参数估计三个阶段,它可以计算处理分类表型数据,而不是引入链接函数或不适当地使用方法。一系列模拟研究表明,与其他四种连续或二元或序数方法(包括逻辑回归、FarmCPU、FaST-LMM 和 POLMM)相比,FastRR 方法在检测小效应 QTN、估计效应的准确性和计算速度方面都更胜一筹。我们将 FastRR 应用于拟南芥真实数据集中的 14 种二元或顺序表型,发现了 479 个重要基因座和 76 个已知基因,至少是其他算法检测到的基因数量的七倍。这些发现强调了FastRR作为全基因组关联研究以及二元和顺序性状新基因挖掘的有用工具的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信