Use of continuous genotypes for genomic prediction in sugarcane

The Plant Genome Pub Date : 2023-12-08 DOI:10.1002/tpg2.20417

Seema Yadav, Elizabeth M. Ross, Xianming Wei, Shouye Liu, Loan To Nguyen, Owen Powell, Lee T. Hickey, Emily Deomano, Felicity Atkin, Kai P. Voss-Fels, Ben J. Hayes

{"title":"Use of continuous genotypes for genomic prediction in sugarcane","authors":"Seema Yadav, Elizabeth M. Ross, Xianming Wei, Shouye Liu, Loan To Nguyen, Owen Powell, Lee T. Hickey, Emily Deomano, Felicity Atkin, Kai P. Voss-Fels, Ben J. Hayes","doi":"10.1002/tpg2.20417","DOIUrl":null,"url":null,"abstract":"Genomic selection in sugarcane faces challenges due to limited genomic tools and high genomic complexity, particularly because of its high and variable ploidy. The classification of genotypes for single nucleotide polymorphisms (SNPs) becomes difficult due to the wide range of possible allele dosages. Previous genomic studies in sugarcane used pseudo-diploid genotyping, grouping all heterozygotes into a single class. In this study, we investigate the use of continuous genotypes as a proxy for allele-dosage in genomic prediction models. The hypothesis is that continuous genotypes could better reflect allele dosage at SNPs linked to mutations affecting target traits, resulting in phenotypic variation. The dataset included genotypes of 1318 clones at 58K SNP markers, with about 26K markers filtered using standard quality controls. Predictions for tonnes of cane per hectare (TCH), commercial cane sugar (CCS), and fiber content (Fiber) were made using parametric, non-parametric, and Bayesian methods. Continuous genotypes increased accuracy by 5%–7% for CCS and Fiber. The pseudo-diploid parametrization performed better for TCH. Reproducing kernel Hilbert spaces model with Gaussian kernel and AK4 (arc-cosine kernel with hidden layer 4) kernel outperformed other methods for TCH and CCS, suggesting that non-additive effects might influence these traits. The prevalence of low-dosage markers in the study may have limited the benefits of approximating allele-dosage information with continuous genotypes in genomic prediction models. Continuous genotypes simplify genomic prediction in polyploid crops, allowing additional markers to be used without adhering to pseudo-diploid inheritance. The approach can particularly benefit high ploidy species or emerging crops with unknown ploidy.","PeriodicalId":501653,"journal":{"name":"The Plant Genome","volume":"3 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Plant Genome","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/tpg2.20417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Genomic selection in sugarcane faces challenges due to limited genomic tools and high genomic complexity, particularly because of its high and variable ploidy. The classification of genotypes for single nucleotide polymorphisms (SNPs) becomes difficult due to the wide range of possible allele dosages. Previous genomic studies in sugarcane used pseudo-diploid genotyping, grouping all heterozygotes into a single class. In this study, we investigate the use of continuous genotypes as a proxy for allele-dosage in genomic prediction models. The hypothesis is that continuous genotypes could better reflect allele dosage at SNPs linked to mutations affecting target traits, resulting in phenotypic variation. The dataset included genotypes of 1318 clones at 58K SNP markers, with about 26K markers filtered using standard quality controls. Predictions for tonnes of cane per hectare (TCH), commercial cane sugar (CCS), and fiber content (Fiber) were made using parametric, non-parametric, and Bayesian methods. Continuous genotypes increased accuracy by 5%–7% for CCS and Fiber. The pseudo-diploid parametrization performed better for TCH. Reproducing kernel Hilbert spaces model with Gaussian kernel and AK4 (arc-cosine kernel with hidden layer 4) kernel outperformed other methods for TCH and CCS, suggesting that non-additive effects might influence these traits. The prevalence of low-dosage markers in the study may have limited the benefits of approximating allele-dosage information with continuous genotypes in genomic prediction models. Continuous genotypes simplify genomic prediction in polyploid crops, allowing additional markers to be used without adhering to pseudo-diploid inheritance. The approach can particularly benefit high ploidy species or emerging crops with unknown ploidy.

查看原文本刊更多论文

利用连续基因型进行甘蔗基因组预测

由于基因组工具有限和基因组的高度复杂性，甘蔗的基因组选择面临着挑战，特别是由于其倍性高且多变。单核苷酸多态性（SNPs）基因型的分类也因等位基因剂量范围广而变得困难。以前的甘蔗基因组研究使用假二倍体基因分型，将所有杂合子归为一类。在本研究中，我们调查了在基因组预测模型中使用连续基因型作为等位基因剂量的替代物的情况。我们的假设是，连续基因型能更好地反映与影响目标性状的突变相关的 SNP 的等位基因剂量，从而导致表型变异。数据集包括 58K SNP 标记的 1318 个克隆的基因型，其中约 26K 标记使用标准质量控制进行了过滤。使用参数、非参数和贝叶斯方法对每公顷甘蔗吨数（TCH）、商品蔗糖（CCS）和纤维含量（Fiber）进行了预测。连续基因型使 CCS 和纤维含量的准确性提高了 5%-7%。假二倍体参数化在 TCH 方面表现更好。使用高斯核和 AK4（带隐藏层 4 的弧余弦核）核的重现核希尔伯特空间模型在 TCH 和 CCS 方面的表现优于其他方法，这表明非加成效应可能会影响这些性状。研究中低剂量标记的普遍存在可能限制了在基因组预测模型中用连续基因型近似等位基因剂量信息的好处。连续基因型可简化多倍体作物的基因组预测，允许使用额外的标记，而不遵循假二倍体遗传。这种方法尤其有利于高倍性物种或倍性未知的新兴作物。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The Plant Genome

自引率

0.00%

发文量