Tri D Vuong, Guangqi He, Haifei Hu, Babu Valliyodan, Dongho Lee, Philipp E Bayer, William T Schapaugh, Rene Hessel, David Edwards, Henry T Nguyen
{"title":"Identification of new genomic loci for seed protein and oil content in the soybean pangenome using genome-wide association and haplotype analyses.","authors":"Tri D Vuong, Guangqi He, Haifei Hu, Babu Valliyodan, Dongho Lee, Philipp E Bayer, William T Schapaugh, Rene Hessel, David Edwards, Henry T Nguyen","doi":"10.1007/s00122-025-05020-9","DOIUrl":null,"url":null,"abstract":"<p><p>The soybean [Glycine max (L.) Merr.] pangenome has been studied and shown to be an invaluable resource for investigating structural variations (SVs), from which different genomic markers were successfully developed and employed for genome-wide association studies (GWAS). Among the SVs markers, gene presence-and-absence variations (PAVs) have been developed in soybean, but have not been widely utilized for association analyses. Here, we reported GWAS and haplotype analysis of seed protein and oil content for two diverse panels, comprised over 500 soybean accessions evaluated in multiple field environments using three marker datasets, whole genome sequence (WGS)-single-nucleotide polymorphisms (SNPs), 50 K-SNPs, and PAVs. The analyses identified new quantitative trait loci (QTL) for protein and oil content, along with the validation of previously reported QTL for these traits. This includes a well-studied QTL on chromosome (Chr.) 20 and another one on Chr. 05 for protein and/or oil. Importantly, this study is the first to report a new genomic locus for both protein and oil mapped to Chr. 08. Gene ontology annotations and expression profiles suggested candidate genes. Further analyses using haplotype-based markers led to the identification of multiple haplotype blocks encompassing candidate genes. Among these, Glyma.05G243400 on Chr. 05 and Glyma.08G109900 and Glyma.08G110000 on Chr. 08 were identified as promising targets. These genes can be incorporated into soybean breeding programs to enhance the selection of desirable protein and oil phenotypes through a haplotype-based breeding approach.</p>","PeriodicalId":22955,"journal":{"name":"Theoretical and Applied Genetics","volume":"138 9","pages":"237"},"PeriodicalIF":4.2000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical and Applied Genetics","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s00122-025-05020-9","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
Abstract
The soybean [Glycine max (L.) Merr.] pangenome has been studied and shown to be an invaluable resource for investigating structural variations (SVs), from which different genomic markers were successfully developed and employed for genome-wide association studies (GWAS). Among the SVs markers, gene presence-and-absence variations (PAVs) have been developed in soybean, but have not been widely utilized for association analyses. Here, we reported GWAS and haplotype analysis of seed protein and oil content for two diverse panels, comprised over 500 soybean accessions evaluated in multiple field environments using three marker datasets, whole genome sequence (WGS)-single-nucleotide polymorphisms (SNPs), 50 K-SNPs, and PAVs. The analyses identified new quantitative trait loci (QTL) for protein and oil content, along with the validation of previously reported QTL for these traits. This includes a well-studied QTL on chromosome (Chr.) 20 and another one on Chr. 05 for protein and/or oil. Importantly, this study is the first to report a new genomic locus for both protein and oil mapped to Chr. 08. Gene ontology annotations and expression profiles suggested candidate genes. Further analyses using haplotype-based markers led to the identification of multiple haplotype blocks encompassing candidate genes. Among these, Glyma.05G243400 on Chr. 05 and Glyma.08G109900 and Glyma.08G110000 on Chr. 08 were identified as promising targets. These genes can be incorporated into soybean breeding programs to enhance the selection of desirable protein and oil phenotypes through a haplotype-based breeding approach.
期刊介绍:
Theoretical and Applied Genetics publishes original research and review articles in all key areas of modern plant genetics, plant genomics and plant biotechnology. All work needs to have a clear genetic component and significant impact on plant breeding. Theoretical considerations are only accepted in combination with new experimental data and/or if they indicate a relevant application in plant genetics or breeding. Emphasizing the practical, the journal focuses on research into leading crop plants and articles presenting innovative approaches.