{"title":"Detection of genes associated with soybean protein content using a genome-wide association study.","authors":"Zhiyuan Yu, Bo Hu, Hailong Ning, Wen-Xia Li","doi":"10.1007/s11103-025-01576-8","DOIUrl":null,"url":null,"abstract":"<p><p>The protein content in soybean seeds serves as a crucial measure of soybean quality. Breeding high-protein varieties remains the most cost-effective and efficient approach to increasing soybean protein levels. Nevertheless, limited research has focused on identifying the genes responsible for high protein content among the diverse soybean cultivars. To address this gap, a genome-wide association study (GWAS) was conducted on 455 soybean varieties with varying protein content to predict and validate novel genes involved in regulating protein levels in soybean seeds. Protein content data were obtained from three distinct environments, along with three environmental variables derived from oil content, which is closely related to protein levels. Genotyping was performed using the SoySNP180k BeadChip, yielding genotype data for 63,306 non-redundant single nucleotide polymorphisms (SNPs). Five multi-locus GWAS methods were employed, resulting in the identification of 81 significant quantitative trait nucleotides (QTNs), of which 37 QTNs detected across different methods and environments were further analyzed. Moreover, the simulation platform Blib was used to conduct single-crossing simulation breeding on 81 QTN loci for actual breeding prediction. Haplotype analysis based on re-sequencing data confirmed 2 genes closely linked to protein synthesis, providing a theoretical basis for breeding high-protein soybean varieties and developing molecular breeding strategies.</p>","PeriodicalId":20064,"journal":{"name":"Plant Molecular Biology","volume":"115 2","pages":"49"},"PeriodicalIF":3.9000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Molecular Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s11103-025-01576-8","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The protein content in soybean seeds serves as a crucial measure of soybean quality. Breeding high-protein varieties remains the most cost-effective and efficient approach to increasing soybean protein levels. Nevertheless, limited research has focused on identifying the genes responsible for high protein content among the diverse soybean cultivars. To address this gap, a genome-wide association study (GWAS) was conducted on 455 soybean varieties with varying protein content to predict and validate novel genes involved in regulating protein levels in soybean seeds. Protein content data were obtained from three distinct environments, along with three environmental variables derived from oil content, which is closely related to protein levels. Genotyping was performed using the SoySNP180k BeadChip, yielding genotype data for 63,306 non-redundant single nucleotide polymorphisms (SNPs). Five multi-locus GWAS methods were employed, resulting in the identification of 81 significant quantitative trait nucleotides (QTNs), of which 37 QTNs detected across different methods and environments were further analyzed. Moreover, the simulation platform Blib was used to conduct single-crossing simulation breeding on 81 QTN loci for actual breeding prediction. Haplotype analysis based on re-sequencing data confirmed 2 genes closely linked to protein synthesis, providing a theoretical basis for breeding high-protein soybean varieties and developing molecular breeding strategies.
期刊介绍:
Plant Molecular Biology is an international journal dedicated to rapid publication of original research articles in all areas of plant biology.The Editorial Board welcomes full-length manuscripts that address important biological problems of broad interest, including research in comparative genomics, functional genomics, proteomics, bioinformatics, computational biology, biochemical and regulatory networks, and biotechnology. Because space in the journal is limited, however, preference is given to publication of results that provide significant new insights into biological problems and that advance the understanding of structure, function, mechanisms, or regulation. Authors must ensure that results are of high quality and that manuscripts are written for a broad plant science audience.