Na Dong, Junshuai Chen, Mao Chai, Jiapeng Han, Weipeng Wang, Song Yang, Peipei Zhang, Baohong Zhang, Qinglian Wang, Qinghua Yang
{"title":"利用机器学习技术研究纤维性能优良的巴贝多BMC79品种30个性状的遗传基础","authors":"Na Dong, Junshuai Chen, Mao Chai, Jiapeng Han, Weipeng Wang, Song Yang, Peipei Zhang, Baohong Zhang, Qinglian Wang, Qinghua Yang","doi":"10.1016/j.indcrop.2025.121977","DOIUrl":null,"url":null,"abstract":"With economic development and rising living standards, the demand for high-quality cotton fiber is increasing. Understanding the genetic basis of key traits for cotton with super fiber is crucial for breeding new cultivars. Here, we conducted QTL mapping and candidate gene identification for 30 important agronomic traits in the early-maturing, high-quality fiber cultivar <em>Gossypium barbadense</em> BMC79. We crossed BMC79 with upland cotton XLZ14 to generate an F<sub>2</sub> population of 303 families and constructed a genetic map spanning 4026.30 cM with an average inter-bin distance of 0.31 cM. QTL analysis, integrated with machine learning, identified 55 QTLs for yield, fiber quality, and growth period related traits. Notably, QTLs including genes for fiber length (A01), lint percentage (A06), plant height (D11), fiber strength (D11), fiber uniformity (D12), and early maturity (D07), showed high phenotypic variance explained. Machine learning predicted several key candidate genes, such as <em>Gh_A01G162500</em> (fiber length), <em>Gh_A06G112000</em> (lint percentage), <em>Gh_D11G351100</em> (fiber strength), and <em>Gh_D07G112500</em> (flowering time). Importantly, Virus-Induced Gene Silencing (VIGS) validation showed that silencing <em>Gh_D11G351100</em> significantly reduced fiber strength and length, confirming its role in fiber development. Our study provides valuable insights into the genetic basis of high-fiber-quality cotton varieties and offers important targets and references for the development of new cotton cultivars.","PeriodicalId":13581,"journal":{"name":"Industrial Crops and Products","volume":"97 1","pages":""},"PeriodicalIF":6.2000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrating machine learning to elucidate the genetic basis of 30 traits in G. barbadense BMC79 cultivar with superior fiber properties\",\"authors\":\"Na Dong, Junshuai Chen, Mao Chai, Jiapeng Han, Weipeng Wang, Song Yang, Peipei Zhang, Baohong Zhang, Qinglian Wang, Qinghua Yang\",\"doi\":\"10.1016/j.indcrop.2025.121977\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With economic development and rising living standards, the demand for high-quality cotton fiber is increasing. Understanding the genetic basis of key traits for cotton with super fiber is crucial for breeding new cultivars. Here, we conducted QTL mapping and candidate gene identification for 30 important agronomic traits in the early-maturing, high-quality fiber cultivar <em>Gossypium barbadense</em> BMC79. We crossed BMC79 with upland cotton XLZ14 to generate an F<sub>2</sub> population of 303 families and constructed a genetic map spanning 4026.30 cM with an average inter-bin distance of 0.31 cM. QTL analysis, integrated with machine learning, identified 55 QTLs for yield, fiber quality, and growth period related traits. Notably, QTLs including genes for fiber length (A01), lint percentage (A06), plant height (D11), fiber strength (D11), fiber uniformity (D12), and early maturity (D07), showed high phenotypic variance explained. Machine learning predicted several key candidate genes, such as <em>Gh_A01G162500</em> (fiber length), <em>Gh_A06G112000</em> (lint percentage), <em>Gh_D11G351100</em> (fiber strength), and <em>Gh_D07G112500</em> (flowering time). Importantly, Virus-Induced Gene Silencing (VIGS) validation showed that silencing <em>Gh_D11G351100</em> significantly reduced fiber strength and length, confirming its role in fiber development. Our study provides valuable insights into the genetic basis of high-fiber-quality cotton varieties and offers important targets and references for the development of new cotton cultivars.\",\"PeriodicalId\":13581,\"journal\":{\"name\":\"Industrial Crops and Products\",\"volume\":\"97 1\",\"pages\":\"\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Industrial Crops and Products\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1016/j.indcrop.2025.121977\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURAL ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Industrial Crops and Products","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1016/j.indcrop.2025.121977","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
Integrating machine learning to elucidate the genetic basis of 30 traits in G. barbadense BMC79 cultivar with superior fiber properties
With economic development and rising living standards, the demand for high-quality cotton fiber is increasing. Understanding the genetic basis of key traits for cotton with super fiber is crucial for breeding new cultivars. Here, we conducted QTL mapping and candidate gene identification for 30 important agronomic traits in the early-maturing, high-quality fiber cultivar Gossypium barbadense BMC79. We crossed BMC79 with upland cotton XLZ14 to generate an F2 population of 303 families and constructed a genetic map spanning 4026.30 cM with an average inter-bin distance of 0.31 cM. QTL analysis, integrated with machine learning, identified 55 QTLs for yield, fiber quality, and growth period related traits. Notably, QTLs including genes for fiber length (A01), lint percentage (A06), plant height (D11), fiber strength (D11), fiber uniformity (D12), and early maturity (D07), showed high phenotypic variance explained. Machine learning predicted several key candidate genes, such as Gh_A01G162500 (fiber length), Gh_A06G112000 (lint percentage), Gh_D11G351100 (fiber strength), and Gh_D07G112500 (flowering time). Importantly, Virus-Induced Gene Silencing (VIGS) validation showed that silencing Gh_D11G351100 significantly reduced fiber strength and length, confirming its role in fiber development. Our study provides valuable insights into the genetic basis of high-fiber-quality cotton varieties and offers important targets and references for the development of new cotton cultivars.
期刊介绍:
Industrial Crops and Products is an International Journal publishing academic and industrial research on industrial (defined as non-food/non-feed) crops and products. Papers concern both crop-oriented and bio-based materials from crops-oriented research, and should be of interest to an international audience, hypothesis driven, and where comparisons are made statistics performed.