G. Song, Qiuhong Yang, Dongyun Zuo, Hailiang Cheng, Youping Zhang, Qiaolian Wang, J. Ashraf, Xiaoxu Feng, Simin Li, Xiaoqing Chen, Shang Liu
{"title":"Improved Gossypium raimondii genome using a Hi-C-based proximity-guided assembly","authors":"G. Song, Qiuhong Yang, Dongyun Zuo, Hailiang Cheng, Youping Zhang, Qiaolian Wang, J. Ashraf, Xiaoxu Feng, Simin Li, Xiaoqing Chen, Shang Liu","doi":"10.21203/rs.3.rs-63005/v1","DOIUrl":null,"url":null,"abstract":"Introduction Genome sequence plays an important role in both basic and applied studies. Gossypium raimondii , the putative contributor of the D subgenome of upland cotton ( G. hirsutum ), highlights the need to improve the genome quality rapidly and efficiently. Methods We performed Hi-C sequencing of G. raimondii and reassembled its genome based on a set of new Hi-C data and previously published scaffolds. We also compared the reassembled genome sequence with the previously published G. raimondii genomes for gene and genome sequence collinearity. Result A total of 98.42% of scaffold sequences were clustered successfully, among which 99.72% of the clustered sequences were ordered and 99.92% of the ordered sequences were oriented with high-quality. Further evaluation of results by heat-map and collinearity analysis revealed that the current reassembled genome is significantly improved than the previous one (Nat Genet 44:98–1103, 2012). Conclusion This improvement in G. raimondii genome not only provides a better reference to increase study efficiency but also offers a new way to assemble cotton genomes. Furthermore, Hi-C data of G. raimondii may be used for 3D structure research or regulating analysis.","PeriodicalId":15400,"journal":{"name":"Journal of Cotton Research","volume":"1 1","pages":"1-7"},"PeriodicalIF":3.1000,"publicationDate":"2020-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cotton Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.21203/rs.3.rs-63005/v1","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction Genome sequence plays an important role in both basic and applied studies. Gossypium raimondii , the putative contributor of the D subgenome of upland cotton ( G. hirsutum ), highlights the need to improve the genome quality rapidly and efficiently. Methods We performed Hi-C sequencing of G. raimondii and reassembled its genome based on a set of new Hi-C data and previously published scaffolds. We also compared the reassembled genome sequence with the previously published G. raimondii genomes for gene and genome sequence collinearity. Result A total of 98.42% of scaffold sequences were clustered successfully, among which 99.72% of the clustered sequences were ordered and 99.92% of the ordered sequences were oriented with high-quality. Further evaluation of results by heat-map and collinearity analysis revealed that the current reassembled genome is significantly improved than the previous one (Nat Genet 44:98–1103, 2012). Conclusion This improvement in G. raimondii genome not only provides a better reference to increase study efficiency but also offers a new way to assemble cotton genomes. Furthermore, Hi-C data of G. raimondii may be used for 3D structure research or regulating analysis.