G. Song, Qiuhong Yang, Dongyun Zuo, Hailiang Cheng, Youping Zhang, Qiaolian Wang, J. Ashraf, Xiaoxu Feng, Simin Li, Xiaoqing Chen, Shang Liu
{"title":"利用基于hi - c的接近引导组装改进的雷蒙地棉基因组","authors":"G. Song, Qiuhong Yang, Dongyun Zuo, Hailiang Cheng, Youping Zhang, Qiaolian Wang, J. Ashraf, Xiaoxu Feng, Simin Li, Xiaoqing Chen, Shang Liu","doi":"10.21203/rs.3.rs-63005/v1","DOIUrl":null,"url":null,"abstract":"Introduction Genome sequence plays an important role in both basic and applied studies. Gossypium raimondii , the putative contributor of the D subgenome of upland cotton ( G. hirsutum ), highlights the need to improve the genome quality rapidly and efficiently. Methods We performed Hi-C sequencing of G. raimondii and reassembled its genome based on a set of new Hi-C data and previously published scaffolds. We also compared the reassembled genome sequence with the previously published G. raimondii genomes for gene and genome sequence collinearity. Result A total of 98.42% of scaffold sequences were clustered successfully, among which 99.72% of the clustered sequences were ordered and 99.92% of the ordered sequences were oriented with high-quality. Further evaluation of results by heat-map and collinearity analysis revealed that the current reassembled genome is significantly improved than the previous one (Nat Genet 44:98–1103, 2012). Conclusion This improvement in G. raimondii genome not only provides a better reference to increase study efficiency but also offers a new way to assemble cotton genomes. Furthermore, Hi-C data of G. raimondii may be used for 3D structure research or regulating analysis.","PeriodicalId":15400,"journal":{"name":"Journal of Cotton Research","volume":"1 1","pages":"1-7"},"PeriodicalIF":3.1000,"publicationDate":"2020-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved Gossypium raimondii genome using a Hi-C-based proximity-guided assembly\",\"authors\":\"G. Song, Qiuhong Yang, Dongyun Zuo, Hailiang Cheng, Youping Zhang, Qiaolian Wang, J. Ashraf, Xiaoxu Feng, Simin Li, Xiaoqing Chen, Shang Liu\",\"doi\":\"10.21203/rs.3.rs-63005/v1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction Genome sequence plays an important role in both basic and applied studies. Gossypium raimondii , the putative contributor of the D subgenome of upland cotton ( G. hirsutum ), highlights the need to improve the genome quality rapidly and efficiently. Methods We performed Hi-C sequencing of G. raimondii and reassembled its genome based on a set of new Hi-C data and previously published scaffolds. We also compared the reassembled genome sequence with the previously published G. raimondii genomes for gene and genome sequence collinearity. Result A total of 98.42% of scaffold sequences were clustered successfully, among which 99.72% of the clustered sequences were ordered and 99.92% of the ordered sequences were oriented with high-quality. Further evaluation of results by heat-map and collinearity analysis revealed that the current reassembled genome is significantly improved than the previous one (Nat Genet 44:98–1103, 2012). Conclusion This improvement in G. raimondii genome not only provides a better reference to increase study efficiency but also offers a new way to assemble cotton genomes. Furthermore, Hi-C data of G. raimondii may be used for 3D structure research or regulating analysis.\",\"PeriodicalId\":15400,\"journal\":{\"name\":\"Journal of Cotton Research\",\"volume\":\"1 1\",\"pages\":\"1-7\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2020-08-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Cotton Research\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.21203/rs.3.rs-63005/v1\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PLANT SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cotton Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.21203/rs.3.rs-63005/v1","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
Improved Gossypium raimondii genome using a Hi-C-based proximity-guided assembly
Introduction Genome sequence plays an important role in both basic and applied studies. Gossypium raimondii , the putative contributor of the D subgenome of upland cotton ( G. hirsutum ), highlights the need to improve the genome quality rapidly and efficiently. Methods We performed Hi-C sequencing of G. raimondii and reassembled its genome based on a set of new Hi-C data and previously published scaffolds. We also compared the reassembled genome sequence with the previously published G. raimondii genomes for gene and genome sequence collinearity. Result A total of 98.42% of scaffold sequences were clustered successfully, among which 99.72% of the clustered sequences were ordered and 99.92% of the ordered sequences were oriented with high-quality. Further evaluation of results by heat-map and collinearity analysis revealed that the current reassembled genome is significantly improved than the previous one (Nat Genet 44:98–1103, 2012). Conclusion This improvement in G. raimondii genome not only provides a better reference to increase study efficiency but also offers a new way to assemble cotton genomes. Furthermore, Hi-C data of G. raimondii may be used for 3D structure research or regulating analysis.