De novo whole-genome assembly and annotation of Coffea arabica var. Geisha, a high-quality coffee variety from the primary origin of coffee.

IF 2.1 3区 生物学 Q3 GENETICS & HEREDITY
Juan F Medrano, Dario Cantu, Andrea Minio, Christian Dreischer, Theodore Gibbons, Jason Chin, Shiyu Chen, Allen Van Deynze, Amanda M Hulse-Kemp
{"title":"De novo whole-genome assembly and annotation of Coffea arabica var. Geisha, a high-quality coffee variety from the primary origin of coffee.","authors":"Juan F Medrano, Dario Cantu, Andrea Minio, Christian Dreischer, Theodore Gibbons, Jason Chin, Shiyu Chen, Allen Van Deynze, Amanda M Hulse-Kemp","doi":"10.1093/g3journal/jkae262","DOIUrl":null,"url":null,"abstract":"<p><p>Geisha coffee is recognized for its unique aromas and flavors and accordingly, has achieved the highest prices in the specialty coffee markets. We report the development of a chromosome-level, well-annotated, genome assembly of Coffea arabica var. Geisha. Geisha is considered an Ethiopian landrace that represents germplasm from the Ethiopian center of origin of coffee. We used a hybrid de novo assembly approach combining two long-reads single molecule sequencing technologies, Oxford Nanopore and Pacific Biosciences, together with scaffolding with Hi-C libraries. The final assembly is 1.03GB in size with BUSCO assessment of the assembly completeness of 97.7% of single-copy orthologs clusters. RNAseq and IsoSeq data were used as transcriptional experimental evidence for annotation and gene prediction revealing the presence of 47,062 gene loci encompassing 53,273 protein-coding transcripts. Comparison of the assembly to the progenitor subgenomes separated the set of chromosome sequences inherited from C. canephora from those of C. eugenioides. Corresponding orthologs between the two Arabica varieties, Geisha and Red Bourbon, had a 99.67% median identity, higher than what we observe with the progenitor assemblies (median 97.28%). Both Geisha and Red Bourbon contain a recombination event on Chromosome 10 relative to the two progenitors that must have happened before the geographical separation of the two varieties, consistent with a single allopolyploidization event giving rise to C. arabica. Broadening the availability of high-quality genome assemblies of Coffea arabica varieties, paves the way for understanding the evolution and domestication of coffee, as well as the genetic basis and environmental interactions of why a variety like Geisha is capable of producing beans with such exceptional and unique high-quality.</p>","PeriodicalId":12468,"journal":{"name":"G3: Genes|Genomes|Genetics","volume":" ","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"G3: Genes|Genomes|Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/g3journal/jkae262","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Geisha coffee is recognized for its unique aromas and flavors and accordingly, has achieved the highest prices in the specialty coffee markets. We report the development of a chromosome-level, well-annotated, genome assembly of Coffea arabica var. Geisha. Geisha is considered an Ethiopian landrace that represents germplasm from the Ethiopian center of origin of coffee. We used a hybrid de novo assembly approach combining two long-reads single molecule sequencing technologies, Oxford Nanopore and Pacific Biosciences, together with scaffolding with Hi-C libraries. The final assembly is 1.03GB in size with BUSCO assessment of the assembly completeness of 97.7% of single-copy orthologs clusters. RNAseq and IsoSeq data were used as transcriptional experimental evidence for annotation and gene prediction revealing the presence of 47,062 gene loci encompassing 53,273 protein-coding transcripts. Comparison of the assembly to the progenitor subgenomes separated the set of chromosome sequences inherited from C. canephora from those of C. eugenioides. Corresponding orthologs between the two Arabica varieties, Geisha and Red Bourbon, had a 99.67% median identity, higher than what we observe with the progenitor assemblies (median 97.28%). Both Geisha and Red Bourbon contain a recombination event on Chromosome 10 relative to the two progenitors that must have happened before the geographical separation of the two varieties, consistent with a single allopolyploidization event giving rise to C. arabica. Broadening the availability of high-quality genome assemblies of Coffea arabica varieties, paves the way for understanding the evolution and domestication of coffee, as well as the genetic basis and environmental interactions of why a variety like Geisha is capable of producing beans with such exceptional and unique high-quality.

咖啡原产地的优质咖啡品种 Coffea arabica var. Geisha 的全新全基因组组装和注释。
艺妓咖啡以其独特的香气和风味而闻名,并因此在特种咖啡市场上获得了最高的价格。我们报告了在染色体水平上对阿拉伯咖啡(Coffea arabica var.Geisha 被认为是埃塞俄比亚的一个地方品种,代表了来自埃塞俄比亚咖啡原产地中心的种质。我们采用了一种混合从头组装方法,结合了牛津纳米孔公司和太平洋生物科学公司的两种长读数单分子测序技术,并使用 Hi-C 文库搭建了脚手架。最终的组装结果大小为 1.03GB,经 BUSCO 评估,97.7% 的单拷贝同源物簇组装完整。RNAseq 和 IsoSeq 数据被用作注释和基因预测的转录实验证据,揭示了包含 53,273 个蛋白编码转录本的 47,062 个基因位点。通过与祖先亚基因组进行比较,将从 C. canephora 和 C. eugenioides 继承的染色体序列集区分开来。两个阿拉比卡品种(Geisha 和 Red Bourbon)之间的对应直向同源物的中位同一性为 99.67%,高于我们观察到的原种基因组的同一性(中位数为 97.28%)。相对于两个原种,Geisha 和 Red Bourbon 在 10 号染色体上都包含一个重组事件,该事件一定发生在两个品种地理分离之前,这与产生阿拉伯咖啡豆的单一异源多倍体事件一致。扩大阿拉伯咖啡品种高质量基因组组装的可用性,为了解咖啡的进化和驯化,以及像 Geisha 这样的品种为什么能够生产出具有如此卓越和独特品质的咖啡豆的遗传基础和环境相互作用铺平了道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
G3: Genes|Genomes|Genetics
G3: Genes|Genomes|Genetics GENETICS & HEREDITY-
CiteScore
5.10
自引率
3.80%
发文量
305
审稿时长
3-8 weeks
期刊介绍: G3: Genes, Genomes, Genetics provides a forum for the publication of high‐quality foundational research, particularly research that generates useful genetic and genomic information such as genome maps, single gene studies, genome‐wide association and QTL studies, as well as genome reports, mutant screens, and advances in methods and technology. The Editorial Board of G3 believes that rapid dissemination of these data is the necessary foundation for analysis that leads to mechanistic insights. G3, published by the Genetics Society of America, meets the critical and growing need of the genetics community for rapid review and publication of important results in all areas of genetics. G3 offers the opportunity to publish the puzzling finding or to present unpublished results that may not have been submitted for review and publication due to a perceived lack of a potential high-impact finding. G3 has earned the DOAJ Seal, which is a mark of certification for open access journals, awarded by DOAJ to journals that achieve a high level of openness, adhere to Best Practice and high publishing standards.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信