The pineapple reference genome: Telomere-to-telomere assembly, manually curated annotation, and comparative analysis

IF 9.3 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Junting Feng, Wei Zhang, Chengjie Chen, Yinlong Liang, Tangxiu Li, Ya Wu, Hui Liu, Jing Wu, Wenqiu Lin, Jiawei Li, Yehua He, Junhu He, Aiping Luan
{"title":"The pineapple reference genome: Telomere-to-telomere assembly, manually curated annotation, and comparative analysis","authors":"Junting Feng,&nbsp;Wei Zhang,&nbsp;Chengjie Chen,&nbsp;Yinlong Liang,&nbsp;Tangxiu Li,&nbsp;Ya Wu,&nbsp;Hui Liu,&nbsp;Jing Wu,&nbsp;Wenqiu Lin,&nbsp;Jiawei Li,&nbsp;Yehua He,&nbsp;Junhu He,&nbsp;Aiping Luan","doi":"10.1111/jipb.13748","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Pineapple is the third most crucial tropical fruit worldwide and available in five varieties. Genomes of different pineapple varieties have been released to date; however, none of them are complete, with all exhibiting substantial gaps and representing only two of the five pineapple varieties. This significantly hinders the advancement of pineapple breeding efforts. In this study, we sequenced the genomes of three varieties: a wild pineapple variety, a fiber pineapple variety, and a globally cultivated edible pineapple variety. We constructed the first gap-free reference genome (Ref) for pineapple. By consolidating multiple sources of evidence and manually revising each gene structure annotation, we identified 26,656 protein-coding genes. The BUSCO evaluation indicated a completeness of 99.2%, demonstrating the high quality of the gene structure annotations in this genome. Utilizing these resources, we identified 7,209 structural variations across the three varieties. Approximately 30.8% of pineapple genes were located within ±5 kb of structural variations, including 30 genes associated with anthocyanin synthesis. Further analysis and functional experiments demonstrated that the high expression of <i>AcMYB528</i> aligns with the accumulation of anthocyanins in the leaves, both of which may be affected by a 1.9-kb insertion fragment. In addition, we developed the Ananas Genome Database, which offers data browsing, retrieval, analysis, and download functions. The construction of this database addresses the lack of pineapple genome resource databases. In summary, we acquired a seamless pineapple reference genome with high-quality gene structure annotations, providing a solid foundation for pineapple genomics and a valuable reference for pineapple breeding.</p></div>","PeriodicalId":195,"journal":{"name":"Journal of Integrative Plant Biology","volume":"66 10","pages":"2208-2225"},"PeriodicalIF":9.3000,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Integrative Plant Biology","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jipb.13748","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Pineapple is the third most crucial tropical fruit worldwide and available in five varieties. Genomes of different pineapple varieties have been released to date; however, none of them are complete, with all exhibiting substantial gaps and representing only two of the five pineapple varieties. This significantly hinders the advancement of pineapple breeding efforts. In this study, we sequenced the genomes of three varieties: a wild pineapple variety, a fiber pineapple variety, and a globally cultivated edible pineapple variety. We constructed the first gap-free reference genome (Ref) for pineapple. By consolidating multiple sources of evidence and manually revising each gene structure annotation, we identified 26,656 protein-coding genes. The BUSCO evaluation indicated a completeness of 99.2%, demonstrating the high quality of the gene structure annotations in this genome. Utilizing these resources, we identified 7,209 structural variations across the three varieties. Approximately 30.8% of pineapple genes were located within ±5 kb of structural variations, including 30 genes associated with anthocyanin synthesis. Further analysis and functional experiments demonstrated that the high expression of AcMYB528 aligns with the accumulation of anthocyanins in the leaves, both of which may be affected by a 1.9-kb insertion fragment. In addition, we developed the Ananas Genome Database, which offers data browsing, retrieval, analysis, and download functions. The construction of this database addresses the lack of pineapple genome resource databases. In summary, we acquired a seamless pineapple reference genome with high-quality gene structure annotations, providing a solid foundation for pineapple genomics and a valuable reference for pineapple breeding.

Abstract Image

菠萝参考基因组:端粒到端粒组装、人工标注和比较分析。
菠萝是全球第三大重要热带水果,有五个品种。迄今为止,已发布了不同菠萝品种的基因组,但没有一个是完整的,所有基因组都有很大差距,仅代表五个菠萝品种中的两个。这极大地阻碍了菠萝育种工作的进展。在这项研究中,我们对三个品种的基因组进行了测序:一个野生菠萝品种、一个纤维菠萝品种和一个全球栽培的食用菠萝品种。我们为菠萝构建了第一个无间隙参考基因组(Ref)。通过整合多种证据来源并人工修订每个基因结构注释,我们确定了 26656 个蛋白质编码基因。BUSCO 评估显示其完整性为 99.2%,表明该基因组中基因结构注释的质量很高。利用这些资源,我们在三个品种中发现了 7,209 个结构变异。约 30.8% 的菠萝基因位于结构变异的 ±5 kb 范围内,其中包括 30 个与花青素合成相关的基因。进一步的分析和功能实验表明,AcMYB528 的高表达与花青素在叶片中的积累相一致,两者都可能受到 1.9 kb 插入片段的影响。此外,我们还开发了 Ananas 基因组数据库,该数据库提供数据浏览、检索、分析和下载功能。该数据库的建立解决了菠萝基因组资源数据库缺乏的问题。总之,我们获得了一个具有高质量基因结构注释的无缝菠萝参考基因组,为菠萝基因组学奠定了坚实的基础,也为菠萝育种提供了宝贵的参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Integrative Plant Biology
Journal of Integrative Plant Biology 生物-生化与分子生物学
CiteScore
18.00
自引率
5.30%
发文量
220
审稿时长
3 months
期刊介绍: Journal of Integrative Plant Biology is a leading academic journal reporting on the latest discoveries in plant biology.Enjoy the latest news and developments in the field, understand new and improved methods and research tools, and explore basic biological questions through reproducible experimental design, using genetic, biochemical, cell and molecular biological methods, and statistical analyses.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信