Xun Gong, Hantao Zhang, Yinluo Guo, Shaoshuai Yu, Min Tang
{"title":"Chromosome-level genome assembly of Iodes seguinii and its metabonomic implications for rheumatoid arthritis treatment.","authors":"Xun Gong, Hantao Zhang, Yinluo Guo, Shaoshuai Yu, Min Tang","doi":"10.1002/tpg2.20534","DOIUrl":null,"url":null,"abstract":"<p><p>Iodes seguinii is a woody vine known for its potential therapeutic applications in treating rheumatoid arthritis (RA) due to its rich bioactive components. Here, we achieved the first chromosome-level assembly of the nuclear genome of I. seguinii using PacBio HiFi and chromatin conformation capture (Hi-C) sequencing data. The initial assembly with PacBio data produced contigs with an N50 length of 9.71 Mb, and Hi-C data anchored these contigs into 13 chromosomes, achieving a total length of 273.58 Mb, closely matching the estimated genome size. Quality assessments, including BUSCO, long terminal repeat assembly index, transcriptome mapping rates, and sequencing coverage, confirmed the high quality, completeness, and continuity of the assembly, identifying 115.28 Mb of repetitive sequences, 1062 RNA genes, and 25,270 protein-coding genes. Additionally, we assembled and annotated the 150,599 bp chloroplast genome using Illumina sequencing data, containing 121 genes including key DNA barcodes, with maturase K (matK) proving effective for species identification. Phylogenetic analysis positioned I. seguinii at the base of the Lamiales clade, identifying significant gene family expansions and contractions, particularly related to secondary metabolite synthesis and DNA damage repair. Metabolite analysis identified 84 active components in I. seguinii, including the discovery of luteolin, with 119 targets predicted for RA treatment, including core targets like AKT1, toll-like receptor 4 (TLR4), epidermal growth factor receptor (EGFR), tumor necrosis factor (TNF), TP53, NFKB1, janus kinase 2 (JAK2), BCL2, mitogen-activated protein kinase 1 (MAPK1), and spleen-associated tyrosine kinase (SYK). Key active components such as flavonoids and polyphenols with anti-inflammatory activities were highlighted. The discovery of luteolin, in particular, underscores its potential therapeutic role. These findings provide a valuable genomic resource and a scientific basis for the development and application of I. seguinii, addressing the genomic gap in the genus Iodes and the order Icacinales and underscoring the need for further research in genomics, transcriptomics, and metabolomics to fully explore its potential.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20534"},"PeriodicalIF":3.9000,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Genome","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/tpg2.20534","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Iodes seguinii is a woody vine known for its potential therapeutic applications in treating rheumatoid arthritis (RA) due to its rich bioactive components. Here, we achieved the first chromosome-level assembly of the nuclear genome of I. seguinii using PacBio HiFi and chromatin conformation capture (Hi-C) sequencing data. The initial assembly with PacBio data produced contigs with an N50 length of 9.71 Mb, and Hi-C data anchored these contigs into 13 chromosomes, achieving a total length of 273.58 Mb, closely matching the estimated genome size. Quality assessments, including BUSCO, long terminal repeat assembly index, transcriptome mapping rates, and sequencing coverage, confirmed the high quality, completeness, and continuity of the assembly, identifying 115.28 Mb of repetitive sequences, 1062 RNA genes, and 25,270 protein-coding genes. Additionally, we assembled and annotated the 150,599 bp chloroplast genome using Illumina sequencing data, containing 121 genes including key DNA barcodes, with maturase K (matK) proving effective for species identification. Phylogenetic analysis positioned I. seguinii at the base of the Lamiales clade, identifying significant gene family expansions and contractions, particularly related to secondary metabolite synthesis and DNA damage repair. Metabolite analysis identified 84 active components in I. seguinii, including the discovery of luteolin, with 119 targets predicted for RA treatment, including core targets like AKT1, toll-like receptor 4 (TLR4), epidermal growth factor receptor (EGFR), tumor necrosis factor (TNF), TP53, NFKB1, janus kinase 2 (JAK2), BCL2, mitogen-activated protein kinase 1 (MAPK1), and spleen-associated tyrosine kinase (SYK). Key active components such as flavonoids and polyphenols with anti-inflammatory activities were highlighted. The discovery of luteolin, in particular, underscores its potential therapeutic role. These findings provide a valuable genomic resource and a scientific basis for the development and application of I. seguinii, addressing the genomic gap in the genus Iodes and the order Icacinales and underscoring the need for further research in genomics, transcriptomics, and metabolomics to fully explore its potential.
期刊介绍:
The Plant Genome publishes original research investigating all aspects of plant genomics. Technical breakthroughs reporting improvements in the efficiency and speed of acquiring and interpreting plant genomics data are welcome. The editorial board gives preference to novel reports that use innovative genomic applications that advance our understanding of plant biology that may have applications to crop improvement. The journal also publishes invited review articles and perspectives that offer insight and commentary on recent advances in genomics and their potential for agronomic improvement.