Codon usage patterns and genomic variation analysis of chloroplast genomes provides new insights into the evolution of Aroideae.
Authors: Xinbi Jia, Jiaqi Wei, Yuewen Chen, Chenghong Zeng, Chan Deng, Pengchen Zeng, Yufei Tang, Qinghong Zhou, Yingjin Huang, Qianglong Zhu
Journal: Scientific Reports, volume 15, issue 1, article 4333 (Journal Article; impact factor 3.9; JCR Q1, Multidisciplinary Sciences)
Published: 2025-02-05
DOI: 10.1038/s41598-025-88244-5 (https://doi.org/10.1038/s41598-025-88244-5)
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11799533/pdf/
Citations: 0
Abstract
Aroideae is an important subfamily of Araceae and contains many plants with medicinal and edible value. Because their phenotypic traits are polymorphic, Aroideae species are difficult to identify and classify accurately on the basis of morphology alone. The chloroplast genome (CPG) is useful for studying plant taxonomy and phylogeny, and the analysis of codon usage bias (CUB) in CPGs provides further insights into the intricate phylogenetic relationships among Aroideae. The results showed that the third codon position of chloroplast genome coding sequences in Aroideae was rich in A and T, with a GC content of 37.91%. The ENC-plot and PR2-plot revealed that the codon usage bias of Aroideae was influenced by multiple factors, with natural selection as the dominant factor. Thirteen to twenty optimal codons ending in A/T were identified in 61 Aroideae species. Additionally, comparative analysis of the CPGs revealed that the two single-copy regions and the non-coding regions were variable in Aroideae. Eight highly divergent regions (Pi > 0.064) were identified (ndhF, rpl32, ccsA, ndhE, ndhG, ndhF-rpl32, ccsA-ndhD, and ndhE-ndhG), of which ndhE has the potential to serve as a reliable DNA marker to discriminate chloroplasts in the Aroideae subfamily. Furthermore, the maximum likelihood-based phylogenetic trees constructed from complete chloroplast genomes and from protein-coding sequences presented similar topologies. Principal component clustering analysis based on relative synonymous codon usage (RSCU) values revealed that Calla clearly diverged from Montrichardia and Anubias, and that Alocasia was closer to Colocasieae than to Arisaemateae. These findings suggest that the use of RSCU for clustering analysis could offer new theoretical support for species classification and evolution. Our research could provide a theoretical foundation for chloroplast genetic engineering, taxonomy, and the study of phylogenetic relationships in Aroideae.
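To make the RSCU and third-position GC measures mentioned in the abstract concrete, the following is a minimal Python sketch, not the authors' pipeline. It assumes the conventional definition of RSCU (observed codon count divided by the count expected if all synonymous codons for that amino acid were used equally), an in-frame coding sequence, and the standard codon assignments, which plastid translation table 11 shares. The function names `rscu` and `gc3` and the toy sequence are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard codon assignments, enumerated in TCAG order (assumption: the
# plastid code, NCBI table 11, uses the same codon-to-amino-acid mapping).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def split_codons(cds):
    """Split an in-frame coding sequence into recognised codons."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    return [c for c in codons if c in CODON_TABLE]  # drops ambiguous codons


def rscu(cds):
    """Return {codon: RSCU} for amino acids with two or more codons."""
    counts = Counter(split_codons(cds))
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        families[aa].append(codon)

    values = {}
    for aa, family in families.items():
        if aa == "*" or len(family) < 2:
            continue  # skip stop codons and single-codon amino acids (Met, Trp)
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        for codon in family:
            # observed / expected, where expected = total / family size
            values[codon] = counts[codon] * len(family) / total
    return values


def gc3(cds):
    """Fraction of codons with G or C at the third position."""
    codons = split_codons(cds)
    if not codons:
        return 0.0
    return sum(c[2] in "GC" for c in codons) / len(codons)


if __name__ == "__main__":
    toy = "GCTGCTGCTGCAAAAAAG"  # hypothetical toy sequence, not real data
    print(rscu(toy)["GCT"], gc3(toy))
```

An RSCU value above 1 means a codon is used more often than expected under equal synonymous usage, and a value below 1 means it is used less often. Per-species RSCU vectors of this kind are the sort of input a principal component clustering could take.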