Mitchell Cummins, Cadel Watson, Richard J Edwards, John S Mattick
{"title":"The evolution of ultraconserved elements in vertebrates.","authors":"Mitchell Cummins, Cadel Watson, Richard J Edwards, John S Mattick","doi":"10.1093/molbev/msae146","DOIUrl":null,"url":null,"abstract":"<p><p>Ultraconserved elements (UCEs) were discovered two decades ago, arbitrarily defined as sequences that are identical over a length ≥200 bp in the human, mouse and rat genomes. The definition was subsequently extended to sequences ≥100 bp identical in at least three of five mammalian genomes (including dog and cow), and shown to have undergone rapid expansion from ancestors in fish and strong negative selection in birds and mammals. Since then, many more genomes have become available, allowing better definition and more thorough examination of UCE distribution and evolutionary history. We developed a fast and flexible analytical pipeline for identifying UCEs in multiple genomes, dedUCE, which allows manipulation of minimum length, sequence identity, and number of species with a detectable UCE according to specified parameters. We suggest an updated definition of UCEs as sequences ≥100 bp and ≥97% sequence identity in ≥50% of placental mammal orders (12813 UCEs). By mapping UCEs to ∼200 species we find that placental UCEs appeared early in vertebrate evolution, well before land colonisation, suggesting the evolutionary pressures driving UCE selection were present in aquatic environments in the Cambrian-Devonian periods. Most (>90%) UCEs likely appeared after the divergence of gnathostomes from jawless predecessors, were largely established in sequence identity by early Sarcopterygii evolution - before the divergence of lobe-finned fishes from tetrapods - and became near fixed in the amniotes. UCEs are mainly located in the introns of protein-coding and non-coding genes involved in neurological and skeletomuscular development, enriched in regulatory elements, and dynamically expressed throughout embryonic development.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":null,"pages":null},"PeriodicalIF":11.0000,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular biology and evolution","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/molbev/msae146","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Ultraconserved elements (UCEs) were discovered two decades ago, arbitrarily defined as sequences that are identical over a length ≥200 bp in the human, mouse and rat genomes. The definition was subsequently extended to sequences ≥100 bp identical in at least three of five mammalian genomes (including dog and cow), and shown to have undergone rapid expansion from ancestors in fish and strong negative selection in birds and mammals. Since then, many more genomes have become available, allowing better definition and more thorough examination of UCE distribution and evolutionary history. We developed a fast and flexible analytical pipeline for identifying UCEs in multiple genomes, dedUCE, which allows manipulation of minimum length, sequence identity, and number of species with a detectable UCE according to specified parameters. We suggest an updated definition of UCEs as sequences ≥100 bp and ≥97% sequence identity in ≥50% of placental mammal orders (12813 UCEs). By mapping UCEs to ∼200 species we find that placental UCEs appeared early in vertebrate evolution, well before land colonisation, suggesting the evolutionary pressures driving UCE selection were present in aquatic environments in the Cambrian-Devonian periods. Most (>90%) UCEs likely appeared after the divergence of gnathostomes from jawless predecessors, were largely established in sequence identity by early Sarcopterygii evolution - before the divergence of lobe-finned fishes from tetrapods - and became near fixed in the amniotes. UCEs are mainly located in the introns of protein-coding and non-coding genes involved in neurological and skeletomuscular development, enriched in regulatory elements, and dynamically expressed throughout embryonic development.
期刊介绍:
Molecular Biology and Evolution
Journal Overview:
Publishes research at the interface of molecular (including genomics) and evolutionary biology
Considers manuscripts containing patterns, processes, and predictions at all levels of organization: population, taxonomic, functional, and phenotypic
Interested in fundamental discoveries, new and improved methods, resources, technologies, and theories advancing evolutionary research
Publishes balanced reviews of recent developments in genome evolution and forward-looking perspectives suggesting future directions in molecular evolution applications.