{"title":"Modeling the vertebrate regulatory sequence landscape by UUATAC-seq and deep learning","authors":"Xiaoping Han, Hanyu Wu, Xueyi Wang, Daiyuan Liu, Yuting Fu, Lei Yang, Renying Wang, Peijing Zhang, Jingjing Wang, Lifeng Ma, Jizhong Mao, Lina Zhou, Siqi Wang, Xinlian Zhang, Mengmeng Jiang, Xinru Wang, Guoxia Wen, Danmei Jia, Guoji Guo","doi":"10.1016/j.cell.2025.06.020","DOIUrl":null,"url":null,"abstract":"The regulatory sequences of vertebrate genomes remain incompletely understood. To address this, we developed an ultra-throughput, ultra-sensitive single-nucleus assay for transposase-accessible chromatin using sequencing (UUATAC-seq) protocol that enables the construction of chromatin accessibility landscapes for one species in a 1-day experiment. Using UUATAC-seq, we mapped candidate <em>cis-</em>regulatory elements (cCREs) across five representative vertebrate species. Our analysis revealed that genome size differences across species influence the number but not the size of cCREs. We introduced Nvwa <em>cis</em>-regulatory element (NvwaCE), a mega-task deep-learning model designed to interpret <em>cis-</em>regulatory grammar and predict cCRE landscapes directly from genomic sequences with high precision. NvwaCE demonstrated that regulatory grammar is more conserved than nucleotide sequences and that this grammar organizes cCREs into distinct functional modules. Moreover, NvwaCE accurately predicted the effects of synthetic mutations on lineage-specific cCRE function, aligning with causal quantitative trait loci (QTLs) and genome editing results. Together, our study provides a valuable resource for decoding the vertebrate regulatory language.","PeriodicalId":9656,"journal":{"name":"Cell","volume":"3 1","pages":""},"PeriodicalIF":45.5000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.cell.2025.06.020","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The regulatory sequences of vertebrate genomes remain incompletely understood. To address this, we developed an ultra-throughput, ultra-sensitive single-nucleus assay for transposase-accessible chromatin using sequencing (UUATAC-seq) protocol that enables the construction of chromatin accessibility landscapes for one species in a 1-day experiment. Using UUATAC-seq, we mapped candidate cis-regulatory elements (cCREs) across five representative vertebrate species. Our analysis revealed that genome size differences across species influence the number but not the size of cCREs. We introduced Nvwa cis-regulatory element (NvwaCE), a mega-task deep-learning model designed to interpret cis-regulatory grammar and predict cCRE landscapes directly from genomic sequences with high precision. NvwaCE demonstrated that regulatory grammar is more conserved than nucleotide sequences and that this grammar organizes cCREs into distinct functional modules. Moreover, NvwaCE accurately predicted the effects of synthetic mutations on lineage-specific cCRE function, aligning with causal quantitative trait loci (QTLs) and genome editing results. Together, our study provides a valuable resource for decoding the vertebrate regulatory language.
期刊介绍:
Cells is an international, peer-reviewed, open access journal that focuses on cell biology, molecular biology, and biophysics. It is affiliated with several societies, including the Spanish Society for Biochemistry and Molecular Biology (SEBBM), Nordic Autophagy Society (NAS), Spanish Society of Hematology and Hemotherapy (SEHH), and Society for Regenerative Medicine (Russian Federation) (RPO).
The journal publishes research findings of significant importance in various areas of experimental biology, such as cell biology, molecular biology, neuroscience, immunology, virology, microbiology, cancer, human genetics, systems biology, signaling, and disease mechanisms and therapeutics. The primary criterion for considering papers is whether the results contribute to significant conceptual advances or raise thought-provoking questions and hypotheses related to interesting and important biological inquiries.
In addition to primary research articles presented in four formats, Cells also features review and opinion articles in its "leading edge" section, discussing recent research advancements and topics of interest to its wide readership.