{"title":"CT-TADB predicts TAD boundaries without Hi-C by integrating DNA sequences and epigenomic features.","authors":"Tong Chen, Shuaibin Wang, Yuyu Jin, Yuan Yuan, Zhen Liang, Yin Shen","doi":"10.1038/s41540-026-00734-y","DOIUrl":null,"url":null,"abstract":"<p><p>Topologically associating domains (TADs) are fundamental units of three-dimensional genome organization, and their boundaries play important roles in gene regulation and genomic stability. However, accurate computational prediction of TAD boundaries remains challenging because of the complex interplay between DNA sequence and epigenomic regulatory signals. Here we present CT-TADB, a hybrid CNN-Transformer framework that integrates DNA sequence with histone modification and CTCF binding signals to predict TAD boundaries without requiring Hi-C data. Trained on six human cell lines, CT-TADB achieved AUC values of 0.932-0.950, demonstrating competitive or superior performance relative to existing multi-modal methods. The model maintained stable performance on strictly independent and dataset-specific boundary subsets, and exhibited robust cross-cell-type transferability and cross-species generalization between human and mouse (AUC > 0.80 for human-to-mouse transfer). Feature attribution analysis identified CTCF as the dominant boundary-associated factor, supported by active chromatin marks, and quantitative attention analysis revealed significant long-range CTCF-mediated dependencies (p < 0.001). CT-TADB further identified a clinically validated PITX2-associated TAD boundary linked to cardiac arrhythmia and captured condition-specific boundary dynamics. By leveraging routinely available epigenomic data, CT-TADB provides a practical and complementary framework for investigating chromatin architecture across diverse biological contexts.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":" ","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2026-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Systems Biology and Applications","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s41540-026-00734-y","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Topologically associating domains (TADs) are fundamental units of three-dimensional genome organization, and their boundaries play important roles in gene regulation and genomic stability. However, accurate computational prediction of TAD boundaries remains challenging because of the complex interplay between DNA sequence and epigenomic regulatory signals. Here we present CT-TADB, a hybrid CNN-Transformer framework that integrates DNA sequence with histone modification and CTCF binding signals to predict TAD boundaries without requiring Hi-C data. Trained on six human cell lines, CT-TADB achieved AUC values of 0.932-0.950, demonstrating competitive or superior performance relative to existing multi-modal methods. The model maintained stable performance on strictly independent and dataset-specific boundary subsets, and exhibited robust cross-cell-type transferability and cross-species generalization between human and mouse (AUC > 0.80 for human-to-mouse transfer). Feature attribution analysis identified CTCF as the dominant boundary-associated factor, supported by active chromatin marks, and quantitative attention analysis revealed significant long-range CTCF-mediated dependencies (p < 0.001). CT-TADB further identified a clinically validated PITX2-associated TAD boundary linked to cardiac arrhythmia and captured condition-specific boundary dynamics. By leveraging routinely available epigenomic data, CT-TADB provides a practical and complementary framework for investigating chromatin architecture across diverse biological contexts.
期刊介绍:
npj Systems Biology and Applications is an online Open Access journal dedicated to publishing the premier research that takes a systems-oriented approach. The journal aims to provide a forum for the presentation of articles that help define this nascent field, as well as those that apply the advances to wider fields. We encourage studies that integrate, or aid the integration of, data, analyses and insight from molecules to organisms and broader systems. Important areas of interest include not only fundamental biological systems and drug discovery, but also applications to health, medical practice and implementation, big data, biotechnology, food science, human behaviour, broader biological systems and industrial applications of systems biology.
We encourage all approaches, including network biology, application of control theory to biological systems, computational modelling and analysis, comprehensive and/or high-content measurements, theoretical, analytical and computational studies of system-level properties of biological systems and computational/software/data platforms enabling such studies.