Deciphering Sequence Determinants of Zygotic Genome Activation Genes: Insights From Machine Learning and the ZGAExplorer Platform
Authors: Jixiang Xing, Siqi Yang, Yuchao Liang, Pengwei Hu, Bingjie Dai, Hanshuang Li, Yongqiang Xing, Yongchun Zuo
Journal: Cell Proliferation, e70039 (Journal Article)
Published: 2025-04-18
DOI: 10.1111/cpr.70039 (https://doi.org/10.1111/cpr.70039)
Impact factor: 5.9; JCR: Q2 (Cell Biology)
Citations: 0
Abstract
The mammalian life cycle begins with the transfer of genetic control from the maternal to the embryonic genome during zygotic genome activation (ZGA), a step that is pivotal for development. Nevertheless, understanding of the conservation of the genes and transcription factors (TFs) that underlie mammalian ZGA remains limited. Here, we compiled a comprehensive set of ZGA genes from mice, humans, pigs, cattle and goats, including Nr5a2 and TPRX1/2. Comparative analyses identified 111 homologous genes and revealed a conserved genetic coding region, suggesting potential sequence preferences for ZGA genes. Notably, an interpretable machine learning model based on k-mer core features showed excellent performance in predicting ZGA genes (area under the ROC curve [AUC] > 0.81), revealing abundant and intricate 6-base sequence-specific patterns and potential binding TFs, including motifs from NR5A2 and TPRX1/2. Further analysis demonstrated that gene sequence features and epigenetic modification features play equally important roles in regulating ZGA genes. Finally, we developed the ZGAExplorer platform to provide a resource for screening ZGA genes. Our study unravels the sequence determinants of ZGA genes across species through multi-omics data integration and machine learning, yielding insights into ZGA regulatory mechanisms and embryonic developmental arrest.
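To make the phrase "k-mer features" concrete, the following is a minimal sketch of how a DNA sequence might be turned into 6-mer frequency features of the kind a classifier could consume. The function name `kmer_features`, the toy sequence, and the choice to normalize by the number of valid windows are assumptions for illustration only; the abstract does not specify the authors' exact feature encoding, feature selection, or model.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGT"


def kmer_features(sequence, k=6):
    """Return the relative frequency of every possible k-mer in a DNA sequence.

    Windows containing characters other than A/C/G/T (for example N) are skipped.
    Illustrative encoding only; not taken from the paper.
    """
    seq = sequence.upper()
    valid = set(ALPHABET)
    # Slide a window of length k across the sequence, one base at a time.
    windows = (seq[i:i + k] for i in range(len(seq) - k + 1))
    counts = Counter(w for w in windows if set(w) <= valid)
    total = sum(counts.values())
    # Fixed vocabulary of 4**k k-mers so every sequence maps to the same features.
    vocabulary = ("".join(p) for p in product(ALPHABET, repeat=k))
    return {kmer: (counts[kmer] / total if total else 0.0) for kmer in vocabulary}


# Hypothetical toy sequence, not real genomic data.
toy = "ACGTACGTAC"
features = kmer_features(toy)
print(len(features))       # number of 6-mer features
print(features["ACGTAC"])  # frequency of one 6-mer in the toy sequence
```

With k = 6 the vocabulary has 4^6 = 4096 entries, so each gene sequence becomes a fixed-length vector regardless of its length. Such vectors could then be passed to any standard classifier and evaluated by AUC.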
About the journal:
Cell Proliferation
Focus:
Devoted to studies into all aspects of cell proliferation and differentiation.
Covers normal and abnormal states.
Explores control systems and mechanisms at various levels: inter- and intracellular, molecular, and genetic.
Investigates modification by and interactions with chemical and physical agents.
Includes mathematical modeling and the development of new techniques.
Publication Content:
Original research papers
Invited review articles
Book reviews
Letters commenting on previously published papers and/or topics of general interest