{"title":"经典统计学和深度学习能在可解释的、因果驱动的目标发现上融合吗?","authors":"Liyin Chen","doi":"10.1093/dnares/dsaf024","DOIUrl":null,"url":null,"abstract":"<p><p>Understanding the molecular causes of complex diseases remains one of the most pressing challenges in biomedicine. Despite large-scale genome-wide association studies mapping thousands of risk loci, identifying which genetic variants truly drive disease remains difficult. Traditional statistical genetics has laid a strong foundation for variant discovery, but it often struggles to capture non-linear interactions and cannot fully integrate the breadth of the interconnected multi-omics data. In recent years, deep learning approaches have shown promise in bridging these gaps: modeling high-order genetic interactions, uncovering latent biological structure, and enabling multi-layered data integration. However, most current deep learning models for genomics remain exploratory in nature, and issues such as susceptibility to overfitting, difficulties in interpretability, and the general lack of standardized evaluation frameworks have limited their widespread adoption for genomics research. In this review, we explore how traditional statistical and deep learning methods can be applied to uncover causal mechanisms in complex disease. We critically compare these two frameworks for their advantages and limitations in detecting genetic associations and prioritizing causal associations. Toward the end, we propose a future direction centered around hybrid models that blend the scalability of deep learning with the inferential power of statistical genetics. Our goal is to guide researchers in developing next-generation computational tools to uncover the molecular basis of complex diseases and accelerate the translation of genetic findings into effective treatments.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":" ","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Can classical statistics and deep learning converge on explainable, causally driven target discovery?\",\"authors\":\"Liyin Chen\",\"doi\":\"10.1093/dnares/dsaf024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Understanding the molecular causes of complex diseases remains one of the most pressing challenges in biomedicine. Despite large-scale genome-wide association studies mapping thousands of risk loci, identifying which genetic variants truly drive disease remains difficult. Traditional statistical genetics has laid a strong foundation for variant discovery, but it often struggles to capture non-linear interactions and cannot fully integrate the breadth of the interconnected multi-omics data. In recent years, deep learning approaches have shown promise in bridging these gaps: modeling high-order genetic interactions, uncovering latent biological structure, and enabling multi-layered data integration. However, most current deep learning models for genomics remain exploratory in nature, and issues such as susceptibility to overfitting, difficulties in interpretability, and the general lack of standardized evaluation frameworks have limited their widespread adoption for genomics research. In this review, we explore how traditional statistical and deep learning methods can be applied to uncover causal mechanisms in complex disease. We critically compare these two frameworks for their advantages and limitations in detecting genetic associations and prioritizing causal associations. Toward the end, we propose a future direction centered around hybrid models that blend the scalability of deep learning with the inferential power of statistical genetics. Our goal is to guide researchers in developing next-generation computational tools to uncover the molecular basis of complex diseases and accelerate the translation of genetic findings into effective treatments.</p>\",\"PeriodicalId\":51014,\"journal\":{\"name\":\"DNA Research\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"DNA Research\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/dnares/dsaf024\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"DNA Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/dnares/dsaf024","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
Can classical statistics and deep learning converge on explainable, causally driven target discovery?
Understanding the molecular causes of complex diseases remains one of the most pressing challenges in biomedicine. Despite large-scale genome-wide association studies mapping thousands of risk loci, identifying which genetic variants truly drive disease remains difficult. Traditional statistical genetics has laid a strong foundation for variant discovery, but it often struggles to capture non-linear interactions and cannot fully integrate the breadth of the interconnected multi-omics data. In recent years, deep learning approaches have shown promise in bridging these gaps: modeling high-order genetic interactions, uncovering latent biological structure, and enabling multi-layered data integration. However, most current deep learning models for genomics remain exploratory in nature, and issues such as susceptibility to overfitting, difficulties in interpretability, and the general lack of standardized evaluation frameworks have limited their widespread adoption for genomics research. In this review, we explore how traditional statistical and deep learning methods can be applied to uncover causal mechanisms in complex disease. We critically compare these two frameworks for their advantages and limitations in detecting genetic associations and prioritizing causal associations. Toward the end, we propose a future direction centered around hybrid models that blend the scalability of deep learning with the inferential power of statistical genetics. Our goal is to guide researchers in developing next-generation computational tools to uncover the molecular basis of complex diseases and accelerate the translation of genetic findings into effective treatments.
期刊介绍:
DNA Research is an internationally peer-reviewed journal which aims at publishing papers of highest quality in broad aspects of DNA and genome-related research. Emphasis will be made on the following subjects: 1) Sequencing and characterization of genomes/important genomic regions, 2) Comprehensive analysis of the functions of genes, gene families and genomes, 3) Techniques and equipments useful for structural and functional analysis of genes, gene families and genomes, 4) Computer algorithms and/or their applications relevant to structural and functional analysis of genes and genomes. The journal also welcomes novel findings in other scientific disciplines related to genomes.