Integrating gene mutation spectra from tumors and the general population with gene expression topological networks to identify novel cancer driver genes.

IF 3.8 2区 生物学 Q2 GENETICS & HEREDITY
Human Genetics Pub Date : 2025-07-01 Epub Date: 2025-06-14 DOI:10.1007/s00439-025-02755-9
Shuangyu Yang, Dan He, Ling Li, Zhiya Lu, Shaoying Li, Tianjun Lan, Feiyi Liu, Huasong Zhang, David N Cooper, Huiying Zhao
{"title":"Integrating gene mutation spectra from tumors and the general population with gene expression topological networks to identify novel cancer driver genes.","authors":"Shuangyu Yang, Dan He, Ling Li, Zhiya Lu, Shaoying Li, Tianjun Lan, Feiyi Liu, Huasong Zhang, David N Cooper, Huiying Zhao","doi":"10.1007/s00439-025-02755-9","DOIUrl":null,"url":null,"abstract":"<p><p>Discovering cancer driver genes is critical for improving survival rates. Current methods often overlook the varying functional impacts of mutations. It is necessary to develop a method integrating mutation pathogenicity and gene expression data, enhancing the identification of novel cancer drivers. To predict cancer drivers, we have developed a framework (DGAT-cancer) that integrates the pathogenicity of somatic mutation in tumors and germline variants in the healthy population, with topological networks of gene expression in tumors, and the gene expressions in tumor and paracancerous tissues. This integration overcomes the limitations of current methods that assume a uniform impact of all mutations by leveraging a comprehensive view of mutation function within its biological context. These features were filtered by an unsupervised approach, Laplacian selection, and combined by Hotelling and Box-Cox transformations to score genes. By using gene scores as weights, Gibbs sampling was performed to identify cancer drivers. DGAT-cancer was applied to seven types of cancer cohorts, and achieved the best area under the precision-recall curve (AUPRC ranging from 0.646 to 0.862) compared to five commonly used methods (AUPRC ranging from 0.357 to 0.629). DGAT-cancer has identified 505 cancer drivers. Knockdown of the top ranked gene, EEF1A1 indicated a ~ 41-50% decrease in glioma size and improved the temozolomide sensitivity of glioma cells. By combining heterogeneous genomics and transcriptomics data, DGAT-cancer has significantly improved our ability to detect novel cancer drivers, and is an innovative approach revealing cancer therapeutic targets, thereby advancing the development of more precise and effective cancer treatments.</p>","PeriodicalId":13175,"journal":{"name":"Human Genetics","volume":" ","pages":"775-794"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s00439-025-02755-9","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/14 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Discovering cancer driver genes is critical for improving survival rates. Current methods often overlook the varying functional impacts of mutations. It is necessary to develop a method integrating mutation pathogenicity and gene expression data, enhancing the identification of novel cancer drivers. To predict cancer drivers, we have developed a framework (DGAT-cancer) that integrates the pathogenicity of somatic mutation in tumors and germline variants in the healthy population, with topological networks of gene expression in tumors, and the gene expressions in tumor and paracancerous tissues. This integration overcomes the limitations of current methods that assume a uniform impact of all mutations by leveraging a comprehensive view of mutation function within its biological context. These features were filtered by an unsupervised approach, Laplacian selection, and combined by Hotelling and Box-Cox transformations to score genes. By using gene scores as weights, Gibbs sampling was performed to identify cancer drivers. DGAT-cancer was applied to seven types of cancer cohorts, and achieved the best area under the precision-recall curve (AUPRC ranging from 0.646 to 0.862) compared to five commonly used methods (AUPRC ranging from 0.357 to 0.629). DGAT-cancer has identified 505 cancer drivers. Knockdown of the top ranked gene, EEF1A1 indicated a ~ 41-50% decrease in glioma size and improved the temozolomide sensitivity of glioma cells. By combining heterogeneous genomics and transcriptomics data, DGAT-cancer has significantly improved our ability to detect novel cancer drivers, and is an innovative approach revealing cancer therapeutic targets, thereby advancing the development of more precise and effective cancer treatments.

整合来自肿瘤和普通人群的基因突变谱与基因表达拓扑网络,以识别新的癌症驱动基因。
发现癌症驱动基因对提高生存率至关重要。目前的方法往往忽略了突变对不同功能的影响。有必要开发一种整合突变致病性和基因表达数据的方法,以加强对新型癌症驱动因素的识别。为了预测癌症驱动因素,我们开发了一个框架(DGAT-cancer),该框架整合了肿瘤中体细胞突变和健康人群中种系变异的致病性,肿瘤中基因表达的拓扑网络,以及肿瘤和癌旁组织中的基因表达。这种整合克服了当前方法的局限性,即通过利用突变功能在其生物学背景下的综合观点来假设所有突变的统一影响。这些特征通过一种无监督的方法来过滤,即拉普拉斯选择,并通过Hotelling和Box-Cox转换相结合来对基因进行评分。通过使用基因评分作为权重,吉布斯抽样来确定癌症驱动因素。将DGAT-cancer应用于7种类型的癌症队列,与常用的5种方法(AUPRC为0.357 ~ 0.629)相比,精确召回率曲线下面积(AUPRC为0.646 ~ 0.862)最佳。DGAT-cancer已经确定了505个癌症驱动因素。EEF1A1基因敲除后,胶质瘤的大小减少了约41-50%,胶质瘤细胞对替莫唑胺的敏感性提高。通过结合异质性基因组学和转录组学数据,DGAT-cancer显著提高了我们检测新型癌症驱动因素的能力,是一种揭示癌症治疗靶点的创新方法,从而推动了更精确、更有效的癌症治疗的发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Human Genetics
Human Genetics 生物-遗传学
CiteScore
10.80
自引率
3.80%
发文量
94
审稿时长
1 months
期刊介绍: Human Genetics is a monthly journal publishing original and timely articles on all aspects of human genetics. The Journal particularly welcomes articles in the areas of Behavioral genetics, Bioinformatics, Cancer genetics and genomics, Cytogenetics, Developmental genetics, Disease association studies, Dysmorphology, ELSI (ethical, legal and social issues), Evolutionary genetics, Gene expression, Gene structure and organization, Genetics of complex diseases and epistatic interactions, Genetic epidemiology, Genome biology, Genome structure and organization, Genotype-phenotype relationships, Human Genomics, Immunogenetics and genomics, Linkage analysis and genetic mapping, Methods in Statistical Genetics, Molecular diagnostics, Mutation detection and analysis, Neurogenetics, Physical mapping and Population Genetics. Articles reporting animal models relevant to human biology or disease are also welcome. Preference will be given to those articles which address clinically relevant questions or which provide new insights into human biology. Unless reporting entirely novel and unusual aspects of a topic, clinical case reports, cytogenetic case reports, papers on descriptive population genetics, articles dealing with the frequency of polymorphisms or additional mutations within genes in which numerous lesions have already been described, and papers that report meta-analyses of previously published datasets will normally not be accepted. The Journal typically will not consider for publication manuscripts that report merely the isolation, map position, structure, and tissue expression profile of a gene of unknown function unless the gene is of particular interest or is a candidate gene involved in a human trait or disorder.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信