MiRNA-gene network embedding for predicting cancer driver genes.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS

ACS Applied Bio Materials | Pub Date: 2023-07-17 | DOI: 10.1093/bfgp/elac059

Wei Peng, Rong Wu, Wei Dai, Yu Ning, Xiaodong Fu, Li Liu, Lijun Liu

Citations: 1

Abstract

The development and progression of cancer arise from the accumulation of mutations in driver genes. Correctly identifying the driver genes that lead to cancer development can significantly assist drug design, cancer diagnosis and treatment. Most computational methods detect cancer drivers from gene-gene networks, assuming that driver genes tend to work together, form protein complexes and enrich pathways. However, they ignore that microRNAs (miRNAs) regulate the expression of their target genes and are related to human diseases. In this work, we propose a graph convolution network (GCN) approach called GM-GCN to identify cancer driver genes based on a gene-miRNA network. First, we constructed a gene-miRNA network in which the nodes are miRNAs and their target genes, and each edge between a miRNA and a gene indicates a regulatory relationship. We prepared initial attributes for miRNAs and genes according to their biological properties and used a GCN model to learn gene feature representations in the network by aggregating the features of their neighboring miRNA nodes. The learned features were then passed through a 1D convolution module to change the feature dimensionality. We employed the learned and original gene features to optimize model parameters. Finally, the gene features learned from the network and the initial input gene features were fed into a logistic regression model to predict whether a gene is a driver gene. We applied our model and state-of-the-art methods to predict cancer drivers for pan-cancer and individual cancer types. Experimental results show that our model performs well in terms of the area under the receiver operating characteristic curve and the area under the precision-recall curve compared to state-of-the-art methods that work on gene networks. GM-GCN is freely available at https://github.com/weiba/GM-GCN.
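The pipeline described in the abstract can be illustrated with a minimal NumPy sketch. This is not the GM-GCN implementation (see the repository above for that); it only shows the forward pass of the four stages under several assumptions: genes and miRNAs share one input feature dimension, the graph convolution uses the common symmetric normalization with self-loops, all weights are random rather than trained, and every function name here is hypothetical.

```python
import numpy as np


def normalized_adjacency(edges, n_genes, n_mirnas):
    """Build D^-1/2 (A + I) D^-1/2 for a gene-miRNA bipartite graph.

    Genes occupy node indices [0, n_genes); miRNAs follow them.
    Self-loops are an assumption of this sketch, not taken from the paper.
    """
    n = n_genes + n_mirnas
    a = np.eye(n)
    for gene, mirna in edges:
        a[gene, n_genes + mirna] = 1.0
        a[n_genes + mirna, gene] = 1.0
    d_inv_sqrt = 1.0 / np.sqrt(a.sum(axis=1))
    return a * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcn_layer(a_hat, x, w):
    """One graph convolution: aggregate neighbor features, project, apply ReLU."""
    return np.maximum(a_hat @ x @ w, 0.0)


def conv1d_valid(h, kernel):
    """Slide a 1D kernel along each row's feature axis (no padding).

    Reduces the feature dimension from d to d - len(kernel) + 1.
    """
    windows = np.lib.stride_tricks.sliding_window_view(h, len(kernel), axis=1)
    return windows @ kernel


def predict_driver_proba(learned, original, w, b):
    """Logistic regression on concatenated learned and original gene features."""
    z = np.concatenate([learned, original], axis=1) @ w + b
    return 1.0 / (1.0 + np.exp(-z))


# Toy data: 4 genes, 3 miRNAs, random features and untrained random weights.
rng = np.random.default_rng(0)
n_genes, n_mirnas, d_in, d_hidden = 4, 3, 6, 5
edges = [(0, 0), (1, 0), (1, 1), (2, 2), (3, 1)]  # (gene, miRNA) pairs

x = rng.normal(size=(n_genes + n_mirnas, d_in))
a_hat = normalized_adjacency(edges, n_genes, n_mirnas)
h = gcn_layer(a_hat, x, rng.normal(size=(d_in, d_hidden)))

gene_h = conv1d_valid(h[:n_genes], rng.normal(size=3))  # shape (4, 3)
w = rng.normal(size=gene_h.shape[1] + d_in)
proba = predict_driver_proba(gene_h, x[:n_genes], w, 0.0)
print(proba.shape)  # one driver probability per gene
```

Training is omitted here. In practice the weights would be fitted against known driver-gene labels, and the resulting scores evaluated with the area under the ROC and precision-recall curves, as the abstract describes.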

Source journal: ACS Applied Bio Materials (Chemistry-Chemistry (all))

CiteScore: 9.40

Self-citation rate: 2.10%

Articles published: 464