Identification of Gene Subnetwork Biomarkers of Lung Cancer from RNA-seq Data

Kritsada Sreebunpeng, Jonathan H. Chan, A. Meechai
DOI: 10.1145/3429210.3429212 (https://doi.org/10.1145/3429210.3429212)
Published in: CSBio '20: Proceedings of the Eleventh International Conference on Computational Systems-Biology and Bioinformatics
Publication date: 2020-11-19
Citations: 0

Abstract

In recent years, the increasing availability of cancer RNA-seq datasets has provided unprecedented information and opportunities for discovering cancer biomarkers. In this study, we tested our previously published Gene Sub-Network-based Feature Selection (GSNFS) method on RNA-seq-based gene expression data of lung cancer to identify gene subnetwork biomarkers. In addition, we explored five different filter-based feature selection techniques for ranking the identified subnetworks. We found that the majority of the top 10 ranked subnetworks were associated with cancer pathways such as the MAPK signalling pathway. Using a Support Vector Machine (SVM) classifier, evaluated by the Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve with 10-fold cross-validation and cross-dataset validation, we showed that the gene subnetwork biomarkers obtained by RNA-seq-based GSNFS analysis had excellent classification performance. Additionally, by comparing the top-ranked subnetworks from the RNA-seq-based GSNFS analysis with the top-ranked subnetworks previously obtained from DNA microarray-based GSNFS analysis, we could categorize the subnetworks and find cancer pathways unique to each type of data.
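The evaluation described in the abstract (rank subnetworks with a filter method, then score an SVM by ROC AUC under 10-fold cross-validation) can be illustrated with a minimal sketch. This is not the authors' GSNFS implementation. Everything below is an assumption made for illustration: the data are synthetic, subnetwork activity is taken as the mean expression of the member genes, the ANOVA F-score from scikit-learn stands in for the five filter techniques (which the abstract does not name), and the subnetwork sizes and the choice of k = 5 are arbitrary.

```python
import numpy as np
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Synthetic stand-in for an expression matrix (samples x genes).
# Genes are grouped into consecutive, equally sized hypothetical subnetworks.
n_samples, n_subnets, genes_per_subnet = 100, 20, 10
n_genes = n_subnets * genes_per_subnet
y = np.repeat([0, 1], n_samples // 2)          # 0 = normal, 1 = tumour (assumed labels)
expr = rng.normal(size=(n_samples, n_genes))

# Make the first three subnetworks informative by shifting their genes in class 1.
informative = [0, 1, 2]
for s in informative:
    cols = slice(s * genes_per_subnet, (s + 1) * genes_per_subnet)
    expr[y == 1, cols] += 1.5


def subnetwork_activity(expr, n_subnets, genes_per_subnet):
    """Collapse gene expression to one value per subnetwork (mean of member genes)."""
    return expr.reshape(expr.shape[0], n_subnets, genes_per_subnet).mean(axis=2)


activity = subnetwork_activity(expr, n_subnets, genes_per_subnet)

# Filter-based ranking of subnetworks: higher F-score means better class separation.
f_scores, _ = f_classif(activity, y)
ranking = np.argsort(f_scores)[::-1]
top3 = set(ranking[:3].tolist())

# Selection happens inside the pipeline so each CV fold ranks on its own training data.
model = make_pipeline(
    SelectKBest(f_classif, k=5),
    StandardScaler(),
    SVC(kernel="linear"),
)
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
auc_scores = cross_val_score(model, activity, y, cv=cv, scoring="roc_auc")
mean_auc = float(auc_scores.mean())

print("Top-ranked subnetworks:", ranking[:5].tolist())
print("Mean 10-fold AUC:", round(mean_auc, 3))
```

Placing the feature selection step inside the pipeline keeps the ranking from seeing the held-out fold, which would otherwise inflate the AUC. Cross-dataset validation would follow the same pattern, fitting the pipeline on one dataset and scoring it on an independent one.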