不同分类算法对RNA-Seq癌症数据的比较研究

N. Şimşek, B. Haznedar, Cihan Kuzudişli
{"title":"不同分类算法对RNA-Seq癌症数据的比较研究","authors":"N. Şimşek, B. Haznedar, Cihan Kuzudişli","doi":"10.18844/gjpaas.v0i12.4983","DOIUrl":null,"url":null,"abstract":"Gene mutations are the most important reason of cancer diseases, and there are different kind of causing genes across these diseases. RNA-Seq technology enables us to allow for gathering information about many genes simultaneously; hence, RNA-Seq data can be used for cancer diagnosis and classification. In this study, RNA-Seq dataset for renal cell cancer is analysed using three different developed classification methods: random forest (RF), artificial neural network (ANN) and deep learning (DL). The genes in our dataset are related to the following cancer types: kidney renal papillary cell, kidney renal clear cell and kidney chromophore carcinomas. It suggests that the DL method gives the highest accuracy rate compared to RF and ANN for 95.15%, 91.83% and 89.22%, respectively. We believe that the results acquired in this study will make a contribution to the classification of cancer types and support doctors in their processes of decision making. \n  \nKeywords: Classification, gene-expression, RNA-Seq, DL.","PeriodicalId":210768,"journal":{"name":"New Trends and Issues Proceedings on Advances in Pure and Applied Sciences","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A comparative study of different classification algorithms on RNA-Seq cancer data\",\"authors\":\"N. Şimşek, B. Haznedar, Cihan Kuzudişli\",\"doi\":\"10.18844/gjpaas.v0i12.4983\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Gene mutations are the most important reason of cancer diseases, and there are different kind of causing genes across these diseases. RNA-Seq technology enables us to allow for gathering information about many genes simultaneously; hence, RNA-Seq data can be used for cancer diagnosis and classification. In this study, RNA-Seq dataset for renal cell cancer is analysed using three different developed classification methods: random forest (RF), artificial neural network (ANN) and deep learning (DL). The genes in our dataset are related to the following cancer types: kidney renal papillary cell, kidney renal clear cell and kidney chromophore carcinomas. It suggests that the DL method gives the highest accuracy rate compared to RF and ANN for 95.15%, 91.83% and 89.22%, respectively. We believe that the results acquired in this study will make a contribution to the classification of cancer types and support doctors in their processes of decision making. \\n  \\nKeywords: Classification, gene-expression, RNA-Seq, DL.\",\"PeriodicalId\":210768,\"journal\":{\"name\":\"New Trends and Issues Proceedings on Advances in Pure and Applied Sciences\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"New Trends and Issues Proceedings on Advances in Pure and Applied Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18844/gjpaas.v0i12.4983\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Trends and Issues Proceedings on Advances in Pure and Applied Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18844/gjpaas.v0i12.4983","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

基因突变是导致癌症的最重要原因,在这些疾病中有不同类型的致病基因。RNA-Seq技术使我们能够同时收集许多基因的信息;因此,RNA-Seq数据可用于癌症的诊断和分类。本研究使用随机森林(RF)、人工神经网络(ANN)和深度学习(DL)三种不同的分类方法对肾细胞癌的RNA-Seq数据集进行了分析。我们数据集中的基因与以下癌症类型有关:肾乳头状细胞癌、肾透明细胞癌和肾发色团癌。结果表明,与RF和ANN相比,DL方法的准确率最高,分别为95.15%、91.83%和89.22%。我们相信本研究获得的结果将有助于癌症类型的分类,并支持医生的决策过程。关键词:分类,基因表达,RNA-Seq, DL。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A comparative study of different classification algorithms on RNA-Seq cancer data
Gene mutations are the most important reason of cancer diseases, and there are different kind of causing genes across these diseases. RNA-Seq technology enables us to allow for gathering information about many genes simultaneously; hence, RNA-Seq data can be used for cancer diagnosis and classification. In this study, RNA-Seq dataset for renal cell cancer is analysed using three different developed classification methods: random forest (RF), artificial neural network (ANN) and deep learning (DL). The genes in our dataset are related to the following cancer types: kidney renal papillary cell, kidney renal clear cell and kidney chromophore carcinomas. It suggests that the DL method gives the highest accuracy rate compared to RF and ANN for 95.15%, 91.83% and 89.22%, respectively. We believe that the results acquired in this study will make a contribution to the classification of cancer types and support doctors in their processes of decision making.   Keywords: Classification, gene-expression, RNA-Seq, DL.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信