On integrative analysis of multi-level gene expression data in Kidney cancer subgrouping.

IF 0.8 Q4 UROLOGY & NEPHROLOGY
Pratheeba Jeyananthan, Maduranga W P N, Rodrigo S M
{"title":"On integrative analysis of multi-level gene expression data in Kidney cancer subgrouping.","authors":"Pratheeba Jeyananthan, Maduranga W P N, Rodrigo S M","doi":"10.1177/03915603241304604","DOIUrl":null,"url":null,"abstract":"<p><p>Kidney cancer is one of the most dangerous cancer mainly targeting men. In 2020, around 430, 000 people were diagnosed with this disease worldwide. It can be divided into three prime subgroups such as kidney renal cell carcinoma (KIRC), kidney renal papilliary cell carcinoma (KIRP) and kidney chromophobe (KICH). Correct identification of these subgroups on time is crucial for the initiation and determination of proper treatment. On-time identification of this disease and its subgroup can help both the clinicians and patients to improve the situation. Hence, this study checks the possibility of using multi-omics data in the kidney cancer subgrouping, whether integrating multiple omics data will increase the subgrouping accuracy or not. Four different molecular data such as genomics, proteomics, epigenomics and miRNA from The Cancer Genome Atlas (TCGA) are used in this study. As the data is in a very high dimension world, this study starts with selecting the relevant features of the study using Pearson's correlation coefficient. Those selected features are used with three different classification algorithms such as k-nearest neighbor (KNN), supporting vector machines (SVMs) and random forest. Performances are compared to see whether the integration of multi-omics data can improve the accuracy of kidney cancer subgrouping. This study shows that integration of multi-omics data can improve the performance of the kidney cancer subgrouping. The highest performance (accuracy value of 0.98±0.03) is gained by top 400 features selected from integrated multi-omics data, with support vector machines.</p>","PeriodicalId":23574,"journal":{"name":"Urologia Journal","volume":" ","pages":"3915603241304604"},"PeriodicalIF":0.8000,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Urologia Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/03915603241304604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Kidney cancer is one of the most dangerous cancer mainly targeting men. In 2020, around 430, 000 people were diagnosed with this disease worldwide. It can be divided into three prime subgroups such as kidney renal cell carcinoma (KIRC), kidney renal papilliary cell carcinoma (KIRP) and kidney chromophobe (KICH). Correct identification of these subgroups on time is crucial for the initiation and determination of proper treatment. On-time identification of this disease and its subgroup can help both the clinicians and patients to improve the situation. Hence, this study checks the possibility of using multi-omics data in the kidney cancer subgrouping, whether integrating multiple omics data will increase the subgrouping accuracy or not. Four different molecular data such as genomics, proteomics, epigenomics and miRNA from The Cancer Genome Atlas (TCGA) are used in this study. As the data is in a very high dimension world, this study starts with selecting the relevant features of the study using Pearson's correlation coefficient. Those selected features are used with three different classification algorithms such as k-nearest neighbor (KNN), supporting vector machines (SVMs) and random forest. Performances are compared to see whether the integration of multi-omics data can improve the accuracy of kidney cancer subgrouping. This study shows that integration of multi-omics data can improve the performance of the kidney cancer subgrouping. The highest performance (accuracy value of 0.98±0.03) is gained by top 400 features selected from integrated multi-omics data, with support vector machines.

肾癌分组中多级基因表达数据的综合分析
肾癌是最危险的癌症之一,主要针对男性。2020年,全世界约有43万人被诊断患有这种疾病。它可分为肾肾细胞癌(KIRC)、肾肾乳头状细胞癌(KIRP)和肾憎色细胞癌(KICH)三个主要亚群。及时正确识别这些亚群对于开始和确定适当的治疗至关重要。及时识别这种疾病及其亚群可以帮助临床医生和患者改善这种情况。因此,本研究将检验在肾癌亚组中使用多组学数据的可能性,以及整合多组学数据是否会提高亚组的准确性。本研究使用了来自癌症基因组图谱(TCGA)的基因组学、蛋白质组学、表观基因组学和miRNA等四种不同的分子数据。由于数据处于非常高维的世界,本研究首先使用Pearson相关系数选择研究的相关特征。这些选择的特征与三种不同的分类算法一起使用,如k-最近邻(KNN)、支持向量机(svm)和随机森林。比较性能,看看多组学数据的整合是否可以提高肾癌亚组的准确性。本研究表明,多组学数据的整合可以提高肾癌亚组的表现。采用支持向量机从综合多组学数据中选取前400个特征,准确率最高,为0.98±0.03。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Urologia Journal
Urologia Journal UROLOGY & NEPHROLOGY-
CiteScore
0.60
自引率
12.50%
发文量
66
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信