Gene Expression Analysis on Cancer Dataset

R. Vignesh, D. Deepa, Suja Cherukullapurath Mana, B. Samhitha, A. T
{"title":"Gene Expression Analysis on Cancer Dataset","authors":"R. Vignesh, D. Deepa, Suja Cherukullapurath Mana, B. Samhitha, A. T","doi":"10.1109/ICOEI51242.2021.9452965","DOIUrl":null,"url":null,"abstract":"Genes are the basis of tumor formations around the body, which is better known as cancer. They inhibit basic processes such as cell death (apoptosis) and promote cell division to an unhealthy extent. The expression of every gene provides a baseline to know the progress of cancer from the organ or tissue it originated from along with its approximated course of action. The analysis of such gene expression values using traditional machine learning methods provide a higher efficiency and accuracy in finding relationships between genes and also it may serve as a future for diagnosing the cancer by using these values. The main challenge is to use the bases that are created to efficiently compute the highly effective genes to treat specific types of cancer by using their expression values and thus, raise the question of a potential relationship between them for each type. A Random Forest Model has been used to perform Feature Selection over the dataset in order to extract the important features (i.e.) the most influential genes. They are then visualized by using traditional packages in Python (i.e. Scikit-plot, Matplotlib, Seaborn) and using a data visualization tool called Tableau to project the result of the analysis.","PeriodicalId":420826,"journal":{"name":"2021 5th International Conference on Trends in Electronics and Informatics (ICOEI)","volume":"130 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th International Conference on Trends in Electronics and Informatics (ICOEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOEI51242.2021.9452965","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Genes are the basis of tumor formations around the body, which is better known as cancer. They inhibit basic processes such as cell death (apoptosis) and promote cell division to an unhealthy extent. The expression of every gene provides a baseline to know the progress of cancer from the organ or tissue it originated from along with its approximated course of action. The analysis of such gene expression values using traditional machine learning methods provide a higher efficiency and accuracy in finding relationships between genes and also it may serve as a future for diagnosing the cancer by using these values. The main challenge is to use the bases that are created to efficiently compute the highly effective genes to treat specific types of cancer by using their expression values and thus, raise the question of a potential relationship between them for each type. A Random Forest Model has been used to perform Feature Selection over the dataset in order to extract the important features (i.e.) the most influential genes. They are then visualized by using traditional packages in Python (i.e. Scikit-plot, Matplotlib, Seaborn) and using a data visualization tool called Tableau to project the result of the analysis.
癌症数据集的基因表达分析
基因是身体周围肿瘤形成的基础,也就是我们熟知的癌症。它们抑制细胞死亡(凋亡)等基本过程,并在不健康的程度上促进细胞分裂。每个基因的表达都提供了一个基线,可以从癌症起源的器官或组织以及它的大致作用过程中了解癌症的进展。使用传统的机器学习方法对这些基因表达值进行分析,在寻找基因之间的关系方面提供了更高的效率和准确性,并且可以作为利用这些值诊断癌症的未来。主要的挑战是利用已经创建的碱基来有效地计算高效基因,通过使用它们的表达值来治疗特定类型的癌症,从而提出它们之间每种类型之间潜在关系的问题。随机森林模型已被用于对数据集进行特征选择,以提取重要特征(即)最具影响力的基因。然后使用Python中的传统软件包(即Scikit-plot, Matplotlib, Seaborn)并使用称为Tableau的数据可视化工具来投影分析结果,从而将它们可视化。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信