Cancer Tumor Detection Using Genetic Mutated Data and Machine Learning Models

Aniruddha Mohanty, Alok R. Prusty, Ravindranath Cherukuri
{"title":"Cancer Tumor Detection Using Genetic Mutated Data and Machine Learning Models","authors":"Aniruddha Mohanty, Alok R. Prusty, Ravindranath Cherukuri","doi":"10.1109/ICICCSP53532.2022.9862476","DOIUrl":null,"url":null,"abstract":"Early detection of a disease is a crucial task because of unavailability of proper medical facilities. Cancer is one of the critical diseases that needs early detection for survival. A cancer tumor is caused due to thousands of genetic mutations. Understanding the genetic mutations of cancer tumor is a tedious and time-consuming task. A list of genetic variations is analysed manually by a molecular pathologist. The clinical strips of indication are of nine classes, but the classification is still unknown. The objective of this implementation is to suggest a multiclass classifier which classifies the genetic mutations with respect to the clinical signs. The clinical evidences are text-evidences of gene mutations and analysed by Natural Language Processing (NLP). Various machine learning concepts like Naive Bayes, Logistic Regression, Linear Support Vector Machine, Random Forest Classifier applied on the collected dataset which contain the evidence based on genetic mutations and other clinical evidences that pathology or specialists used to classify the gene mutations. The performances of the models are analysed to get the best results. The machine learning models are implemented and analyzed with the help of gene, variance and text features. Based on the variants of gene mutation, the risk of the cancer can be detected and the medications can be prescribed accordingly.","PeriodicalId":326163,"journal":{"name":"2022 International Conference on Intelligent Controller and Computing for Smart Power (ICICCSP)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Intelligent Controller and Computing for Smart Power (ICICCSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICCSP53532.2022.9862476","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Early detection of a disease is a crucial task because of unavailability of proper medical facilities. Cancer is one of the critical diseases that needs early detection for survival. A cancer tumor is caused due to thousands of genetic mutations. Understanding the genetic mutations of cancer tumor is a tedious and time-consuming task. A list of genetic variations is analysed manually by a molecular pathologist. The clinical strips of indication are of nine classes, but the classification is still unknown. The objective of this implementation is to suggest a multiclass classifier which classifies the genetic mutations with respect to the clinical signs. The clinical evidences are text-evidences of gene mutations and analysed by Natural Language Processing (NLP). Various machine learning concepts like Naive Bayes, Logistic Regression, Linear Support Vector Machine, Random Forest Classifier applied on the collected dataset which contain the evidence based on genetic mutations and other clinical evidences that pathology or specialists used to classify the gene mutations. The performances of the models are analysed to get the best results. The machine learning models are implemented and analyzed with the help of gene, variance and text features. Based on the variants of gene mutation, the risk of the cancer can be detected and the medications can be prescribed accordingly.
利用基因突变数据和机器学习模型进行癌症肿瘤检测
由于缺乏适当的医疗设施,及早发现疾病是一项至关重要的任务。癌症是需要早期发现才能生存的重要疾病之一。癌症是由成千上万的基因突变引起的。了解癌症肿瘤的基因突变是一项繁琐而耗时的任务。一份遗传变异的清单由分子病理学家手工分析。临床指征条有九类,但分类尚不清楚。这一实现的目的是建议一个多类分类器,其分类的基因突变相对于临床症状。临床证据是基因突变的文本证据,并通过自然语言处理(NLP)进行分析。各种机器学习概念,如朴素贝叶斯,逻辑回归,线性支持向量机,随机森林分类器应用于收集的数据集,其中包含基于基因突变的证据以及病理学或专家用于分类基因突变的其他临床证据。对模型的性能进行了分析,得到了最佳结果。利用基因、方差和文本特征实现和分析机器学习模型。根据基因突变的变异,可以检测癌症的风险,并相应地开出药物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信