RECENT DEVELOPMENTS IN SOFT COMPUTING BASED TECHNIQUES FOR FEATURE SELECTION AND DISEASE CLASSIFICATION

IF 0.2 Q4 MULTIDISCIPLINARY SCIENCES
Naiyar Iqbal, P. Kumar
{"title":"RECENT DEVELOPMENTS IN SOFT COMPUTING BASED TECHNIQUES FOR FEATURE SELECTION AND DISEASE CLASSIFICATION","authors":"Naiyar Iqbal, P. Kumar","doi":"10.55766/sujst-2023-02-e01872","DOIUrl":null,"url":null,"abstract":"Computational prediction of diseases is vital in medical research that contributes to computer-aided diagnostics and helps doctors and medical practitioners in critical decision-making for various diseases such as bacterial and viral kinds of disease, including COVID-19 of the current pandemic situation. Feature selection techniques function as a preprocessing phase for classification and prediction algorithms. For disease prediction, these features may be the patient’s clinical profiles or genomic features such as gene expression profiles from microarray and read counts from RNA-Seq. The performance of a classifier depends primarily on the selected features. In addition, genomic features are too large in numbers, resulting in the curse of dimensionality problem. In the last few years, several feature selection algorithms have been developed to overcome the existing problems to get rid of eliminating chronic diseases, such as various cancers, Zika virus, Ebola virus, and the COVID-19 pandemic. In this review article, we systematically associate soft computing-based approaches for feature selection and disease prediction by applying three data types: patients’ clinical profiles, microarray gene expression profiles, and RNA-Seq sample profiles. According to related work, when the discussion took place, the percentage of medical data types highlighted through pictorial representation and the respective ratio of percentages mentioned were 52%, 27%, 9% and 12% for clinical symptoms, gene expression, MRI-Image and other data types such as signal or text-based utilized, respectively. We also highlight the significant challenges and future directions in this research domain. \n ","PeriodicalId":43478,"journal":{"name":"Suranaree Journal of Science and Technology","volume":"4 1","pages":""},"PeriodicalIF":0.2000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Suranaree Journal of Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55766/sujst-2023-02-e01872","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Computational prediction of diseases is vital in medical research that contributes to computer-aided diagnostics and helps doctors and medical practitioners in critical decision-making for various diseases such as bacterial and viral kinds of disease, including COVID-19 of the current pandemic situation. Feature selection techniques function as a preprocessing phase for classification and prediction algorithms. For disease prediction, these features may be the patient’s clinical profiles or genomic features such as gene expression profiles from microarray and read counts from RNA-Seq. The performance of a classifier depends primarily on the selected features. In addition, genomic features are too large in numbers, resulting in the curse of dimensionality problem. In the last few years, several feature selection algorithms have been developed to overcome the existing problems to get rid of eliminating chronic diseases, such as various cancers, Zika virus, Ebola virus, and the COVID-19 pandemic. In this review article, we systematically associate soft computing-based approaches for feature selection and disease prediction by applying three data types: patients’ clinical profiles, microarray gene expression profiles, and RNA-Seq sample profiles. According to related work, when the discussion took place, the percentage of medical data types highlighted through pictorial representation and the respective ratio of percentages mentioned were 52%, 27%, 9% and 12% for clinical symptoms, gene expression, MRI-Image and other data types such as signal or text-based utilized, respectively. We also highlight the significant challenges and future directions in this research domain.  
基于软计算的特征选择和疾病分类技术的最新进展
疾病的计算预测在医学研究中至关重要,有助于计算机辅助诊断,并帮助医生和医疗从业者对各种疾病(如细菌和病毒性疾病,包括当前大流行的COVID-19)进行关键决策。特征选择技术是分类和预测算法的预处理阶段。对于疾病预测,这些特征可能是患者的临床特征或基因组特征,例如来自微阵列的基因表达谱和来自RNA-Seq的读取计数。分类器的性能主要取决于所选择的特征。另外,基因组特征数量过多,导致了维数问题的困扰。在过去的几年里,已经开发了几种特征选择算法来克服现有的问题,以摆脱消除慢性疾病,如各种癌症,寨卡病毒,埃博拉病毒和COVID-19大流行。在这篇综述文章中,我们通过应用三种数据类型系统地将基于软计算的方法用于特征选择和疾病预测:患者临床资料、微阵列基因表达谱和RNA-Seq样本谱。根据相关工作,在讨论进行时,临床症状、基因表达、MRI-Image以及基于信号或文本的其他数据类型的利用,通过图形表示突出显示的医疗数据类型百分比和提及的百分比分别为52%、27%、9%和12%。我们还强调了该研究领域的重大挑战和未来方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Suranaree Journal of Science and Technology
Suranaree Journal of Science and Technology MULTIDISCIPLINARY SCIENCES-
CiteScore
0.30
自引率
50.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信