基于模糊粗糙集熵的肿瘤耐药mirna识别集成

IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Joginder Singh , Shubhra Sankar Ray
{"title":"基于模糊粗糙集熵的肿瘤耐药mirna识别集成","authors":"Joginder Singh ,&nbsp;Shubhra Sankar Ray","doi":"10.1016/j.jocs.2025.102673","DOIUrl":null,"url":null,"abstract":"<div><div>MicroRNAs (miRNAs) are key biomarkers in cancer diagnosis and treatment. Identification of drug-resistant miRNAs may help in effective treatment of cancer. Two new z score based fuzzy rough relevance and redundancy entropies are developed and then a weighted framework is introduced to integrate the entropies for ranking and selecting miRNAs in classifying control and drug resistant patients. Here, two key components of soft computing, fuzzy set and rough set are utilized. The methodology is called a weighted framework for integrating fuzzy rough set-based relevance and redundancy entropies (WFIFRRRE). The z score is used to compute the fuzzy membership of expression values required for both entropies. Fuzziness deals with the overlapping nature of miRNA expression profiles and rough set helps in determining the exact class size. The weights in WFIFRRRE, assigned to relevance and redundancy entropies, are determined in a supervised manner to maximize the <span><math><mi>F</mi></math></span> score used for validating the classification performance in discriminating the control and drug-resistant patients. The weights are varied from 0 to 1 in steps of 0.01 which enables an integration between relevance and redundancy entropies. A subset of miRNAs is selected from the ranked list and the performance is evaluated using three benchmark classifiers on eight drug-resistant cancer datasets. Experimental results show that WFIFRRRE provides better prediction accuracy than the popular methods used for comparison. The classification accuracy in terms of <span><math><mi>F</mi></math></span> score, achieved by WFIFRRRE, ranges from 0.74 to 1.0, 0.75 to 1.0, and 0.73 to 1.0 using random forest, Naive Bayes, and linear SVM classifiers, respectively. The resultant set of miRNAs obtained using WFIFRRRE is also verified with the help of existing biological studies. The source code of WFIFRRRE is available at <span><span>https://www.isical.ac.in/ shubhra/WFIFRRRE.html</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"91 ","pages":"Article 102673"},"PeriodicalIF":3.7000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrating fuzzy rough set-based entropies for identifying drug-resistant miRNAs in cancer\",\"authors\":\"Joginder Singh ,&nbsp;Shubhra Sankar Ray\",\"doi\":\"10.1016/j.jocs.2025.102673\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>MicroRNAs (miRNAs) are key biomarkers in cancer diagnosis and treatment. Identification of drug-resistant miRNAs may help in effective treatment of cancer. Two new z score based fuzzy rough relevance and redundancy entropies are developed and then a weighted framework is introduced to integrate the entropies for ranking and selecting miRNAs in classifying control and drug resistant patients. Here, two key components of soft computing, fuzzy set and rough set are utilized. The methodology is called a weighted framework for integrating fuzzy rough set-based relevance and redundancy entropies (WFIFRRRE). The z score is used to compute the fuzzy membership of expression values required for both entropies. Fuzziness deals with the overlapping nature of miRNA expression profiles and rough set helps in determining the exact class size. The weights in WFIFRRRE, assigned to relevance and redundancy entropies, are determined in a supervised manner to maximize the <span><math><mi>F</mi></math></span> score used for validating the classification performance in discriminating the control and drug-resistant patients. The weights are varied from 0 to 1 in steps of 0.01 which enables an integration between relevance and redundancy entropies. A subset of miRNAs is selected from the ranked list and the performance is evaluated using three benchmark classifiers on eight drug-resistant cancer datasets. Experimental results show that WFIFRRRE provides better prediction accuracy than the popular methods used for comparison. The classification accuracy in terms of <span><math><mi>F</mi></math></span> score, achieved by WFIFRRRE, ranges from 0.74 to 1.0, 0.75 to 1.0, and 0.73 to 1.0 using random forest, Naive Bayes, and linear SVM classifiers, respectively. The resultant set of miRNAs obtained using WFIFRRRE is also verified with the help of existing biological studies. The source code of WFIFRRRE is available at <span><span>https://www.isical.ac.in/ shubhra/WFIFRRRE.html</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":48907,\"journal\":{\"name\":\"Journal of Computational Science\",\"volume\":\"91 \",\"pages\":\"Article 102673\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-07-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computational Science\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1877750325001504\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Science","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877750325001504","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

MicroRNAs (miRNAs)是癌症诊断和治疗的关键生物标志物。鉴定耐药mirna可能有助于有效治疗癌症。提出了两种新的基于z分数的模糊粗糙关联熵和冗余熵,并引入加权熵整合框架,对对照和耐药患者进行排序和选择。在这里,使用了软计算的两个关键组成部分:模糊集和粗糙集。该方法被称为基于模糊粗糙集的关联冗余熵加权积分框架(wfifrre)。z分数用于计算两个熵所需的表达式值的模糊隶属度。模糊性处理了miRNA表达谱的重叠性质,粗糙集有助于确定确切的类大小。WFIFRRRE中的权重,分配给相关熵和冗余熵,以监督的方式确定,以最大化F分,用于验证区分对照和耐药患者的分类性能。权重以0.01的步长从0到1变化,从而实现相关性和冗余熵之间的集成。从排名列表中选择一个mirna子集,并在八个耐药癌症数据集上使用三个基准分类器评估其性能。实验结果表明,WFIFRRRE比常用的比较方法具有更好的预测精度。WFIFRRRE在随机森林、朴素贝叶斯和线性支持向量机分类器上的分类精度F值分别为0.74 ~ 1.0、0.75 ~ 1.0和0.73 ~ 1.0。利用WFIFRRRE获得的mirna集合也在现有生物学研究的帮助下得到了验证。WFIFRRRE的源代码可从https://www.isical.ac.in/ shubhra/WFIFRRRE.html获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Integrating fuzzy rough set-based entropies for identifying drug-resistant miRNAs in cancer
MicroRNAs (miRNAs) are key biomarkers in cancer diagnosis and treatment. Identification of drug-resistant miRNAs may help in effective treatment of cancer. Two new z score based fuzzy rough relevance and redundancy entropies are developed and then a weighted framework is introduced to integrate the entropies for ranking and selecting miRNAs in classifying control and drug resistant patients. Here, two key components of soft computing, fuzzy set and rough set are utilized. The methodology is called a weighted framework for integrating fuzzy rough set-based relevance and redundancy entropies (WFIFRRRE). The z score is used to compute the fuzzy membership of expression values required for both entropies. Fuzziness deals with the overlapping nature of miRNA expression profiles and rough set helps in determining the exact class size. The weights in WFIFRRRE, assigned to relevance and redundancy entropies, are determined in a supervised manner to maximize the F score used for validating the classification performance in discriminating the control and drug-resistant patients. The weights are varied from 0 to 1 in steps of 0.01 which enables an integration between relevance and redundancy entropies. A subset of miRNAs is selected from the ranked list and the performance is evaluated using three benchmark classifiers on eight drug-resistant cancer datasets. Experimental results show that WFIFRRRE provides better prediction accuracy than the popular methods used for comparison. The classification accuracy in terms of F score, achieved by WFIFRRRE, ranges from 0.74 to 1.0, 0.75 to 1.0, and 0.73 to 1.0 using random forest, Naive Bayes, and linear SVM classifiers, respectively. The resultant set of miRNAs obtained using WFIFRRRE is also verified with the help of existing biological studies. The source code of WFIFRRRE is available at https://www.isical.ac.in/ shubhra/WFIFRRRE.html.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Computational Science
Journal of Computational Science COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-COMPUTER SCIENCE, THEORY & METHODS
CiteScore
5.50
自引率
3.00%
发文量
227
审稿时长
41 days
期刊介绍: Computational Science is a rapidly growing multi- and interdisciplinary field that uses advanced computing and data analysis to understand and solve complex problems. It has reached a level of predictive capability that now firmly complements the traditional pillars of experimentation and theory. The recent advances in experimental techniques such as detectors, on-line sensor networks and high-resolution imaging techniques, have opened up new windows into physical and biological processes at many levels of detail. The resulting data explosion allows for detailed data driven modeling and simulation. This new discipline in science combines computational thinking, modern computational methods, devices and collateral technologies to address problems far beyond the scope of traditional numerical methods. Computational science typically unifies three distinct elements: • Modeling, Algorithms and Simulations (e.g. numerical and non-numerical, discrete and continuous); • Software developed to solve science (e.g., biological, physical, and social), engineering, medicine, and humanities problems; • Computer and information science that develops and optimizes the advanced system hardware, software, networking, and data management components (e.g. problem solving environments).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信