{"title":"基于模糊粗糙集熵的肿瘤耐药mirna识别集成","authors":"Joginder Singh , Shubhra Sankar Ray","doi":"10.1016/j.jocs.2025.102673","DOIUrl":null,"url":null,"abstract":"<div><div>MicroRNAs (miRNAs) are key biomarkers in cancer diagnosis and treatment. Identification of drug-resistant miRNAs may help in effective treatment of cancer. Two new z score based fuzzy rough relevance and redundancy entropies are developed and then a weighted framework is introduced to integrate the entropies for ranking and selecting miRNAs in classifying control and drug resistant patients. Here, two key components of soft computing, fuzzy set and rough set are utilized. The methodology is called a weighted framework for integrating fuzzy rough set-based relevance and redundancy entropies (WFIFRRRE). The z score is used to compute the fuzzy membership of expression values required for both entropies. Fuzziness deals with the overlapping nature of miRNA expression profiles and rough set helps in determining the exact class size. The weights in WFIFRRRE, assigned to relevance and redundancy entropies, are determined in a supervised manner to maximize the <span><math><mi>F</mi></math></span> score used for validating the classification performance in discriminating the control and drug-resistant patients. The weights are varied from 0 to 1 in steps of 0.01 which enables an integration between relevance and redundancy entropies. A subset of miRNAs is selected from the ranked list and the performance is evaluated using three benchmark classifiers on eight drug-resistant cancer datasets. Experimental results show that WFIFRRRE provides better prediction accuracy than the popular methods used for comparison. The classification accuracy in terms of <span><math><mi>F</mi></math></span> score, achieved by WFIFRRRE, ranges from 0.74 to 1.0, 0.75 to 1.0, and 0.73 to 1.0 using random forest, Naive Bayes, and linear SVM classifiers, respectively. The resultant set of miRNAs obtained using WFIFRRRE is also verified with the help of existing biological studies. The source code of WFIFRRRE is available at <span><span>https://www.isical.ac.in/ shubhra/WFIFRRRE.html</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"91 ","pages":"Article 102673"},"PeriodicalIF":3.7000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrating fuzzy rough set-based entropies for identifying drug-resistant miRNAs in cancer\",\"authors\":\"Joginder Singh , Shubhra Sankar Ray\",\"doi\":\"10.1016/j.jocs.2025.102673\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>MicroRNAs (miRNAs) are key biomarkers in cancer diagnosis and treatment. Identification of drug-resistant miRNAs may help in effective treatment of cancer. Two new z score based fuzzy rough relevance and redundancy entropies are developed and then a weighted framework is introduced to integrate the entropies for ranking and selecting miRNAs in classifying control and drug resistant patients. Here, two key components of soft computing, fuzzy set and rough set are utilized. The methodology is called a weighted framework for integrating fuzzy rough set-based relevance and redundancy entropies (WFIFRRRE). The z score is used to compute the fuzzy membership of expression values required for both entropies. Fuzziness deals with the overlapping nature of miRNA expression profiles and rough set helps in determining the exact class size. The weights in WFIFRRRE, assigned to relevance and redundancy entropies, are determined in a supervised manner to maximize the <span><math><mi>F</mi></math></span> score used for validating the classification performance in discriminating the control and drug-resistant patients. The weights are varied from 0 to 1 in steps of 0.01 which enables an integration between relevance and redundancy entropies. A subset of miRNAs is selected from the ranked list and the performance is evaluated using three benchmark classifiers on eight drug-resistant cancer datasets. Experimental results show that WFIFRRRE provides better prediction accuracy than the popular methods used for comparison. The classification accuracy in terms of <span><math><mi>F</mi></math></span> score, achieved by WFIFRRRE, ranges from 0.74 to 1.0, 0.75 to 1.0, and 0.73 to 1.0 using random forest, Naive Bayes, and linear SVM classifiers, respectively. The resultant set of miRNAs obtained using WFIFRRRE is also verified with the help of existing biological studies. The source code of WFIFRRRE is available at <span><span>https://www.isical.ac.in/ shubhra/WFIFRRRE.html</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":48907,\"journal\":{\"name\":\"Journal of Computational Science\",\"volume\":\"91 \",\"pages\":\"Article 102673\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-07-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computational Science\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1877750325001504\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Science","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877750325001504","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Integrating fuzzy rough set-based entropies for identifying drug-resistant miRNAs in cancer
MicroRNAs (miRNAs) are key biomarkers in cancer diagnosis and treatment. Identification of drug-resistant miRNAs may help in effective treatment of cancer. Two new z score based fuzzy rough relevance and redundancy entropies are developed and then a weighted framework is introduced to integrate the entropies for ranking and selecting miRNAs in classifying control and drug resistant patients. Here, two key components of soft computing, fuzzy set and rough set are utilized. The methodology is called a weighted framework for integrating fuzzy rough set-based relevance and redundancy entropies (WFIFRRRE). The z score is used to compute the fuzzy membership of expression values required for both entropies. Fuzziness deals with the overlapping nature of miRNA expression profiles and rough set helps in determining the exact class size. The weights in WFIFRRRE, assigned to relevance and redundancy entropies, are determined in a supervised manner to maximize the score used for validating the classification performance in discriminating the control and drug-resistant patients. The weights are varied from 0 to 1 in steps of 0.01 which enables an integration between relevance and redundancy entropies. A subset of miRNAs is selected from the ranked list and the performance is evaluated using three benchmark classifiers on eight drug-resistant cancer datasets. Experimental results show that WFIFRRRE provides better prediction accuracy than the popular methods used for comparison. The classification accuracy in terms of score, achieved by WFIFRRRE, ranges from 0.74 to 1.0, 0.75 to 1.0, and 0.73 to 1.0 using random forest, Naive Bayes, and linear SVM classifiers, respectively. The resultant set of miRNAs obtained using WFIFRRRE is also verified with the help of existing biological studies. The source code of WFIFRRRE is available at https://www.isical.ac.in/ shubhra/WFIFRRRE.html.
期刊介绍:
Computational Science is a rapidly growing multi- and interdisciplinary field that uses advanced computing and data analysis to understand and solve complex problems. It has reached a level of predictive capability that now firmly complements the traditional pillars of experimentation and theory.
The recent advances in experimental techniques such as detectors, on-line sensor networks and high-resolution imaging techniques, have opened up new windows into physical and biological processes at many levels of detail. The resulting data explosion allows for detailed data driven modeling and simulation.
This new discipline in science combines computational thinking, modern computational methods, devices and collateral technologies to address problems far beyond the scope of traditional numerical methods.
Computational science typically unifies three distinct elements:
• Modeling, Algorithms and Simulations (e.g. numerical and non-numerical, discrete and continuous);
• Software developed to solve science (e.g., biological, physical, and social), engineering, medicine, and humanities problems;
• Computer and information science that develops and optimizes the advanced system hardware, software, networking, and data management components (e.g. problem solving environments).