一种改进的鲸鱼优化算法用于增强机器学习中的特征选择过程

Ezaz Uddin Syed, M. Masood, M. Fouad, I. Glesk
{"title":"一种改进的鲸鱼优化算法用于增强机器学习中的特征选择过程","authors":"Ezaz Uddin Syed, M. Masood, M. Fouad, I. Glesk","doi":"10.1109/TELFOR52709.2021.9653166","DOIUrl":null,"url":null,"abstract":"In recent years, when there is an abundance of large datasets in various fields, the importance of feature selection problem has become critical for researchers. The real world applications rely on large datasets, which implies that datasets have hundreds of instances and attributes. Finding a better way of optimum feature selection could significantly improve the machine learning predictions. Recently, metaheuristics have gained momentous popularity for solving feature selection problem. Whale Optimization Algorithm has gained significant attention by the researcher community searching to solve the feature selection problem. However, the exploration problem in whale optimization algorithm still exists and remains to be researched as various parameters within the whale algorithm have been ignored. This paper proposes a new and improved version of the whale algorithm entitled Modified Whale Optimization Algorithm (MWOA) that hybrid with the machine learning models such as logistic regression, decision tree, random forest, K-nearest neighbor, support vector machine, naïve Bayes model. To test this new approach and the performance, the breast cancer dataset was used for MWOA evaluation. The test results revealed the superiority of this model when compared to the results obtained by machine learning models.","PeriodicalId":330449,"journal":{"name":"2021 29th Telecommunications Forum (TELFOR)","volume":"253 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Modified Whale Optimization Algorithm for Enhancing the Features Selection Process in Machine Learning\",\"authors\":\"Ezaz Uddin Syed, M. Masood, M. Fouad, I. Glesk\",\"doi\":\"10.1109/TELFOR52709.2021.9653166\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, when there is an abundance of large datasets in various fields, the importance of feature selection problem has become critical for researchers. The real world applications rely on large datasets, which implies that datasets have hundreds of instances and attributes. Finding a better way of optimum feature selection could significantly improve the machine learning predictions. Recently, metaheuristics have gained momentous popularity for solving feature selection problem. Whale Optimization Algorithm has gained significant attention by the researcher community searching to solve the feature selection problem. However, the exploration problem in whale optimization algorithm still exists and remains to be researched as various parameters within the whale algorithm have been ignored. This paper proposes a new and improved version of the whale algorithm entitled Modified Whale Optimization Algorithm (MWOA) that hybrid with the machine learning models such as logistic regression, decision tree, random forest, K-nearest neighbor, support vector machine, naïve Bayes model. To test this new approach and the performance, the breast cancer dataset was used for MWOA evaluation. The test results revealed the superiority of this model when compared to the results obtained by machine learning models.\",\"PeriodicalId\":330449,\"journal\":{\"name\":\"2021 29th Telecommunications Forum (TELFOR)\",\"volume\":\"253 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 29th Telecommunications Forum (TELFOR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TELFOR52709.2021.9653166\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 29th Telecommunications Forum (TELFOR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TELFOR52709.2021.9653166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

近年来,随着各领域大数据集的大量涌现,特征选择问题的重要性日益受到研究人员的重视。现实世界的应用程序依赖于大型数据集,这意味着数据集有数百个实例和属性。找到一种更好的最优特征选择方法可以显著提高机器学习的预测能力。近年来,元启发式算法在解决特征选择问题方面得到了广泛的应用。鲸鱼优化算法(Whale Optimization Algorithm)是一种解决特征选择问题的算法,受到了研究界的广泛关注。然而,由于忽略了鲸鱼算法中的各种参数,鲸鱼优化算法中的探索问题仍然存在,有待研究。本文提出了一种新的改进版本的鲸鱼算法,称为修正鲸鱼优化算法(Modified whale Optimization algorithm, MWOA),该算法与逻辑回归、决策树、随机森林、k近邻、支持向量机、naïve贝叶斯模型等机器学习模型相结合。为了测试这种新方法及其性能,使用乳腺癌数据集进行MWOA评估。与机器学习模型获得的结果相比,测试结果显示了该模型的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Modified Whale Optimization Algorithm for Enhancing the Features Selection Process in Machine Learning
In recent years, when there is an abundance of large datasets in various fields, the importance of feature selection problem has become critical for researchers. The real world applications rely on large datasets, which implies that datasets have hundreds of instances and attributes. Finding a better way of optimum feature selection could significantly improve the machine learning predictions. Recently, metaheuristics have gained momentous popularity for solving feature selection problem. Whale Optimization Algorithm has gained significant attention by the researcher community searching to solve the feature selection problem. However, the exploration problem in whale optimization algorithm still exists and remains to be researched as various parameters within the whale algorithm have been ignored. This paper proposes a new and improved version of the whale algorithm entitled Modified Whale Optimization Algorithm (MWOA) that hybrid with the machine learning models such as logistic regression, decision tree, random forest, K-nearest neighbor, support vector machine, naïve Bayes model. To test this new approach and the performance, the breast cancer dataset was used for MWOA evaluation. The test results revealed the superiority of this model when compared to the results obtained by machine learning models.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信