A Novel Classification of Uncertain Stream Data using Ant Colony Optimization Based on Radial Basis Function

Tahsin Ali Mohammed Amin, S. Mahmood, Rebar Dara Mohammed, P. J. Karim
{"title":"A Novel Classification of Uncertain Stream Data using Ant Colony Optimization Based on Radial Basis Function","authors":"Tahsin Ali Mohammed Amin, S. Mahmood, Rebar Dara Mohammed, P. J. Karim","doi":"10.24017/science.2022.2.5","DOIUrl":null,"url":null,"abstract":"There are many potential sources of data uncertainty, such as imperfect measurement or sampling, intrusive environmental monitoring, unreliable sensor networks, and inaccurate medical diagnoses. To avoid unintended results, data mining from new applications like sensors and location-based services needs to be done with care. When attempting to classify data with a high degree of uncertainty, many researchers have turned to heuristic approaches and machine learning (ML) methods. We propose an entirely new ML method in this paper by fusing the Radial Basis Function (RBF) network based on ant colony optimization (ACO). After introducing a large amount of uncertainty into a dataset, we normalize the data and finish training on clean data. The ant colony optimization algorithm is then used to train a recurrent neural network. Finally, we evaluate our proposed method against some of the most popular ML methods, including a k-nearest neighbor, support vector machine, random forest, decision tree, logistic regression, and extreme gradient boosting (Xgboost). Error metrics show that our model significantly outperforms the gold standard and other popular ML methods. Using industry-standard performance metrics, the results of our experiments show that our proposed method does a better job of classifying uncertain data than other methods","PeriodicalId":17866,"journal":{"name":"Kurdistan Journal of Applied Research","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Kurdistan Journal of Applied Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24017/science.2022.2.5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

There are many potential sources of data uncertainty, such as imperfect measurement or sampling, intrusive environmental monitoring, unreliable sensor networks, and inaccurate medical diagnoses. To avoid unintended results, data mining from new applications like sensors and location-based services needs to be done with care. When attempting to classify data with a high degree of uncertainty, many researchers have turned to heuristic approaches and machine learning (ML) methods. We propose an entirely new ML method in this paper by fusing the Radial Basis Function (RBF) network based on ant colony optimization (ACO). After introducing a large amount of uncertainty into a dataset, we normalize the data and finish training on clean data. The ant colony optimization algorithm is then used to train a recurrent neural network. Finally, we evaluate our proposed method against some of the most popular ML methods, including a k-nearest neighbor, support vector machine, random forest, decision tree, logistic regression, and extreme gradient boosting (Xgboost). Error metrics show that our model significantly outperforms the gold standard and other popular ML methods. Using industry-standard performance metrics, the results of our experiments show that our proposed method does a better job of classifying uncertain data than other methods
基于径向基函数的蚁群算法在不确定流数据分类中的应用
数据不确定性有许多潜在来源,如不完善的测量或采样、侵入性的环境监测、不可靠的传感器网络和不准确的医疗诊断。为了避免意外的结果,需要谨慎地从传感器和基于位置的服务等新应用程序中进行数据挖掘。当试图对具有高度不确定性的数据进行分类时,许多研究人员转向了启发式方法和机器学习(ML)方法。在本文中,我们提出了一种全新的ML方法,通过融合基于蚁群优化(ACO)的径向基函数(RBF)网络。在将大量的不确定性引入数据集后,我们对数据进行归一化,并在干净的数据上完成训练。然后使用蚁群优化算法来训练递归神经网络。最后,我们将我们提出的方法与一些最流行的ML方法进行了比较,包括k近邻、支持向量机、随机森林、决策树、逻辑回归和极端梯度提升(Xgboost)。误差度量表明,我们的模型显著优于黄金标准和其他流行的ML方法。使用行业标准的性能指标,我们的实验结果表明,与其他方法相比,我们提出的方法在对不确定数据进行分类方面做得更好
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
16
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信