物联网网络中自动数据标记和异常检测的混合学习方法

Rimsha Kanwal, Rimsha Kanwal, Umara Noor, Zahid Rashid
{"title":"物联网网络中自动数据标记和异常检测的混合学习方法","authors":"Rimsha Kanwal, Rimsha Kanwal, Umara Noor, Zahid Rashid","doi":"10.1109/ICAI58407.2023.10136687","DOIUrl":null,"url":null,"abstract":"Internet of Things (IoT) is an environment in which digital equipment is augmented with sensors to share and receive data through network. When devices share data it can be effected by anomalies or any attack due to corrupted data or by any other uncertainty and ambiguity in data. The data can also be corrupted through a damage in device. These attacks or anomalies damage the working of the IoT networks. The anomalous data can be detected through detection techniques however most anomaly detection techniques depend upon labelled data but for IoT datasets, usually class labels are not available. Labeling is performed by a manual process which is time consuming and also costly. As data in IoT increases day by day so there is a need to label and classify data for future unseen data. In this paper a hybrid algorithm is proposed in which both clustering and classification techniques are applied for automatic labeling and classifying on IoT dataset. The model contains two function. In the first phase k-means clustering is employed for labelling dataset instances as normal or anomalous. In the second phase labelled dataset is used to train Random Forest model to detect anomalies in IoT networks. The results show that the proposed model is detecting anomalies in IoT networks with an accuracy 98%, precision 98 %, recall 98%, and F-meausre 0.98%.","PeriodicalId":161809,"journal":{"name":"2023 3rd International Conference on Artificial Intelligence (ICAI)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Hybrid Learning Approach for Automatic Data Labelling and Anomaly Detection in IoT Networks\",\"authors\":\"Rimsha Kanwal, Rimsha Kanwal, Umara Noor, Zahid Rashid\",\"doi\":\"10.1109/ICAI58407.2023.10136687\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Internet of Things (IoT) is an environment in which digital equipment is augmented with sensors to share and receive data through network. When devices share data it can be effected by anomalies or any attack due to corrupted data or by any other uncertainty and ambiguity in data. The data can also be corrupted through a damage in device. These attacks or anomalies damage the working of the IoT networks. The anomalous data can be detected through detection techniques however most anomaly detection techniques depend upon labelled data but for IoT datasets, usually class labels are not available. Labeling is performed by a manual process which is time consuming and also costly. As data in IoT increases day by day so there is a need to label and classify data for future unseen data. In this paper a hybrid algorithm is proposed in which both clustering and classification techniques are applied for automatic labeling and classifying on IoT dataset. The model contains two function. In the first phase k-means clustering is employed for labelling dataset instances as normal or anomalous. In the second phase labelled dataset is used to train Random Forest model to detect anomalies in IoT networks. The results show that the proposed model is detecting anomalies in IoT networks with an accuracy 98%, precision 98 %, recall 98%, and F-meausre 0.98%.\",\"PeriodicalId\":161809,\"journal\":{\"name\":\"2023 3rd International Conference on Artificial Intelligence (ICAI)\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 3rd International Conference on Artificial Intelligence (ICAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAI58407.2023.10136687\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Artificial Intelligence (ICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAI58407.2023.10136687","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

物联网(IoT)是一种环境,在这种环境中,数字设备通过传感器增强,通过网络共享和接收数据。当设备共享数据时,由于数据损坏或数据中的任何其他不确定性和模糊性,可能会受到异常或任何攻击的影响。数据也可能因设备损坏而损坏。这些攻击或异常会破坏物联网网络的工作。异常数据可以通过检测技术检测到,但是大多数异常检测技术依赖于标记数据,但对于物联网数据集,通常不可用类标签。贴标签是由人工过程进行的,这是耗时且昂贵的。随着物联网中的数据日益增加,因此需要为未来看不见的数据标记和分类数据。本文提出了一种将聚类和分类技术相结合的物联网数据集自动标注和分类的混合算法。该模型包含两个功能。在第一阶段,使用k-means聚类将数据集实例标记为正常或异常。在第二阶段,使用标记数据集训练随机森林模型来检测物联网网络中的异常。结果表明,该模型检测物联网网络异常的准确率为98%,精密度为98%,召回率为98%,f - measure为0.98%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Hybrid Learning Approach for Automatic Data Labelling and Anomaly Detection in IoT Networks
Internet of Things (IoT) is an environment in which digital equipment is augmented with sensors to share and receive data through network. When devices share data it can be effected by anomalies or any attack due to corrupted data or by any other uncertainty and ambiguity in data. The data can also be corrupted through a damage in device. These attacks or anomalies damage the working of the IoT networks. The anomalous data can be detected through detection techniques however most anomaly detection techniques depend upon labelled data but for IoT datasets, usually class labels are not available. Labeling is performed by a manual process which is time consuming and also costly. As data in IoT increases day by day so there is a need to label and classify data for future unseen data. In this paper a hybrid algorithm is proposed in which both clustering and classification techniques are applied for automatic labeling and classifying on IoT dataset. The model contains two function. In the first phase k-means clustering is employed for labelling dataset instances as normal or anomalous. In the second phase labelled dataset is used to train Random Forest model to detect anomalies in IoT networks. The results show that the proposed model is detecting anomalies in IoT networks with an accuracy 98%, precision 98 %, recall 98%, and F-meausre 0.98%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信