Liwei Liu , Muhammad Ajmal Azad , Harjinder Lallie , Hany Atlam
{"title":"识别:使用设备指纹和机器学习的智能设备识别","authors":"Liwei Liu , Muhammad Ajmal Azad , Harjinder Lallie , Hany Atlam","doi":"10.1016/j.pmcj.2025.102103","DOIUrl":null,"url":null,"abstract":"<div><div>The Internet of Things (IoT) consists of a rapidly growing network of heterogeneous devices that autonomously monitor, collect, and exchange data across a wide range of application domains. The rapid increase of IoT devices highlighted the importance of scalable, secure, and adaptive network management strategies for dynamic networks. A key challenge in this context is the automatic identification of devices, which is critical for detecting and mitigating malicious devices that can compromise network integrity. Accurate device identification strengthens the security of dynamic IoT environments by facilitating early detection of anomalous or adversarial traffic. Device fingerprinting offers a non-intrusive solution by leveraging protocol and traffic characteristics, without relying on vendor-specific identifiers. In this work, we propose a lightweight and efficient framework for IoT device identification based on machine learning. Our model utilises a Random Forest classifier in conjunction with a data-driven feature selection strategy that emphasises low-overhead features derived from packet headers and traffic flow statistics. The proposed approach achieves high classification performance, attaining 97.32% accuracy in identifying general device categories and 94.39% accuracy for specific device types. It also demonstrates approximately a 40% improvement in computational efficiency compared to traditional classifiers, making it well-suited for deployment in resource-constrained edge environments. We evaluate the model under various real-world conditions, including spatiotemporal traffic variations, changes in operational modes, and different sampling intervals. Comparative experiments with established classifiers—such as J48, SMO, BayesNet, and Naive Bayes—are performed using standard metrics, including precision, recall, F1-score, and inference latency. Our approach strengthens network security by automatically identifying and classifying IoT devices in dynamic, heterogeneous environments. It is lightweight, scalable, and well-suited for deployment in resource-constrained IoT scenarios.</div></div>","PeriodicalId":49005,"journal":{"name":"Pervasive and Mobile Computing","volume":"114 ","pages":"Article 102103"},"PeriodicalIF":3.5000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"IDENTIFY: Intelligent device identification using device fingerprints and machine learning\",\"authors\":\"Liwei Liu , Muhammad Ajmal Azad , Harjinder Lallie , Hany Atlam\",\"doi\":\"10.1016/j.pmcj.2025.102103\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The Internet of Things (IoT) consists of a rapidly growing network of heterogeneous devices that autonomously monitor, collect, and exchange data across a wide range of application domains. The rapid increase of IoT devices highlighted the importance of scalable, secure, and adaptive network management strategies for dynamic networks. A key challenge in this context is the automatic identification of devices, which is critical for detecting and mitigating malicious devices that can compromise network integrity. Accurate device identification strengthens the security of dynamic IoT environments by facilitating early detection of anomalous or adversarial traffic. Device fingerprinting offers a non-intrusive solution by leveraging protocol and traffic characteristics, without relying on vendor-specific identifiers. In this work, we propose a lightweight and efficient framework for IoT device identification based on machine learning. Our model utilises a Random Forest classifier in conjunction with a data-driven feature selection strategy that emphasises low-overhead features derived from packet headers and traffic flow statistics. The proposed approach achieves high classification performance, attaining 97.32% accuracy in identifying general device categories and 94.39% accuracy for specific device types. It also demonstrates approximately a 40% improvement in computational efficiency compared to traditional classifiers, making it well-suited for deployment in resource-constrained edge environments. We evaluate the model under various real-world conditions, including spatiotemporal traffic variations, changes in operational modes, and different sampling intervals. Comparative experiments with established classifiers—such as J48, SMO, BayesNet, and Naive Bayes—are performed using standard metrics, including precision, recall, F1-score, and inference latency. Our approach strengthens network security by automatically identifying and classifying IoT devices in dynamic, heterogeneous environments. It is lightweight, scalable, and well-suited for deployment in resource-constrained IoT scenarios.</div></div>\",\"PeriodicalId\":49005,\"journal\":{\"name\":\"Pervasive and Mobile Computing\",\"volume\":\"114 \",\"pages\":\"Article 102103\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pervasive and Mobile Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574119225000926\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pervasive and Mobile Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574119225000926","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
IDENTIFY: Intelligent device identification using device fingerprints and machine learning
The Internet of Things (IoT) consists of a rapidly growing network of heterogeneous devices that autonomously monitor, collect, and exchange data across a wide range of application domains. The rapid increase of IoT devices highlighted the importance of scalable, secure, and adaptive network management strategies for dynamic networks. A key challenge in this context is the automatic identification of devices, which is critical for detecting and mitigating malicious devices that can compromise network integrity. Accurate device identification strengthens the security of dynamic IoT environments by facilitating early detection of anomalous or adversarial traffic. Device fingerprinting offers a non-intrusive solution by leveraging protocol and traffic characteristics, without relying on vendor-specific identifiers. In this work, we propose a lightweight and efficient framework for IoT device identification based on machine learning. Our model utilises a Random Forest classifier in conjunction with a data-driven feature selection strategy that emphasises low-overhead features derived from packet headers and traffic flow statistics. The proposed approach achieves high classification performance, attaining 97.32% accuracy in identifying general device categories and 94.39% accuracy for specific device types. It also demonstrates approximately a 40% improvement in computational efficiency compared to traditional classifiers, making it well-suited for deployment in resource-constrained edge environments. We evaluate the model under various real-world conditions, including spatiotemporal traffic variations, changes in operational modes, and different sampling intervals. Comparative experiments with established classifiers—such as J48, SMO, BayesNet, and Naive Bayes—are performed using standard metrics, including precision, recall, F1-score, and inference latency. Our approach strengthens network security by automatically identifying and classifying IoT devices in dynamic, heterogeneous environments. It is lightweight, scalable, and well-suited for deployment in resource-constrained IoT scenarios.
期刊介绍:
As envisioned by Mark Weiser as early as 1991, pervasive computing systems and services have truly become integral parts of our daily lives. Tremendous developments in a multitude of technologies ranging from personalized and embedded smart devices (e.g., smartphones, sensors, wearables, IoTs, etc.) to ubiquitous connectivity, via a variety of wireless mobile communications and cognitive networking infrastructures, to advanced computing techniques (including edge, fog and cloud) and user-friendly middleware services and platforms have significantly contributed to the unprecedented advances in pervasive and mobile computing. Cutting-edge applications and paradigms have evolved, such as cyber-physical systems and smart environments (e.g., smart city, smart energy, smart transportation, smart healthcare, etc.) that also involve human in the loop through social interactions and participatory and/or mobile crowd sensing, for example. The goal of pervasive computing systems is to improve human experience and quality of life, without explicit awareness of the underlying communications and computing technologies.
The Pervasive and Mobile Computing Journal (PMC) is a high-impact, peer-reviewed technical journal that publishes high-quality scientific articles spanning theory and practice, and covering all aspects of pervasive and mobile computing and systems.