Zhen Lei , Yijia Li , Zhen Li , Xin Huang , Dan Yu , Nian Xue , Yongle Chen
{"title":"一个细粒度框架,用于通过版本演变分析在线物联网设备固件识别","authors":"Zhen Lei , Yijia Li , Zhen Li , Xin Huang , Dan Yu , Nian Xue , Yongle Chen","doi":"10.1016/j.iot.2025.101767","DOIUrl":null,"url":null,"abstract":"<div><div>The rapid expansion of IoT networks has outpaced the capabilities of firmware management protocols, leaving numerous Internet-connected devices operating on outdated firmware that contains exploitable vulnerabilities. As vulnerabilities are closely tied to specific firmware versions, fine-grained version identification is critical for effective device management and security risk assessment. However, high firmware heterogeneity and subjective biases in feature selection pose significant challenges to online firmware version identification (OFVI) of IoT devices. To address these challenges, we first construct a dataset comprising 444,195 embedded web pages extracted from 1,000 successfully simulated firmware images. Through analyzing update patterns of embedded web interfaces during firmware version evolution, we propose <em>FirmID</em>, a novel OFVI framework for IoT devices that utilizes directory and content changes in embedded web interfaces. To handle the heterogeneity of firmware across different vendors, we introduce the Hierarchical Multimodal Attention Network (HMANet), a machine learning model specifically designed to capture differences across structural, textual, and functional modalities. To overcome the challenge of distinguishing hard samples caused by the frequent reuse of web pages in firmware iteration versions, we design a Hard Negative Mining Contrastive Loss that enhances intra-class compactness and inter-class separability. Moreover, to improve identification efficiency under uncertain network conditions, FirmID incorporates a complementary heuristic search algorithm, Firmware Identification with Monte Carlo Tree Search (FIMCTS). Experimental results demonstrate that FirmID surpasses state-of-the-art methods by 30.2% in accuracy and reduces file requests by 23.3% in recognition efficiency.</div></div>","PeriodicalId":29968,"journal":{"name":"Internet of Things","volume":"34 ","pages":"Article 101767"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A fine-grained framework for online IoT device firmware identification via version evolution analysis\",\"authors\":\"Zhen Lei , Yijia Li , Zhen Li , Xin Huang , Dan Yu , Nian Xue , Yongle Chen\",\"doi\":\"10.1016/j.iot.2025.101767\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The rapid expansion of IoT networks has outpaced the capabilities of firmware management protocols, leaving numerous Internet-connected devices operating on outdated firmware that contains exploitable vulnerabilities. As vulnerabilities are closely tied to specific firmware versions, fine-grained version identification is critical for effective device management and security risk assessment. However, high firmware heterogeneity and subjective biases in feature selection pose significant challenges to online firmware version identification (OFVI) of IoT devices. To address these challenges, we first construct a dataset comprising 444,195 embedded web pages extracted from 1,000 successfully simulated firmware images. Through analyzing update patterns of embedded web interfaces during firmware version evolution, we propose <em>FirmID</em>, a novel OFVI framework for IoT devices that utilizes directory and content changes in embedded web interfaces. To handle the heterogeneity of firmware across different vendors, we introduce the Hierarchical Multimodal Attention Network (HMANet), a machine learning model specifically designed to capture differences across structural, textual, and functional modalities. To overcome the challenge of distinguishing hard samples caused by the frequent reuse of web pages in firmware iteration versions, we design a Hard Negative Mining Contrastive Loss that enhances intra-class compactness and inter-class separability. Moreover, to improve identification efficiency under uncertain network conditions, FirmID incorporates a complementary heuristic search algorithm, Firmware Identification with Monte Carlo Tree Search (FIMCTS). Experimental results demonstrate that FirmID surpasses state-of-the-art methods by 30.2% in accuracy and reduces file requests by 23.3% in recognition efficiency.</div></div>\",\"PeriodicalId\":29968,\"journal\":{\"name\":\"Internet of Things\",\"volume\":\"34 \",\"pages\":\"Article 101767\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Internet of Things\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2542660525002811\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet of Things","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2542660525002811","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
A fine-grained framework for online IoT device firmware identification via version evolution analysis
The rapid expansion of IoT networks has outpaced the capabilities of firmware management protocols, leaving numerous Internet-connected devices operating on outdated firmware that contains exploitable vulnerabilities. As vulnerabilities are closely tied to specific firmware versions, fine-grained version identification is critical for effective device management and security risk assessment. However, high firmware heterogeneity and subjective biases in feature selection pose significant challenges to online firmware version identification (OFVI) of IoT devices. To address these challenges, we first construct a dataset comprising 444,195 embedded web pages extracted from 1,000 successfully simulated firmware images. Through analyzing update patterns of embedded web interfaces during firmware version evolution, we propose FirmID, a novel OFVI framework for IoT devices that utilizes directory and content changes in embedded web interfaces. To handle the heterogeneity of firmware across different vendors, we introduce the Hierarchical Multimodal Attention Network (HMANet), a machine learning model specifically designed to capture differences across structural, textual, and functional modalities. To overcome the challenge of distinguishing hard samples caused by the frequent reuse of web pages in firmware iteration versions, we design a Hard Negative Mining Contrastive Loss that enhances intra-class compactness and inter-class separability. Moreover, to improve identification efficiency under uncertain network conditions, FirmID incorporates a complementary heuristic search algorithm, Firmware Identification with Monte Carlo Tree Search (FIMCTS). Experimental results demonstrate that FirmID surpasses state-of-the-art methods by 30.2% in accuracy and reduces file requests by 23.3% in recognition efficiency.
期刊介绍:
Internet of Things; Engineering Cyber Physical Human Systems is a comprehensive journal encouraging cross collaboration between researchers, engineers and practitioners in the field of IoT & Cyber Physical Human Systems. The journal offers a unique platform to exchange scientific information on the entire breadth of technology, science, and societal applications of the IoT.
The journal will place a high priority on timely publication, and provide a home for high quality.
Furthermore, IOT is interested in publishing topical Special Issues on any aspect of IOT.