Suparna Kar, Kaif Ali Khan P, Ravi Surendra Nalawade, Vanga Aravind Shounik, Vikas Ravi Patil, Kotaro Kataoka
{"title":"HRL-D3: High resolution and lightweight defective data detection for IoT data integrity","authors":"Suparna Kar, Kaif Ali Khan P, Ravi Surendra Nalawade, Vanga Aravind Shounik, Vikas Ravi Patil, Kotaro Kataoka","doi":"10.1016/j.future.2025.108089","DOIUrl":null,"url":null,"abstract":"<div><div>Due to the limited storage capacity of Internet of Things (IoT) devices, the use of third-party cloud storage service is an integral part of IoT based systems. Ensuring data integrity in cloud storage services is paramount for maintaining the safety and trustworthiness of the data generated and consumed by IoT applications. While verifying data integrity through defective data detection, the number of False Positives and False Negatives should be fewer so that the resolution is higher. However, increasing the resolution also incurs an increase in metadata for integrity verification and results in higher storage overhead. This paper proposes High Resolution and Lightweight Defective Data Detection (HRL-D3) for IoT data integrity with a short verification time, low storage overhead and minimal computational cost. HRL-D3 introduces 1) the use of Merkle Hash Tree and the novel concept of Intermediate Hash for enabling faster Data Integrity Verification (DIV) and higher resolution, and 2) an Adaptive Data Chunking Algorithm for balancing the trade-off between resolution and storage overhead. Our security analysis examined the risks of potential attacks to HRL-D3, and outlined the prevention provided by the proposed solution as well as the mitigation through an operational workaround. A Proof of Concept implementation HRL-D3 was evaluated and demonstrated its effectiveness in balancing the trade off between the resolution and the storage overhead tradeoff as well as achieving low-latency DIV.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"175 ","pages":"Article 108089"},"PeriodicalIF":6.2000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25003838","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Due to the limited storage capacity of Internet of Things (IoT) devices, the use of third-party cloud storage service is an integral part of IoT based systems. Ensuring data integrity in cloud storage services is paramount for maintaining the safety and trustworthiness of the data generated and consumed by IoT applications. While verifying data integrity through defective data detection, the number of False Positives and False Negatives should be fewer so that the resolution is higher. However, increasing the resolution also incurs an increase in metadata for integrity verification and results in higher storage overhead. This paper proposes High Resolution and Lightweight Defective Data Detection (HRL-D3) for IoT data integrity with a short verification time, low storage overhead and minimal computational cost. HRL-D3 introduces 1) the use of Merkle Hash Tree and the novel concept of Intermediate Hash for enabling faster Data Integrity Verification (DIV) and higher resolution, and 2) an Adaptive Data Chunking Algorithm for balancing the trade-off between resolution and storage overhead. Our security analysis examined the risks of potential attacks to HRL-D3, and outlined the prevention provided by the proposed solution as well as the mitigation through an operational workaround. A Proof of Concept implementation HRL-D3 was evaluated and demonstrated its effectiveness in balancing the trade off between the resolution and the storage overhead tradeoff as well as achieving low-latency DIV.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.