Self-Sovereign and Secure Data Sharing Through Docker Containers for Machine Learning on Remote Node

IF 1 4区计算机科学 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Journal of Web Engineering Pub Date : 2024-07-01 DOI:10.13052/jwe1540-9589.2352

Jungchul Seo;Younggyo Lee;Young Yoon

{"title":"Self-Sovereign and Secure Data Sharing Through Docker Containers for Machine Learning on Remote Node","authors":"Jungchul Seo;Younggyo Lee;Young Yoon","doi":"10.13052/jwe1540-9589.2352","DOIUrl":null,"url":null,"abstract":"Collecting personal data from various sources and using it for machine learning (ML) is prevalent. However, there are increasing concerns about the monopolization and potential breach of private data by greedy and malicious organizations. Interest in Web 3.0 systems is on the rise as an alternative. These systems aim to guarantee the self-sovereignty of personal data in a decentralized setting. Users can share data with others directly for fair compensation. Nevertheless, malicious remote users can still violate the integrity and confidentiality of personal data. Therefore, this paper proposes a novel method of preventing unwanted leakage and counterfeiting of the private data lent on the premise of remote users. This paper focuses on the decentralized nature of Web 3.0 to leverage existing personal storage so that the burden of collecting secure data is relieved. Data owners create a lightweight Docker container to encapsulate their private data sources. The data owners generate another container to be deployed on a remote premise for taking and executing any ML algorithms remote users create. Between the containers forming a distributed trusted execution environment (TEE), data are read through a secure channel. Since the TEE is strictly controlled by the data owner, no malicious ML application can leak or breach the private information. This paper explains the engineering details of how this new method is realized.","PeriodicalId":49952,"journal":{"name":"Journal of Web Engineering","volume":"23 5","pages":"637-655"},"PeriodicalIF":1.0000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10654693","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Web Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10654693/","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Collecting personal data from various sources and using it for machine learning (ML) is prevalent. However, there are increasing concerns about the monopolization and potential breach of private data by greedy and malicious organizations. Interest in Web 3.0 systems is on the rise as an alternative. These systems aim to guarantee the self-sovereignty of personal data in a decentralized setting. Users can share data with others directly for fair compensation. Nevertheless, malicious remote users can still violate the integrity and confidentiality of personal data. Therefore, this paper proposes a novel method of preventing unwanted leakage and counterfeiting of the private data lent on the premise of remote users. This paper focuses on the decentralized nature of Web 3.0 to leverage existing personal storage so that the burden of collecting secure data is relieved. Data owners create a lightweight Docker container to encapsulate their private data sources. The data owners generate another container to be deployed on a remote premise for taking and executing any ML algorithms remote users create. Between the containers forming a distributed trusted execution environment (TEE), data are read through a secure channel. Since the TEE is strictly controlled by the data owner, no malicious ML application can leak or breach the private information. This paper explains the engineering details of how this new method is realized.

查看原文本刊更多论文

通过 Docker 容器在远程节点上进行机器学习的自主安全数据共享

从各种来源收集个人数据并将其用于机器学习（ML）的做法十分普遍。然而，人们越来越担心私人数据会被贪婪和恶意的组织垄断并可能遭到侵犯。作为一种替代方案，人们对 Web 3.0 系统的兴趣与日俱增。这些系统旨在保证个人数据在分散环境中的自我主权。用户可以直接与他人共享数据，并获得公平的补偿。然而，恶意的远程用户仍有可能破坏个人数据的完整性和保密性。因此，本文提出了一种新颖的方法，可在远程用户的前提下防止私人数据的意外泄漏和伪造。本文重点关注 Web 3.0 的去中心化特性，利用现有的个人存储，从而减轻收集安全数据的负担。数据所有者创建一个轻量级 Docker 容器来封装他们的私人数据源。数据所有者生成另一个容器，部署在远程前提下，用于接收和执行远程用户创建的任何 ML 算法。容器之间形成一个分布式可信执行环境（TEE），通过安全通道读取数据。由于 TEE 由数据所有者严格控制，因此任何恶意 ML 应用程序都无法泄露或破坏私人信息。本文解释了如何实现这种新方法的工程细节。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Web Engineering 工程技术-计算机：理论方法

CiteScore

1.80

自引率

12.50%

发文量

审稿时长

9 months

期刊介绍： The World Wide Web and its associated technologies have become a major implementation and delivery platform for a large variety of applications, ranging from simple institutional information Web sites to sophisticated supply-chain management systems, financial applications, e-government, distance learning, and entertainment, among others. Such applications, in addition to their intrinsic functionality, also exhibit the more complex behavior of distributed applications.