Zamna:用于安全可靠地存储、共享和使用数据科学应用中的大型数据集的工具

D. Carrizales-Espinoza, J. L. González-Compeán, M. Morales-Sandoval
{"title":"Zamna:用于安全可靠地存储、共享和使用数据科学应用中的大型数据集的工具","authors":"D. Carrizales-Espinoza, J. L. González-Compeán, M. Morales-Sandoval","doi":"10.1109/ENC56672.2022.9882938","DOIUrl":null,"url":null,"abstract":"The exponential data growth (volume), continuously produced from different sources (variety) such as sensors, endusers, and IoT devices, has motivated the creation of efficient data science applications to obtain useful information (value and veracity) in critical decision-making processes (velocity). In health scenarios, the building of data science systems not only must include efficient modules for the acquisition, processing, and visualization of the data, but also transversal layers are required for the management, delivery and storage of data in a secure, reliable, and efficient manner in accordance with different norms or regulations for the handling of sensitive data. This is even more complex when the data science systems are distributed in environments of multiple infrastructures or in current and future computing models (for example, edge, fog, cloud or any combination of them). This article presents Zamna, a computational tool that allows end-users to support data science services by providing the resources for the secure and reliable data management, delivery and storage. Zamna manages the exchange and preparation of data in automatic and transparent manners, from acquisition until consumption by decision makers, which is a fundamental part in practically any data science application. Through its services, Zamna allows the automatic and transparent meeting of standards and regulations of data management, guaranteeing privacy, confidentiality, integrity, and availability of content, as well as tolerance to service failures and traceability. In a case study in the e-health domain, Zamna, allows achieving 70% compliance with international standards for the secure handling of sensitive data at the time that exhibited high performance due to the parallel patterns used in its construction.","PeriodicalId":145622,"journal":{"name":"2022 IEEE Mexican International Conference on Computer Science (ENC)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Zamna: a tool for the secure and reliable storage, sharing, and usage of large data sets in data science applications\",\"authors\":\"D. Carrizales-Espinoza, J. L. González-Compeán, M. Morales-Sandoval\",\"doi\":\"10.1109/ENC56672.2022.9882938\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The exponential data growth (volume), continuously produced from different sources (variety) such as sensors, endusers, and IoT devices, has motivated the creation of efficient data science applications to obtain useful information (value and veracity) in critical decision-making processes (velocity). In health scenarios, the building of data science systems not only must include efficient modules for the acquisition, processing, and visualization of the data, but also transversal layers are required for the management, delivery and storage of data in a secure, reliable, and efficient manner in accordance with different norms or regulations for the handling of sensitive data. This is even more complex when the data science systems are distributed in environments of multiple infrastructures or in current and future computing models (for example, edge, fog, cloud or any combination of them). This article presents Zamna, a computational tool that allows end-users to support data science services by providing the resources for the secure and reliable data management, delivery and storage. Zamna manages the exchange and preparation of data in automatic and transparent manners, from acquisition until consumption by decision makers, which is a fundamental part in practically any data science application. Through its services, Zamna allows the automatic and transparent meeting of standards and regulations of data management, guaranteeing privacy, confidentiality, integrity, and availability of content, as well as tolerance to service failures and traceability. In a case study in the e-health domain, Zamna, allows achieving 70% compliance with international standards for the secure handling of sensitive data at the time that exhibited high performance due to the parallel patterns used in its construction.\",\"PeriodicalId\":145622,\"journal\":{\"name\":\"2022 IEEE Mexican International Conference on Computer Science (ENC)\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE Mexican International Conference on Computer Science (ENC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ENC56672.2022.9882938\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Mexican International Conference on Computer Science (ENC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ENC56672.2022.9882938","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

从传感器、终端用户和物联网设备等不同来源(种类)不断产生的指数级数据增长(数量)推动了高效数据科学应用程序的创建,以在关键决策过程(速度)中获得有用的信息(价值和准确性)。在健康场景中,数据科学系统的构建不仅需要包含高效的数据采集、处理和可视化模块,还需要横向层,按照不同的规范或规定,以安全、可靠、高效的方式管理、传递和存储数据,以处理敏感数据。当数据科学系统分布在多个基础设施的环境中,或者在当前和未来的计算模型中(例如,边缘、雾、云或它们的任何组合),情况就更加复杂了。本文介绍了Zamna,这是一个计算工具,它允许最终用户通过提供安全可靠的数据管理、交付和存储资源来支持数据科学服务。Zamna以自动和透明的方式管理数据的交换和准备,从获取到决策者的消费,这几乎是任何数据科学应用程序的基本部分。通过其服务,Zamna允许自动和透明地满足数据管理的标准和法规,保证内容的隐私性,保密性,完整性和可用性,以及对服务故障的容忍度和可追溯性。在电子医疗领域的案例研究中,Zamna允许在安全处理敏感数据时达到70%的国际标准,由于在其构建中使用了并行模式,因此表现出高性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Zamna: a tool for the secure and reliable storage, sharing, and usage of large data sets in data science applications
The exponential data growth (volume), continuously produced from different sources (variety) such as sensors, endusers, and IoT devices, has motivated the creation of efficient data science applications to obtain useful information (value and veracity) in critical decision-making processes (velocity). In health scenarios, the building of data science systems not only must include efficient modules for the acquisition, processing, and visualization of the data, but also transversal layers are required for the management, delivery and storage of data in a secure, reliable, and efficient manner in accordance with different norms or regulations for the handling of sensitive data. This is even more complex when the data science systems are distributed in environments of multiple infrastructures or in current and future computing models (for example, edge, fog, cloud or any combination of them). This article presents Zamna, a computational tool that allows end-users to support data science services by providing the resources for the secure and reliable data management, delivery and storage. Zamna manages the exchange and preparation of data in automatic and transparent manners, from acquisition until consumption by decision makers, which is a fundamental part in practically any data science application. Through its services, Zamna allows the automatic and transparent meeting of standards and regulations of data management, guaranteeing privacy, confidentiality, integrity, and availability of content, as well as tolerance to service failures and traceability. In a case study in the e-health domain, Zamna, allows achieving 70% compliance with international standards for the secure handling of sensitive data at the time that exhibited high performance due to the parallel patterns used in its construction.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信