Algorithms, methods and approaches to details and enrich data, including personal data

A. Malyavko, Vladimir V. Reutov, Igor V. Korotkikh, Vladimir K. Shperling
{"title":"Algorithms, methods and approaches to details and enrich data, including personal data","authors":"A. Malyavko, Vladimir V. Reutov, Igor V. Korotkikh, Vladimir K. Shperling","doi":"10.17212/2782-2230-2022-4-9-26","DOIUrl":null,"url":null,"abstract":"This article presents the results of a study of algorithms, methods and approaches to depersonalization and enrichment of data, including personal data. Among the types of data enrichment, demographic, geographic and behavioral data enrichments were considered, as well as statistical, semantic and pragmatic data enrichment algorithms were studied. In addition to data enrichment, categories of ontology enrichment were considered, namely expressive ontologies, lightweight ontologies such as taxonomies, and a category that includes works that use reasoning to partially replace traditional methods of knowledge extraction. Ontology enrichment is a broad area of research that can be divided into three categories of work devoted to extracting semantic knowledge from heterogeneous data. As a result of the analysis, it was found that data enrichment processes optimize sales, as well as reduce business costs, by saving finances through information management. The advantages and disadvantages of the considered approaches and methods of data enrichment and ontologies were presented. The main benefit of fortification is the increased value and accuracy of information that helps companies make important business decisions. The main disadvantage is the risk of growing redundant data, which can lead to incorrect analytics and, accordingly, to wrong business decisions, which in turn harms the business. The significance of the analysis is also presented – on the basis of the studies carried out, it is planned to form a technical proposal for creating the basic infrastructure of the project of the NTI Central Committee \"Trusted Information Exchange Environment\" for further research on the topic of data enrichment and depersonalization.","PeriodicalId":207311,"journal":{"name":"Digital Technology Security","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Technology Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17212/2782-2230-2022-4-9-26","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This article presents the results of a study of algorithms, methods and approaches to depersonalization and enrichment of data, including personal data. Among the types of data enrichment, demographic, geographic and behavioral data enrichments were considered, as well as statistical, semantic and pragmatic data enrichment algorithms were studied. In addition to data enrichment, categories of ontology enrichment were considered, namely expressive ontologies, lightweight ontologies such as taxonomies, and a category that includes works that use reasoning to partially replace traditional methods of knowledge extraction. Ontology enrichment is a broad area of research that can be divided into three categories of work devoted to extracting semantic knowledge from heterogeneous data. As a result of the analysis, it was found that data enrichment processes optimize sales, as well as reduce business costs, by saving finances through information management. The advantages and disadvantages of the considered approaches and methods of data enrichment and ontologies were presented. The main benefit of fortification is the increased value and accuracy of information that helps companies make important business decisions. The main disadvantage is the risk of growing redundant data, which can lead to incorrect analytics and, accordingly, to wrong business decisions, which in turn harms the business. The significance of the analysis is also presented – on the basis of the studies carried out, it is planned to form a technical proposal for creating the basic infrastructure of the project of the NTI Central Committee "Trusted Information Exchange Environment" for further research on the topic of data enrichment and depersonalization.
算法、方法和途径详细和丰富的数据,包括个人数据
本文介绍了一项算法、方法和方法的研究结果,以去个性化和丰富数据,包括个人数据。在数据充实类型中,考虑了人口统计、地理和行为数据充实,并研究了统计、语义和语用数据充实算法。除了数据丰富之外,还考虑了本体丰富的类别,即表达性本体,轻量级本体(如分类法),以及包括使用推理部分取代传统知识提取方法的作品的类别。本体丰富是一个广泛的研究领域,可分为三类致力于从异构数据中提取语义知识的工作。分析结果发现,数据丰富流程通过信息管理节省财务,从而优化销售,并降低业务成本。介绍了目前所考虑的数据充实和本体论方法的优缺点。强化的主要好处是增加了信息的价值和准确性,帮助公司做出重要的业务决策。主要的缺点是数据冗余的风险,这可能导致不正确的分析,从而导致错误的业务决策,进而损害业务。分析的意义也被提出-在所进行的研究的基础上,计划形成一个技术建议,以创建NTI中央委员会“可信信息交换环境”项目的基础设施,以进一步研究数据丰富和去个性化的主题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信