Review on Finding Dominance on Incomplete Big Data

Anu V Kottath, Prince V Jose
Published in: 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI)
Publication date: 2019-04-01
DOI: 10.1109/ICOEI.2019.8862597 (https://doi.org/10.1109/ICOEI.2019.8862597)
Citations: 0

Abstract

Big Data refers to data sets of very large size that continue to grow exponentially over time. In short, such data sets are large and complex, and traditional data management tools cannot store and process them effectively. Some data sets are incomplete, with missing values randomly distributed across their dimensions, and retrieving information from such a data set is very hard when it is large. A dominance value is the most influential value in a data set, and a deep analysis is needed to identify the top-k dominance values. Existing methods for finding the top-k dominant values include pairwise comparison, the skyline-based algorithm, the upper-bound-based algorithm, and the bitmap-index-guided algorithm. Their major drawbacks are, respectively, that they are mainly applicable only to small data sets, that complexity increases as the data grows, that they require numerous comparisons between values, and that data processing is slower. This review discusses in detail the existing methods for finding dominance values on incomplete data sets.
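To make the pairwise comparison approach named in the abstract concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes a commonly used dominance definition for incomplete data that the abstract itself does not state: an object p dominates q if, on every dimension where both values are observed, p is no worse than q, and p is strictly better on at least one such dimension. It also assumes that smaller values are better and that `None` marks a missing value. The function names `dominates` and `top_k_dominating` are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

# A row is one object; None marks a missing value (assumption of this sketch).
Row = Sequence[Optional[float]]


def dominates(p: Row, q: Row) -> bool:
    """Return True if p dominates q on the dimensions both have observed.

    Assumption: smaller values are better.
    """
    strictly_better = False
    for a, b in zip(p, q):
        if a is None or b is None:
            continue  # skip dimensions that are missing in either object
        if a > b:
            return False  # p is worse on a shared dimension
        if a < b:
            strictly_better = True
    return strictly_better


def top_k_dominating(data: Sequence[Row], k: int) -> List[Tuple[int, int]]:
    """Return the k (index, score) pairs with the highest dominance score.

    The score of an object is the number of other objects it dominates.
    Every pair is compared, so the cost is quadratic in the number of objects.
    """
    scored = []
    for i, p in enumerate(data):
        score = sum(1 for j, q in enumerate(data) if i != j and dominates(p, q))
        scored.append((i, score))
    # Highest score first; ties are broken by the lower index.
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:k]


if __name__ == "__main__":
    data = [
        (1, None, 2),
        (2, 3, None),
        (None, 1, 4),
        (3, 4, 5),
    ]
    print(top_k_dominating(data, 2))
```

The nested loop compares every pair of objects, which illustrates why the abstract describes pairwise comparison as practical mainly for small data sets.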