Review on Finding Dominance on Incomplete Big Data

2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI) Pub Date : 2019-04-01 DOI:10.1109/ICOEI.2019.8862597

Anu V Kottath, Prince V Jose


Citations: 0

Abstract


Review on Finding Dominance on Incomplete Big Data

Big Data refers to data sets that are huge in size and still growing exponentially with time. In short, such data sets are large and complex. Traditional data management tools cannot store and process these large data sets effectively. Some data sets contain incomplete data, with randomly distributed missing nodes in their dimensions. It is very hard to retrieve data from this type of data set when it is large. The dominance value is the most influential value in the data set. A deep analysis is needed to identify the top-k dominant values in the data set. Existing methods for finding the top-k dominant values include pairwise comparison, the skyline-based algorithm, the upper-bound-based algorithm, and the bitmap-index-guided algorithm. The major problems of these methods are, respectively, that they are applicable mainly to small data sets, that their complexity increases with increasing data, that they require numerous comparisons between values, and that their data processing is slow. This review discusses in detail the existing methods for finding dominance values on incomplete data sets.
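To make the pairwise-comparison idea concrete, here is a minimal Python sketch. It is not taken from the paper. It assumes a commonly used definition of dominance on incomplete data: two points are compared only on the dimensions observed in both, smaller values are better, and a point dominates another if it is no worse on every shared dimension and strictly better on at least one. The names `dominates` and `top_k_dominating` and the use of `None` for a missing value are illustrative choices.

```python
from typing import List, Optional, Sequence, Tuple

# A point is a sequence of values; None marks a missing value (assumption).
Point = Sequence[Optional[float]]


def dominates(p: Point, q: Point) -> bool:
    """Assumed definition: compare only dimensions observed in both points.

    Smaller is better. p dominates q if p is no worse on every shared
    dimension and strictly better on at least one.
    """
    common = [(a, b) for a, b in zip(p, q) if a is not None and b is not None]
    if not common:
        return False  # no shared dimension, so the points are incomparable
    no_worse = all(a <= b for a, b in common)
    strictly_better = any(a < b for a, b in common)
    return no_worse and strictly_better


def top_k_dominating(points: List[Point], k: int) -> List[Tuple[int, int]]:
    """Return (index, score) for the k points that dominate the most others.

    Exhaustive pairwise comparison: every point is checked against every
    other point, so the cost is quadratic in the number of points.
    """
    scored = []
    for i, p in enumerate(points):
        score = sum(
            1 for j, q in enumerate(points) if i != j and dominates(p, q)
        )
        scored.append((i, score))
    # Highest score first; ties broken by the lower index.
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:k]


points = [
    (1, None, 2),
    (2, 3, None),
    (None, 1, 1),
    (3, 4, 5),
]
result = top_k_dominating(points, 2)
```

Every point is compared with every other point, so the number of comparisons grows quadratically with the size of the data set. This is consistent with the abstract's remark that the approach is mainly applicable to small data sets.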
