Review on Finding Dominance on Incomplete Big Data
Anu V Kottath, Prince V Jose
2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI), 2019-04-01
DOI: 10.1109/ICOEI.2019.8862597 (https://doi.org/10.1109/ICOEI.2019.8862597)
Big Data refers to data sets that are very large and still growing exponentially over time. In short, these data sets are both large and complex, and traditional data management tools cannot store and process them effectively. Incomplete data sets have randomly distributed missing values in their dimensions, and retrieving information from such a data set is very hard when it is large. The dominance value is the most influential value in the data set, and deep analysis is needed to identify the top-k dominant values. Existing methods for finding the top-k dominant values include pairwise comparison, the skyline-based algorithm, the upper-bound-based algorithm, and the bitmap-index-guided algorithm. Their main drawbacks are, respectively, that they apply mainly to small data sets, become more complex as the data grows, require numerous comparisons between values, and process data slowly. This review discusses in detail the existing methods for finding dominance values on incomplete data sets.
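As a rough illustration of the pairwise comparison approach named above, the following Python sketch scores each row by how many other rows it dominates and returns the k highest scores. The abstract does not define dominance, so the sketch assumes a definition commonly used for incomplete data: two rows are compared only on the dimensions observed in both, smaller values are better, and a row dominates another if it is no worse on every shared dimension and strictly better on at least one. The names `dominates` and `top_k_dominating` are hypothetical, and this is not presented as the method of any surveyed paper.

```python
from typing import List, Optional, Sequence, Tuple

Row = Sequence[Optional[float]]  # None marks a missing value


def dominates(a: Row, b: Row) -> bool:
    """Assumed dominance test: use only dimensions observed in both rows,
    with smaller values treated as better."""
    strictly_better = False
    for x, y in zip(a, b):
        if x is None or y is None:
            continue  # skip dimensions missing in either row
        if x > y:
            return False  # a is worse on a shared dimension
        if x < y:
            strictly_better = True
    return strictly_better


def top_k_dominating(rows: Sequence[Row], k: int) -> List[Tuple[int, int]]:
    """Return (row index, dominance score) pairs for the k highest scores.

    Every row is compared with every other row, so the cost is quadratic
    in the number of rows.
    """
    scores = [
        (i, sum(dominates(a, b) for j, b in enumerate(rows) if j != i))
        for i, a in enumerate(rows)
    ]
    # Highest score first; ties are broken by row index.
    scores.sort(key=lambda s: (-s[1], s[0]))
    return scores[:k]


if __name__ == "__main__":
    sample = [
        (1, None, 3),
        (2, 2, None),
        (None, 1, 4),
        (3, 3, 5),
    ]
    print(top_k_dominating(sample, 2))
```

The nested comparison makes the drawback mentioned in the abstract concrete: each additional row adds a comparison against every existing row, which is why this approach suits only small data sets.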