{"title":"BDDS: An Efficient Data Screening Algorithm Based on Binary Digit","authors":"Haowei Lin, Xiaolong Xu","doi":"10.1109/CYBERC.2018.00096","DOIUrl":null,"url":null,"abstract":"In order to improve the efficiency and accuracy of data analysis, excellent data screening algorithms are needed in this era of big data. This paper proposes a binary-digit-based data screening algorithm (BDDS), utilizing the binary storage form of data in hardware and recording the data changes over a period of time with a binary-bit recorder, whose number of digits in the binary form is used to remove the influence of the medium-distance data from the current data, left shift of the binary form is used to reduce the influence of the data which are very close to the current data, and the decimal meaning is used to determine whether the data are valid data. Experiments have proved that the algorithm, as an auxiliary algorithm, can be better combined with the current mainstream data analysis algorithms to reduce the impact of marginal data, save additional storage space, and improve the accuracy and efficiency of subsequent data analysis.","PeriodicalId":282903,"journal":{"name":"2018 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)","volume":"145 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CYBERC.2018.00096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In order to improve the efficiency and accuracy of data analysis, excellent data screening algorithms are needed in this era of big data. This paper proposes a binary-digit-based data screening algorithm (BDDS), utilizing the binary storage form of data in hardware and recording the data changes over a period of time with a binary-bit recorder, whose number of digits in the binary form is used to remove the influence of the medium-distance data from the current data, left shift of the binary form is used to reduce the influence of the data which are very close to the current data, and the decimal meaning is used to determine whether the data are valid data. Experiments have proved that the algorithm, as an auxiliary algorithm, can be better combined with the current mainstream data analysis algorithms to reduce the impact of marginal data, save additional storage space, and improve the accuracy and efficiency of subsequent data analysis.