{"title":"Big Data Sampling Algorithm Based on Peak Detection","authors":"Mengyu Liu, Yuhang Wang, Ruishi Lin, Shenhang Wang, Wei Zheng","doi":"10.1109/CCDC52312.2021.9601711","DOIUrl":null,"url":null,"abstract":"Domestic mass data processing system in aerospace field uses big data simple sampling algorithm for data specification in the data preprocessing stage. This paper analyzes the data curve distortion caused by this algorithm, and proposes an optimization method for that. Finally, a big data sampling algorithm based on peak detection is adopted to achieve the purpose of quickly viewing the fidelity and complete picture of massive historical data, while ensuring the correctness of the data interpretation after data preprocessing at the same time. Through the using of real test data for verification, in the data preprocessing stage of the domestic mass data processing system, the large data sampling algorithm based on peak detection is adopted to achieve the high fidelity of the data curve after sampling.","PeriodicalId":143976,"journal":{"name":"2021 33rd Chinese Control and Decision Conference (CCDC)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 33rd Chinese Control and Decision Conference (CCDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCDC52312.2021.9601711","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Domestic mass data processing system in aerospace field uses big data simple sampling algorithm for data specification in the data preprocessing stage. This paper analyzes the data curve distortion caused by this algorithm, and proposes an optimization method for that. Finally, a big data sampling algorithm based on peak detection is adopted to achieve the purpose of quickly viewing the fidelity and complete picture of massive historical data, while ensuring the correctness of the data interpretation after data preprocessing at the same time. Through the using of real test data for verification, in the data preprocessing stage of the domestic mass data processing system, the large data sampling algorithm based on peak detection is adopted to achieve the high fidelity of the data curve after sampling.