{"title":"Time Series Data Processing Algorithm in Deep Water Drilling","authors":"Ruidong Zhao, Zhiming Yin, Yonghua Li","doi":"10.1109/IC-NIDC54101.2021.9660524","DOIUrl":null,"url":null,"abstract":"The machine learning technology can be applied to the drilling data of deep water. The drilling data is an internally associated multi -dimensional time series data set, which contains a certain amount of abnormal data that greatly affect the data mining process. Because of the high dimension and large scale in drilling data set, existing detection algorithms perform poorly in drilling data sets. To achieve more effective outlier detection, we propose a data processing algorithm based on fusion outlier detection method. Firstly, Isolation Forest, Elliptic Envelope and Local Outlier Factor are used to detect outliers, judge the abnormal data in different conditions, which are weighted to judge and remove the outliers. Secondly, Savitzky-Golay(SG) filter is used to smooth the data, which eliminates the burrs in the data and get clean time series data. Finally, the proposed algorithm is tested in the real drilling data sets. Compared with existing methods, the experiments show that the proposed algorithm can achieve better performance, the RMSE and MAE values are 0.454 and 0.361, respectively.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"38 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC-NIDC54101.2021.9660524","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The machine learning technology can be applied to the drilling data of deep water. The drilling data is an internally associated multi -dimensional time series data set, which contains a certain amount of abnormal data that greatly affect the data mining process. Because of the high dimension and large scale in drilling data set, existing detection algorithms perform poorly in drilling data sets. To achieve more effective outlier detection, we propose a data processing algorithm based on fusion outlier detection method. Firstly, Isolation Forest, Elliptic Envelope and Local Outlier Factor are used to detect outliers, judge the abnormal data in different conditions, which are weighted to judge and remove the outliers. Secondly, Savitzky-Golay(SG) filter is used to smooth the data, which eliminates the burrs in the data and get clean time series data. Finally, the proposed algorithm is tested in the real drilling data sets. Compared with existing methods, the experiments show that the proposed algorithm can achieve better performance, the RMSE and MAE values are 0.454 and 0.361, respectively.