{"title":"A statistical method for the attribution of change-points in segmented Integrated Water Vapor difference time series","authors":"Khanh Ninh Nguyen, Olivier Bock, Emilie Lebarbier","doi":"10.1002/joc.8441","DOIUrl":null,"url":null,"abstract":"<p>Many segmentation or change-point detection methods for homogenizing climate time series compare candidate station data with reference data to eliminate common climate signals and more efficiently detect spurious, non-climatic changes. One drawback is that it is difficult to decide whether the detected change-point is due to the candidate series or to the reference. A so-called attribution procedure is typically applied in a post-processing step for each detected change-point. This article describes a new statistical method for the attribution of change-points detected in Global Navigation Satellite System (GNSS) minus reanalysis series of integrated water vapour. It requires at least one nearby station with similar GNSS and reanalysis data. Six series of differences are formed from the four base series (BS) and are tested for a significant jump at the time of the change-point detected in the candidate station. The six test results are analysed with a statistical predictive rule to attribute the change-point to one, or several, of the four BS. Original aspects of our method are: (1) the significance test, which is based on a generalized linear regression approach, taking both heteroscedasticity and autocorrelation into account; (2) the predictive rule, which uses a machine learning method and is constructed from the test results obtained with the real data by using a resampling strategy. Four popular machine learning methods have been compared using cross-validation and the best one was applied to a real data set (49 main stations with 114 change-points). The results depend on the choice of the test significance level and the aggregation method combining the prediction results when several nearby stations are available. We find that 62% of the change-points are attributed to GNSS, 19% to the reanalysis, and 10% are due to coincident detections.</p>","PeriodicalId":13779,"journal":{"name":"International Journal of Climatology","volume":null,"pages":null},"PeriodicalIF":3.5000,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/joc.8441","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Climatology","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/joc.8441","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Many segmentation or change-point detection methods for homogenizing climate time series compare candidate station data with reference data to eliminate common climate signals and more efficiently detect spurious, non-climatic changes. One drawback is that it is difficult to decide whether the detected change-point is due to the candidate series or to the reference. A so-called attribution procedure is typically applied in a post-processing step for each detected change-point. This article describes a new statistical method for the attribution of change-points detected in Global Navigation Satellite System (GNSS) minus reanalysis series of integrated water vapour. It requires at least one nearby station with similar GNSS and reanalysis data. Six series of differences are formed from the four base series (BS) and are tested for a significant jump at the time of the change-point detected in the candidate station. The six test results are analysed with a statistical predictive rule to attribute the change-point to one, or several, of the four BS. Original aspects of our method are: (1) the significance test, which is based on a generalized linear regression approach, taking both heteroscedasticity and autocorrelation into account; (2) the predictive rule, which uses a machine learning method and is constructed from the test results obtained with the real data by using a resampling strategy. Four popular machine learning methods have been compared using cross-validation and the best one was applied to a real data set (49 main stations with 114 change-points). The results depend on the choice of the test significance level and the aggregation method combining the prediction results when several nearby stations are available. We find that 62% of the change-points are attributed to GNSS, 19% to the reanalysis, and 10% are due to coincident detections.
期刊介绍:
The International Journal of Climatology aims to span the well established but rapidly growing field of climatology, through the publication of research papers, short communications, major reviews of progress and reviews of new books and reports in the area of climate science. The Journal’s main role is to stimulate and report research in climatology, from the expansive fields of the atmospheric, biophysical, engineering and social sciences. Coverage includes: Climate system science; Local to global scale climate observations and modelling; Seasonal to interannual climate prediction; Climatic variability and climate change; Synoptic, dynamic and urban climatology, hydroclimatology, human bioclimatology, ecoclimatology, dendroclimatology, palaeoclimatology, marine climatology and atmosphere-ocean interactions; Application of climatological knowledge to environmental assessment and management and economic production; Climate and society interactions