Unsupervised segmentation and clustering time series approach to Southern Africa rainfall regime changes

IF 3.3 3区 地球科学 Q2 GEOSCIENCES, MULTIDISCIPLINARY
Lovemore Chipindu, Walter Mupangwa, Isaiah Nyagumbo, Mainassara Zaman-Allah
{"title":"Unsupervised segmentation and clustering time series approach to Southern Africa rainfall regime changes","authors":"Lovemore Chipindu,&nbsp;Walter Mupangwa,&nbsp;Isaiah Nyagumbo,&nbsp;Mainassara Zaman-Allah","doi":"10.1002/gdj3.228","DOIUrl":null,"url":null,"abstract":"<p>Analysis of hydro-climatological time series and spatiotemporal dynamics of meteorological variables has become critical in the context of climate change, especially in Southern African countries where rain-fed agriculture is predominant. In this work, we compared modern unsupervised time series and segmentation approaches and commonly used time series models to analyse rainfall regime changes in the coastal, sub-humid and semi-arid regions of Southern Africa. Rainfall regimes change modelling and prediction inform farming strategies especially when choosing measures for mixed crop–livestock farming systems, as farmers can decide to do rainwater harvesting and moisture conservation or supplementary irrigation if water resources are available. The main goal of this study was to predict/identify rainfall cluster trends over time using regression with hidden logistic process (RHLP) or hidden Markov model regression (HMMR) supplemented by autoregressive integrated moving average (ARIMA) and Facebook Prophet models. Historical time series rainfall data was sourced from meteorological services departments for selected site over an average period of 55 years. Commonly used approaches forecasted an upward rainfall trend in the coastal and sub-humid regions and a declining trend in semi-arid areas with high variability between and within seasons. For all sites, Ljung-Box Test Statistics suggested the existence of autocorrelation in rainfall time series data. Prediction capabilities were investigated using the root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) which indicated not much difference between ARIMA and Facebook Prophet models. RHLP and HMMR offered a unique clustering and segmentation approach examining between and within-season rainfall variability. A maximum of 20 unique rainfall clusters with similar trend characteristics were determined as going beyond this brought non-significant difference to regime changes. A clear trend was exhibited from 1980 going backwards as compared to recent years signifying how unpredictable is rainfall in Southern Africa. The unsupervised approaches predicted a clear cluster trend in coastal than in sub-humid and semi-arid and the performance was assessed using Akaike information criteria and log-likelihood which showed improvement in prediction power as the number of segmentation clusters approaches 20.</p>","PeriodicalId":54351,"journal":{"name":"Geoscience Data Journal","volume":"11 4","pages":"514-530"},"PeriodicalIF":3.3000,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gdj3.228","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoscience Data Journal","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gdj3.228","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Analysis of hydro-climatological time series and spatiotemporal dynamics of meteorological variables has become critical in the context of climate change, especially in Southern African countries where rain-fed agriculture is predominant. In this work, we compared modern unsupervised time series and segmentation approaches and commonly used time series models to analyse rainfall regime changes in the coastal, sub-humid and semi-arid regions of Southern Africa. Rainfall regimes change modelling and prediction inform farming strategies especially when choosing measures for mixed crop–livestock farming systems, as farmers can decide to do rainwater harvesting and moisture conservation or supplementary irrigation if water resources are available. The main goal of this study was to predict/identify rainfall cluster trends over time using regression with hidden logistic process (RHLP) or hidden Markov model regression (HMMR) supplemented by autoregressive integrated moving average (ARIMA) and Facebook Prophet models. Historical time series rainfall data was sourced from meteorological services departments for selected site over an average period of 55 years. Commonly used approaches forecasted an upward rainfall trend in the coastal and sub-humid regions and a declining trend in semi-arid areas with high variability between and within seasons. For all sites, Ljung-Box Test Statistics suggested the existence of autocorrelation in rainfall time series data. Prediction capabilities were investigated using the root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) which indicated not much difference between ARIMA and Facebook Prophet models. RHLP and HMMR offered a unique clustering and segmentation approach examining between and within-season rainfall variability. A maximum of 20 unique rainfall clusters with similar trend characteristics were determined as going beyond this brought non-significant difference to regime changes. A clear trend was exhibited from 1980 going backwards as compared to recent years signifying how unpredictable is rainfall in Southern Africa. The unsupervised approaches predicted a clear cluster trend in coastal than in sub-humid and semi-arid and the performance was assessed using Akaike information criteria and log-likelihood which showed improvement in prediction power as the number of segmentation clusters approaches 20.

Abstract Image

Abstract Image

针对南部非洲降雨机制变化的无监督分割和聚类时间序列方法
在气候变化的背景下,水文气候学时间序列和气象变量时空动态分析变得至关重要,尤其是在以雨水灌溉农业为主的南部非洲国家。在这项工作中,我们比较了现代无监督时间序列和分割方法以及常用的时间序列模型,以分析南部非洲沿海、亚湿润和半干旱地区的降雨机制变化。雨量变化模型和预测为农业战略提供了信息,尤其是在选择作物-牲畜混合耕作系统的措施时,因为农民可以决定收集雨水和保墒,或者在有水资源的情况下进行补充灌溉。本研究的主要目标是利用隐藏逻辑过程回归(RHLP)或隐藏马尔可夫模型回归(HMMR),辅以自回归综合移动平均(ARIMA)和 Facebook Prophet 模型,预测/识别降雨集群的长期趋势。选定地点平均 55 年的历史时间序列降雨量数据来自气象服务部门。常用方法预测沿海和半湿润地区的降雨量呈上升趋势,而半干旱地区的降雨量呈下降趋势,且季节间和季节内的变化很大。Ljung-Box 检验统计表明,所有地点的降雨时间序列数据都存在自相关性。使用均方根误差(RMSE)、平均绝对误差(MAE)和平均绝对百分比误差(MAPE)对预测能力进行了研究,结果表明,ARIMA 模型和 Facebook 先知模型之间的差异不大。RHLP 和 HMMR 提供了一种独特的聚类和细分方法,可检查季节间和季节内的降雨量变化。最多可确定 20 个具有相似趋势特征的独特降雨集群,因为超过这个集群,降雨系统变化的差异就不明显了。与近几年的降雨量相比,1980 年以来的降雨量呈明显的倒退趋势,这表明南部非洲的降雨量是多么难以预测。使用阿凯克信息标准和对数似然法对性能进行了评估,结果表明,当细分集群数量接近 20 个时,预测能力有所提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Geoscience Data Journal
Geoscience Data Journal GEOSCIENCES, MULTIDISCIPLINARYMETEOROLOGY-METEOROLOGY & ATMOSPHERIC SCIENCES
CiteScore
5.90
自引率
9.40%
发文量
35
审稿时长
4 weeks
期刊介绍: Geoscience Data Journal provides an Open Access platform where scientific data can be formally published, in a way that includes scientific peer-review. Thus the dataset creator attains full credit for their efforts, while also improving the scientific record, providing version control for the community and allowing major datasets to be fully described, cited and discovered. An online-only journal, GDJ publishes short data papers cross-linked to – and citing – datasets that have been deposited in approved data centres and awarded DOIs. The journal will also accept articles on data services, and articles which support and inform data publishing best practices. Data is at the heart of science and scientific endeavour. The curation of data and the science associated with it is as important as ever in our understanding of the changing earth system and thereby enabling us to make future predictions. Geoscience Data Journal is working with recognised Data Centres across the globe to develop the future strategy for data publication, the recognition of the value of data and the communication and exploitation of data to the wider science and stakeholder communities.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信