工业水监测异常数据的多尺度挖掘与重构策略

IF 3 4区 环境科学与生态学 Q3 ENVIRONMENTAL SCIENCES
Feng Zhang, Qingyang Lu
{"title":"工业水监测异常数据的多尺度挖掘与重构策略","authors":"Feng Zhang,&nbsp;Qingyang Lu","doi":"10.1007/s10661-026-15410-1","DOIUrl":null,"url":null,"abstract":"<div><p>Improving the quality of water intake monitoring data is an urgent issue in current water management. The industrial water intake monitoring data obtained during the National Water Resources Monitoring Capacity Building Project promotion project was taken as a sample, and the common abnormal categories of water intake monitoring data were summarized, and the strategy of “rough screening–fine identification–reconstruction” was proposed. Considering the seasonal fluctuation law of water monitoring data, the multiscale industrial water monitoring abnormal data identification models were constructed based on segmented 3<i>σ</i> criterion, wavelet transform, and Fourier function. Moreover, the least squares support vector machine (LSSVM) model with adaptive inertia function and particle swarm optimization (PSO) was used to reconstruct the recovered anomaly data. The results indicate that the segmented 3<i>σ</i> criterion performs well for the rough processing of water intake monitoring data, identifying 26 data points that fall outside the corresponding threshold intervals. The Fourier function can effectively reduce the information loss associated with the wavelet transform, thereby improving the accuracy of abnormal data identification; based on verification feedback from monitoring users, 31 of the 38 detected abnormal points were confirmed as “demand-driven anomalies,” yielding an identification accuracy of 81.6%. Furthermore, the inertia function–particle swarm optimization LSSVM model meets the high-precision requirements for abnormal data reconstruction and recovery, and its reconstruction accuracy is higher than that of the LSSVM, the PSO-LSSVM, and the traditional curve fitting method. Specifically, the inertia function–particle swarm optimization LSSVM achieves an average fitting error of 0.0286, representing reductions of 46.2% and 44.4% compared with the LSSVM (0.0532) and PSO-LSSVM (0.0514), respectively; moreover, when compared with the ground-truth values obtained from verification feedback, the reconstruction error rate is below 5%. Overall, the proposed multiscale mining and reconstruction strategy for industrial water intake monitoring abnormal data can provide a valuable methodological reference for enhancing the decision support capability of data in the National Water Resources Monitoring Capacity Building Project.</p></div>","PeriodicalId":544,"journal":{"name":"Environmental Monitoring and Assessment","volume":"198 6","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2026-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiscale mining and reconstruction strategy for industrial water monitoring abnormal data\",\"authors\":\"Feng Zhang,&nbsp;Qingyang Lu\",\"doi\":\"10.1007/s10661-026-15410-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Improving the quality of water intake monitoring data is an urgent issue in current water management. The industrial water intake monitoring data obtained during the National Water Resources Monitoring Capacity Building Project promotion project was taken as a sample, and the common abnormal categories of water intake monitoring data were summarized, and the strategy of “rough screening–fine identification–reconstruction” was proposed. Considering the seasonal fluctuation law of water monitoring data, the multiscale industrial water monitoring abnormal data identification models were constructed based on segmented 3<i>σ</i> criterion, wavelet transform, and Fourier function. Moreover, the least squares support vector machine (LSSVM) model with adaptive inertia function and particle swarm optimization (PSO) was used to reconstruct the recovered anomaly data. The results indicate that the segmented 3<i>σ</i> criterion performs well for the rough processing of water intake monitoring data, identifying 26 data points that fall outside the corresponding threshold intervals. The Fourier function can effectively reduce the information loss associated with the wavelet transform, thereby improving the accuracy of abnormal data identification; based on verification feedback from monitoring users, 31 of the 38 detected abnormal points were confirmed as “demand-driven anomalies,” yielding an identification accuracy of 81.6%. Furthermore, the inertia function–particle swarm optimization LSSVM model meets the high-precision requirements for abnormal data reconstruction and recovery, and its reconstruction accuracy is higher than that of the LSSVM, the PSO-LSSVM, and the traditional curve fitting method. Specifically, the inertia function–particle swarm optimization LSSVM achieves an average fitting error of 0.0286, representing reductions of 46.2% and 44.4% compared with the LSSVM (0.0532) and PSO-LSSVM (0.0514), respectively; moreover, when compared with the ground-truth values obtained from verification feedback, the reconstruction error rate is below 5%. Overall, the proposed multiscale mining and reconstruction strategy for industrial water intake monitoring abnormal data can provide a valuable methodological reference for enhancing the decision support capability of data in the National Water Resources Monitoring Capacity Building Project.</p></div>\",\"PeriodicalId\":544,\"journal\":{\"name\":\"Environmental Monitoring and Assessment\",\"volume\":\"198 6\",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2026-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Monitoring and Assessment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10661-026-15410-1\",\"RegionNum\":4,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Monitoring and Assessment","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s10661-026-15410-1","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

提高取水监测数据的质量是当前水资源管理中亟待解决的问题。以国家水资源监测能力建设工程推进项目中获得的工业取水监测数据为样本,总结了取水监测数据常见的异常类别,提出了“粗筛-精识别-重构”策略。考虑水监测数据的季节波动规律,基于分段3σ准则、小波变换和傅立叶函数,构建了多尺度工业水监测异常数据识别模型。利用最小二乘支持向量机(LSSVM)自适应惯性函数模型和粒子群算法(PSO)对恢复的异常数据进行重构。结果表明,分割的3σ准则对取水监测数据进行了较好的粗处理,识别出了26个超出相应阈值区间的数据点。傅里叶函数可以有效减少小波变换带来的信息损失,从而提高异常数据识别的准确性;根据监测用户的验证反馈,38个检测到的异常点中有31个被确认为“需求驱动的异常”,识别准确率为81.6%。此外,惯性函数-粒子群优化LSSVM模型满足异常数据重建与恢复的高精度要求,其重建精度高于LSSVM、PSO-LSSVM和传统曲线拟合方法。其中,惯性函数-粒子群优化LSSVM的平均拟合误差为0.0286,比LSSVM(0.0532)和PSO-LSSVM(0.0514)分别降低46.2%和44.4%;与验证反馈得到的真值相比,重构错误率在5%以下。综上所述,本文提出的工业取水监测异常数据多尺度挖掘与重构策略,可为提高全国水资源监测能力建设项目中数据的决策支持能力提供有价值的方法论参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Multiscale mining and reconstruction strategy for industrial water monitoring abnormal data

Multiscale mining and reconstruction strategy for industrial water monitoring abnormal data

Improving the quality of water intake monitoring data is an urgent issue in current water management. The industrial water intake monitoring data obtained during the National Water Resources Monitoring Capacity Building Project promotion project was taken as a sample, and the common abnormal categories of water intake monitoring data were summarized, and the strategy of “rough screening–fine identification–reconstruction” was proposed. Considering the seasonal fluctuation law of water monitoring data, the multiscale industrial water monitoring abnormal data identification models were constructed based on segmented 3σ criterion, wavelet transform, and Fourier function. Moreover, the least squares support vector machine (LSSVM) model with adaptive inertia function and particle swarm optimization (PSO) was used to reconstruct the recovered anomaly data. The results indicate that the segmented 3σ criterion performs well for the rough processing of water intake monitoring data, identifying 26 data points that fall outside the corresponding threshold intervals. The Fourier function can effectively reduce the information loss associated with the wavelet transform, thereby improving the accuracy of abnormal data identification; based on verification feedback from monitoring users, 31 of the 38 detected abnormal points were confirmed as “demand-driven anomalies,” yielding an identification accuracy of 81.6%. Furthermore, the inertia function–particle swarm optimization LSSVM model meets the high-precision requirements for abnormal data reconstruction and recovery, and its reconstruction accuracy is higher than that of the LSSVM, the PSO-LSSVM, and the traditional curve fitting method. Specifically, the inertia function–particle swarm optimization LSSVM achieves an average fitting error of 0.0286, representing reductions of 46.2% and 44.4% compared with the LSSVM (0.0532) and PSO-LSSVM (0.0514), respectively; moreover, when compared with the ground-truth values obtained from verification feedback, the reconstruction error rate is below 5%. Overall, the proposed multiscale mining and reconstruction strategy for industrial water intake monitoring abnormal data can provide a valuable methodological reference for enhancing the decision support capability of data in the National Water Resources Monitoring Capacity Building Project.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Environmental Monitoring and Assessment
Environmental Monitoring and Assessment 环境科学-环境科学
CiteScore
4.70
自引率
6.70%
发文量
1000
审稿时长
7.3 months
期刊介绍: Environmental Monitoring and Assessment emphasizes technical developments and data arising from environmental monitoring and assessment, the use of scientific principles in the design of monitoring systems at the local, regional and global scales, and the use of monitoring data in assessing the consequences of natural resource management actions and pollution risks to man and the environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书