An open-source workflow for identifying hydrodynamic water quality events in rivers by continuous water quality monitoring and time-series data processing using R and US EPA CANARY
L. Cronin , C.M. Taylor , C. Briciu Burghina , F.E. Lucy , F. Regan
{"title":"An open-source workflow for identifying hydrodynamic water quality events in rivers by continuous water quality monitoring and time-series data processing using R and US EPA CANARY","authors":"L. Cronin , C.M. Taylor , C. Briciu Burghina , F.E. Lucy , F. Regan","doi":"10.1016/j.mex.2025.103538","DOIUrl":null,"url":null,"abstract":"<div><div>Improving European surface water quality requires urgent action to address diffuse pollution sources particularly from agriculture, with increased frequency and intensity of hydroclimatic events also a key driver of pollutant export to waters and water quality decline worldwide.</div><div>However, the need for comprehensive, practical protocols for sensor deployment, sensor maintenance and data management for the adoption of high frequency water quality monitoring has been highlighted, along with the challenges for citizen scientists in analyzing millions of water quality data points and sharing metadata. The practical method presented, with reproducibility built into the workflow, is designed for multiple users and a step-by-step application of the workflow is demonstrated including:<ul><li><span>•</span><span><div>Deployment arrangement for water quality sondes in two temporary monitoring stations with different site characteristics.</div></span></li><li><span>•</span><span><div>Data collection and data validation methods.</div></span></li><li><span>•</span><span><div>Concise, reproducible, open-source workflow detailing the use of R, R markdown and US EPA CANARY software for data import, data cleaning, data visualization, data integrity, along with site-specific CANARY event system configuration for the detection of potential water quality events.</div></span></li></ul>Results for two monitoring stations on different rivers show CANARY successfully identified 100 % (n 47) and 97 % (n 39) of the manually identified turbidity events.</div></div>","PeriodicalId":18446,"journal":{"name":"MethodsX","volume":"15 ","pages":"Article 103538"},"PeriodicalIF":1.9000,"publicationDate":"2025-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MethodsX","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2215016125003826","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Improving European surface water quality requires urgent action to address diffuse pollution sources particularly from agriculture, with increased frequency and intensity of hydroclimatic events also a key driver of pollutant export to waters and water quality decline worldwide.
However, the need for comprehensive, practical protocols for sensor deployment, sensor maintenance and data management for the adoption of high frequency water quality monitoring has been highlighted, along with the challenges for citizen scientists in analyzing millions of water quality data points and sharing metadata. The practical method presented, with reproducibility built into the workflow, is designed for multiple users and a step-by-step application of the workflow is demonstrated including:
•
Deployment arrangement for water quality sondes in two temporary monitoring stations with different site characteristics.
•
Data collection and data validation methods.
•
Concise, reproducible, open-source workflow detailing the use of R, R markdown and US EPA CANARY software for data import, data cleaning, data visualization, data integrity, along with site-specific CANARY event system configuration for the detection of potential water quality events.
Results for two monitoring stations on different rivers show CANARY successfully identified 100 % (n 47) and 97 % (n 39) of the manually identified turbidity events.