{"title":"Low-cost sensor-based algal bloom labeling: a comparative study of SVM and logic methods","authors":"Jai-yeop Lee","doi":"10.1007/s10661-025-13815-y","DOIUrl":null,"url":null,"abstract":"<p>This study explores a low-cost sensor system for real-time algae bloom detection and water management. Harmful algal blooms (HABs) threaten water quality, ecosystems, and public health. Traditional detection methods, like satellite imagery and unmanned aerial vehicle (UAV), are expensive and not always suited for real-time monitoring. The proposed system uses sunlight and illuminance sensors to predict algae blooms and water level fluctuations. Data from these sensors are analyzed using support vector machines (SVM) and logical algorithms, with labeling based on sensor readings (e.g., “algae”, “sunny”, “shade”, “aqua”). A multiple linear regression (MLR) model is also developed to predict chlorophyll-a (Chl-a) concentrations. Both SVM and logical algorithms proved effective. The classification using SVM with four sensor values achieved an accuracy of 92.6%, while applying principal component analysis (PCA) before SVM classification resulted in 91.0% accuracy. In contrast, applying a sequential logical sequence to the boundary conditions of a single SVM model improved accuracy to 95.1%, and incorporating PCA-transformed SVM boundary conditions achieved 100.0% accuracy. This surpassed the performance of nonlinear decision models such as random forest and gradient boosting, which achieved 99.2% accuracy. The MLR model successfully predicted Chl-a levels with a 14.3% error rate for values above 5 mg/L. The developed system is an efficient alternative to traditional methods, enhancing real-time monitoring in water quality management.</p>","PeriodicalId":544,"journal":{"name":"Environmental Monitoring and Assessment","volume":"197 4","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Monitoring and Assessment","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s10661-025-13815-y","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
This study explores a low-cost sensor system for real-time algae bloom detection and water management. Harmful algal blooms (HABs) threaten water quality, ecosystems, and public health. Traditional detection methods, like satellite imagery and unmanned aerial vehicle (UAV), are expensive and not always suited for real-time monitoring. The proposed system uses sunlight and illuminance sensors to predict algae blooms and water level fluctuations. Data from these sensors are analyzed using support vector machines (SVM) and logical algorithms, with labeling based on sensor readings (e.g., “algae”, “sunny”, “shade”, “aqua”). A multiple linear regression (MLR) model is also developed to predict chlorophyll-a (Chl-a) concentrations. Both SVM and logical algorithms proved effective. The classification using SVM with four sensor values achieved an accuracy of 92.6%, while applying principal component analysis (PCA) before SVM classification resulted in 91.0% accuracy. In contrast, applying a sequential logical sequence to the boundary conditions of a single SVM model improved accuracy to 95.1%, and incorporating PCA-transformed SVM boundary conditions achieved 100.0% accuracy. This surpassed the performance of nonlinear decision models such as random forest and gradient boosting, which achieved 99.2% accuracy. The MLR model successfully predicted Chl-a levels with a 14.3% error rate for values above 5 mg/L. The developed system is an efficient alternative to traditional methods, enhancing real-time monitoring in water quality management.
期刊介绍:
Environmental Monitoring and Assessment emphasizes technical developments and data arising from environmental monitoring and assessment, the use of scientific principles in the design of monitoring systems at the local, regional and global scales, and the use of monitoring data in assessing the consequences of natural resource management actions and pollution risks to man and the environment.