Do Hyuck Kwon, Min Jun Lee, Heewon Jeong, Sanghun Park, Kyung Hwa Cho
{"title":"Multi-modal learning-based algae phyla identification using image and particle modalities","authors":"Do Hyuck Kwon, Min Jun Lee, Heewon Jeong, Sanghun Park, Kyung Hwa Cho","doi":"10.1016/j.watres.2025.123172","DOIUrl":null,"url":null,"abstract":"Algal blooms in freshwater, which are exacerbated by urbanization and climate change, pose significant challenges in the water treatment process. These blooms affect water quality and treatment efficiency. Effective identification of algal proliferation based on the dominant species is important to ensure safe drinking water and a clean water supply. Traditional algae identification techniques, such as microscopy and molecular techniques, are time-consuming and depend on the expertise of the practitioner. This study introduced an artificial intelligence (AI)-based multi-modal approach, which is a recent advancement in techniques for improving algal identification by integrating algal images and particle properties. We employed multi-modal learning to integrate multiple data modalities, including algal images and particle properties acquired using Flow Cam, to provide robustness and reliability for classifying algal phyla, such as <em>Anabaena, Aphanizomenon, Microcystis, Oscillatoria,</em> and <em>Synedra.</em> This study involved acquiring images and particle modalities, which were conducted to integrate the dataset using early, late, and hybrid fusion methods. In addition, explainable AI approaches, including SHapley Additive exPlanations (SHAP) and gradient-weighted class activation mapping (Grad-CAM), were used to investigate the contributions of the algal image and particle modalities to the proposed multi-modal algorithm. The multi-modal algae identifier with late fusion achieved an average F1 score of 0.91 and 0.88 for training and tests related to identifying algal phyla, respectively. Furthermore, compared with other modalities, the image and particle modalities showed significant potential as complementary and reliable components of deep-learning algorithms for algal identification in the water treatment process. These findings can contribute to a safe and clean water supply by effectively identifying the dominant algal species in the water treatment process.","PeriodicalId":443,"journal":{"name":"Water Research","volume":"57 1","pages":""},"PeriodicalIF":11.4000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Research","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.watres.2025.123172","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Algal blooms in freshwater, which are exacerbated by urbanization and climate change, pose significant challenges in the water treatment process. These blooms affect water quality and treatment efficiency. Effective identification of algal proliferation based on the dominant species is important to ensure safe drinking water and a clean water supply. Traditional algae identification techniques, such as microscopy and molecular techniques, are time-consuming and depend on the expertise of the practitioner. This study introduced an artificial intelligence (AI)-based multi-modal approach, which is a recent advancement in techniques for improving algal identification by integrating algal images and particle properties. We employed multi-modal learning to integrate multiple data modalities, including algal images and particle properties acquired using Flow Cam, to provide robustness and reliability for classifying algal phyla, such as Anabaena, Aphanizomenon, Microcystis, Oscillatoria, and Synedra. This study involved acquiring images and particle modalities, which were conducted to integrate the dataset using early, late, and hybrid fusion methods. In addition, explainable AI approaches, including SHapley Additive exPlanations (SHAP) and gradient-weighted class activation mapping (Grad-CAM), were used to investigate the contributions of the algal image and particle modalities to the proposed multi-modal algorithm. The multi-modal algae identifier with late fusion achieved an average F1 score of 0.91 and 0.88 for training and tests related to identifying algal phyla, respectively. Furthermore, compared with other modalities, the image and particle modalities showed significant potential as complementary and reliable components of deep-learning algorithms for algal identification in the water treatment process. These findings can contribute to a safe and clean water supply by effectively identifying the dominant algal species in the water treatment process.
期刊介绍:
Water Research, along with its open access companion journal Water Research X, serves as a platform for publishing original research papers covering various aspects of the science and technology related to the anthropogenic water cycle, water quality, and its management worldwide. The audience targeted by the journal comprises biologists, chemical engineers, chemists, civil engineers, environmental engineers, limnologists, and microbiologists. The scope of the journal include:
•Treatment processes for water and wastewaters (municipal, agricultural, industrial, and on-site treatment), including resource recovery and residuals management;
•Urban hydrology including sewer systems, stormwater management, and green infrastructure;
•Drinking water treatment and distribution;
•Potable and non-potable water reuse;
•Sanitation, public health, and risk assessment;
•Anaerobic digestion, solid and hazardous waste management, including source characterization and the effects and control of leachates and gaseous emissions;
•Contaminants (chemical, microbial, anthropogenic particles such as nanoparticles or microplastics) and related water quality sensing, monitoring, fate, and assessment;
•Anthropogenic impacts on inland, tidal, coastal and urban waters, focusing on surface and ground waters, and point and non-point sources of pollution;
•Environmental restoration, linked to surface water, groundwater and groundwater remediation;
•Analysis of the interfaces between sediments and water, and between water and atmosphere, focusing specifically on anthropogenic impacts;
•Mathematical modelling, systems analysis, machine learning, and beneficial use of big data related to the anthropogenic water cycle;
•Socio-economic, policy, and regulations studies.