Xianglin Liu, , , Yanqing Deng, , , Sizhi Chen, , , Jin Wang, , , Yan Zhang, , , Ming Li, , , Wenjun Zhong, , , Lijuan Zhang, , and , Xiaowei Zhang*,
{"title":"通过机器学习识别大型水生生态系统中藻华的关键分类群","authors":"Xianglin Liu, , , Yanqing Deng, , , Sizhi Chen, , , Jin Wang, , , Yan Zhang, , , Ming Li, , , Wenjun Zhong, , , Lijuan Zhang, , and , Xiaowei Zhang*, ","doi":"10.1021/acs.est.5c08910","DOIUrl":null,"url":null,"abstract":"<p >Identifying key species responsible for excessive growth of algae communities, as reflected by the floating algae index (FAI), is crucial for developing targeted management strategies to control algal blooms (ABs). However, current approaches for algal biomonitoring in large aquatic ecosystems are limited by either low taxonomic resolution or insufficient spatial coverage. To address these limitations, this study developed a supervised machine learning (ML) approach that integrates environmental DNA metabarcoding, remote sensing, and water quality parameters to identify the key algal bloom species and map their spatial distribution. Results demonstrated that the gradient boosting tree model achieved high predictive accuracy, with a mean MAPE of 11.20% across different algal taxa. Using this model, the spatial distribution maps were generated for 34 algal taxa. Prediction accuracy was further validated by comparing model outputs with morphological survey data, revealing a significant positive correlation (Spearman’s correlation coefficient 0.366–0.709, <i>p</i> < 0.05) for 75% of the species. By integrating spatial mapping of algal distributions and FAI with principal component regression, the contributions of various algae taxa to the overall community structure were quantified across different regions. <i>Nostocales</i> and <i>Stephanodiscales</i> were identified as the key taxa driving FAI variations throughout Poyang Lake, with the toxic alga <i>Nostocales</i> exerting a greater influence in the northern region compared to other species. This study presents a novel framework for large-scale species-level simulation of algal dynamics, representing a significant advance toward more precise and comprehensive monitoring and management of algal blooms.</p>","PeriodicalId":36,"journal":{"name":"环境科学与技术","volume":"59 38","pages":"20499–20511"},"PeriodicalIF":11.3000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying Key Taxa for Algal Blooms in a Large Aquatic Ecosystem through Machine Learning\",\"authors\":\"Xianglin Liu, , , Yanqing Deng, , , Sizhi Chen, , , Jin Wang, , , Yan Zhang, , , Ming Li, , , Wenjun Zhong, , , Lijuan Zhang, , and , Xiaowei Zhang*, \",\"doi\":\"10.1021/acs.est.5c08910\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Identifying key species responsible for excessive growth of algae communities, as reflected by the floating algae index (FAI), is crucial for developing targeted management strategies to control algal blooms (ABs). However, current approaches for algal biomonitoring in large aquatic ecosystems are limited by either low taxonomic resolution or insufficient spatial coverage. To address these limitations, this study developed a supervised machine learning (ML) approach that integrates environmental DNA metabarcoding, remote sensing, and water quality parameters to identify the key algal bloom species and map their spatial distribution. Results demonstrated that the gradient boosting tree model achieved high predictive accuracy, with a mean MAPE of 11.20% across different algal taxa. Using this model, the spatial distribution maps were generated for 34 algal taxa. Prediction accuracy was further validated by comparing model outputs with morphological survey data, revealing a significant positive correlation (Spearman’s correlation coefficient 0.366–0.709, <i>p</i> < 0.05) for 75% of the species. By integrating spatial mapping of algal distributions and FAI with principal component regression, the contributions of various algae taxa to the overall community structure were quantified across different regions. <i>Nostocales</i> and <i>Stephanodiscales</i> were identified as the key taxa driving FAI variations throughout Poyang Lake, with the toxic alga <i>Nostocales</i> exerting a greater influence in the northern region compared to other species. This study presents a novel framework for large-scale species-level simulation of algal dynamics, representing a significant advance toward more precise and comprehensive monitoring and management of algal blooms.</p>\",\"PeriodicalId\":36,\"journal\":{\"name\":\"环境科学与技术\",\"volume\":\"59 38\",\"pages\":\"20499–20511\"},\"PeriodicalIF\":11.3000,\"publicationDate\":\"2025-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"环境科学与技术\",\"FirstCategoryId\":\"1\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acs.est.5c08910\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"环境科学与技术","FirstCategoryId":"1","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.est.5c08910","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
Identifying Key Taxa for Algal Blooms in a Large Aquatic Ecosystem through Machine Learning
Identifying key species responsible for excessive growth of algae communities, as reflected by the floating algae index (FAI), is crucial for developing targeted management strategies to control algal blooms (ABs). However, current approaches for algal biomonitoring in large aquatic ecosystems are limited by either low taxonomic resolution or insufficient spatial coverage. To address these limitations, this study developed a supervised machine learning (ML) approach that integrates environmental DNA metabarcoding, remote sensing, and water quality parameters to identify the key algal bloom species and map their spatial distribution. Results demonstrated that the gradient boosting tree model achieved high predictive accuracy, with a mean MAPE of 11.20% across different algal taxa. Using this model, the spatial distribution maps were generated for 34 algal taxa. Prediction accuracy was further validated by comparing model outputs with morphological survey data, revealing a significant positive correlation (Spearman’s correlation coefficient 0.366–0.709, p < 0.05) for 75% of the species. By integrating spatial mapping of algal distributions and FAI with principal component regression, the contributions of various algae taxa to the overall community structure were quantified across different regions. Nostocales and Stephanodiscales were identified as the key taxa driving FAI variations throughout Poyang Lake, with the toxic alga Nostocales exerting a greater influence in the northern region compared to other species. This study presents a novel framework for large-scale species-level simulation of algal dynamics, representing a significant advance toward more precise and comprehensive monitoring and management of algal blooms.
期刊介绍:
Environmental Science & Technology (ES&T) is a co-sponsored academic and technical magazine by the Hubei Provincial Environmental Protection Bureau and the Hubei Provincial Academy of Environmental Sciences.
Environmental Science & Technology (ES&T) holds the status of Chinese core journals, scientific papers source journals of China, Chinese Science Citation Database source journals, and Chinese Academic Journal Comprehensive Evaluation Database source journals. This publication focuses on the academic field of environmental protection, featuring articles related to environmental protection and technical advancements.