通过机器学习识别大型水生生态系统中藻华的关键分类群

IF 11.3 1区 环境科学与生态学 Q1 ENGINEERING, ENVIRONMENTAL
Xianglin Liu, , , Yanqing Deng, , , Sizhi Chen, , , Jin Wang, , , Yan Zhang, , , Ming Li, , , Wenjun Zhong, , , Lijuan Zhang, , and , Xiaowei Zhang*, 
{"title":"通过机器学习识别大型水生生态系统中藻华的关键分类群","authors":"Xianglin Liu,&nbsp;, ,&nbsp;Yanqing Deng,&nbsp;, ,&nbsp;Sizhi Chen,&nbsp;, ,&nbsp;Jin Wang,&nbsp;, ,&nbsp;Yan Zhang,&nbsp;, ,&nbsp;Ming Li,&nbsp;, ,&nbsp;Wenjun Zhong,&nbsp;, ,&nbsp;Lijuan Zhang,&nbsp;, and ,&nbsp;Xiaowei Zhang*,&nbsp;","doi":"10.1021/acs.est.5c08910","DOIUrl":null,"url":null,"abstract":"<p >Identifying key species responsible for excessive growth of algae communities, as reflected by the floating algae index (FAI), is crucial for developing targeted management strategies to control algal blooms (ABs). However, current approaches for algal biomonitoring in large aquatic ecosystems are limited by either low taxonomic resolution or insufficient spatial coverage. To address these limitations, this study developed a supervised machine learning (ML) approach that integrates environmental DNA metabarcoding, remote sensing, and water quality parameters to identify the key algal bloom species and map their spatial distribution. Results demonstrated that the gradient boosting tree model achieved high predictive accuracy, with a mean MAPE of 11.20% across different algal taxa. Using this model, the spatial distribution maps were generated for 34 algal taxa. Prediction accuracy was further validated by comparing model outputs with morphological survey data, revealing a significant positive correlation (Spearman’s correlation coefficient 0.366–0.709, <i>p</i> &lt; 0.05) for 75% of the species. By integrating spatial mapping of algal distributions and FAI with principal component regression, the contributions of various algae taxa to the overall community structure were quantified across different regions. <i>Nostocales</i> and <i>Stephanodiscales</i> were identified as the key taxa driving FAI variations throughout Poyang Lake, with the toxic alga <i>Nostocales</i> exerting a greater influence in the northern region compared to other species. This study presents a novel framework for large-scale species-level simulation of algal dynamics, representing a significant advance toward more precise and comprehensive monitoring and management of algal blooms.</p>","PeriodicalId":36,"journal":{"name":"环境科学与技术","volume":"59 38","pages":"20499–20511"},"PeriodicalIF":11.3000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying Key Taxa for Algal Blooms in a Large Aquatic Ecosystem through Machine Learning\",\"authors\":\"Xianglin Liu,&nbsp;, ,&nbsp;Yanqing Deng,&nbsp;, ,&nbsp;Sizhi Chen,&nbsp;, ,&nbsp;Jin Wang,&nbsp;, ,&nbsp;Yan Zhang,&nbsp;, ,&nbsp;Ming Li,&nbsp;, ,&nbsp;Wenjun Zhong,&nbsp;, ,&nbsp;Lijuan Zhang,&nbsp;, and ,&nbsp;Xiaowei Zhang*,&nbsp;\",\"doi\":\"10.1021/acs.est.5c08910\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Identifying key species responsible for excessive growth of algae communities, as reflected by the floating algae index (FAI), is crucial for developing targeted management strategies to control algal blooms (ABs). However, current approaches for algal biomonitoring in large aquatic ecosystems are limited by either low taxonomic resolution or insufficient spatial coverage. To address these limitations, this study developed a supervised machine learning (ML) approach that integrates environmental DNA metabarcoding, remote sensing, and water quality parameters to identify the key algal bloom species and map their spatial distribution. Results demonstrated that the gradient boosting tree model achieved high predictive accuracy, with a mean MAPE of 11.20% across different algal taxa. Using this model, the spatial distribution maps were generated for 34 algal taxa. Prediction accuracy was further validated by comparing model outputs with morphological survey data, revealing a significant positive correlation (Spearman’s correlation coefficient 0.366–0.709, <i>p</i> &lt; 0.05) for 75% of the species. By integrating spatial mapping of algal distributions and FAI with principal component regression, the contributions of various algae taxa to the overall community structure were quantified across different regions. <i>Nostocales</i> and <i>Stephanodiscales</i> were identified as the key taxa driving FAI variations throughout Poyang Lake, with the toxic alga <i>Nostocales</i> exerting a greater influence in the northern region compared to other species. This study presents a novel framework for large-scale species-level simulation of algal dynamics, representing a significant advance toward more precise and comprehensive monitoring and management of algal blooms.</p>\",\"PeriodicalId\":36,\"journal\":{\"name\":\"环境科学与技术\",\"volume\":\"59 38\",\"pages\":\"20499–20511\"},\"PeriodicalIF\":11.3000,\"publicationDate\":\"2025-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"环境科学与技术\",\"FirstCategoryId\":\"1\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acs.est.5c08910\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"环境科学与技术","FirstCategoryId":"1","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.est.5c08910","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

根据浮藻指数(FAI)确定导致藻类群落过度生长的关键物种,对于制定有针对性的管理策略来控制藻华(ABs)至关重要。然而,目前在大型水生生态系统中进行藻类生物监测的方法受到分类分辨率低或空间覆盖不足的限制。为了解决这些限制,本研究开发了一种监督机器学习(ML)方法,该方法集成了环境DNA元条形码、遥感和水质参数,以识别关键的藻华物种并绘制其空间分布。结果表明,梯度增强树模型具有较高的预测精度,不同藻类类群的平均MAPE为11.20%。利用该模型生成了34个藻类类群的空间分布图。通过将模型输出与形态学调查数据进行比较,进一步验证了预测的准确性,发现75%的物种存在显著的正相关(Spearman相关系数0.366-0.709,p < 0.05)。采用主成分回归方法,结合藻类分布空间制图和FAI,量化了不同区域不同藻类类群对群落整体结构的贡献。结果表明,Nostocales和Stephanodiscales是整个鄱阳湖FAI变化的关键类群,其中毒藻Nostocales在北部地区的影响大于其他物种。本研究为大规模物种水平的藻类动态模拟提供了一个新的框架,为更精确、更全面地监测和管理藻华提供了重要的进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Identifying Key Taxa for Algal Blooms in a Large Aquatic Ecosystem through Machine Learning

Identifying Key Taxa for Algal Blooms in a Large Aquatic Ecosystem through Machine Learning

Identifying Key Taxa for Algal Blooms in a Large Aquatic Ecosystem through Machine Learning

Identifying key species responsible for excessive growth of algae communities, as reflected by the floating algae index (FAI), is crucial for developing targeted management strategies to control algal blooms (ABs). However, current approaches for algal biomonitoring in large aquatic ecosystems are limited by either low taxonomic resolution or insufficient spatial coverage. To address these limitations, this study developed a supervised machine learning (ML) approach that integrates environmental DNA metabarcoding, remote sensing, and water quality parameters to identify the key algal bloom species and map their spatial distribution. Results demonstrated that the gradient boosting tree model achieved high predictive accuracy, with a mean MAPE of 11.20% across different algal taxa. Using this model, the spatial distribution maps were generated for 34 algal taxa. Prediction accuracy was further validated by comparing model outputs with morphological survey data, revealing a significant positive correlation (Spearman’s correlation coefficient 0.366–0.709, p < 0.05) for 75% of the species. By integrating spatial mapping of algal distributions and FAI with principal component regression, the contributions of various algae taxa to the overall community structure were quantified across different regions. Nostocales and Stephanodiscales were identified as the key taxa driving FAI variations throughout Poyang Lake, with the toxic alga Nostocales exerting a greater influence in the northern region compared to other species. This study presents a novel framework for large-scale species-level simulation of algal dynamics, representing a significant advance toward more precise and comprehensive monitoring and management of algal blooms.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
环境科学与技术
环境科学与技术 环境科学-工程:环境
CiteScore
17.50
自引率
9.60%
发文量
12359
审稿时长
2.8 months
期刊介绍: Environmental Science & Technology (ES&T) is a co-sponsored academic and technical magazine by the Hubei Provincial Environmental Protection Bureau and the Hubei Provincial Academy of Environmental Sciences. Environmental Science & Technology (ES&T) holds the status of Chinese core journals, scientific papers source journals of China, Chinese Science Citation Database source journals, and Chinese Academic Journal Comprehensive Evaluation Database source journals. This publication focuses on the academic field of environmental protection, featuring articles related to environmental protection and technical advancements.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信