基于机器学习和Sentinel-3图像的呼伦湖不同藻类生物量丰度时空变化

IF 3.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Zhaojiang Yan, Chong Fang, Kaishan Song, Xiangyu Wang, Zhidan Wen, Yingxin Shang, Hui Tao, Yunfeng Lyu
{"title":"基于机器学习和Sentinel-3图像的呼伦湖不同藻类生物量丰度时空变化","authors":"Zhaojiang Yan, Chong Fang, Kaishan Song, Xiangyu Wang, Zhidan Wen, Yingxin Shang, Hui Tao, Yunfeng Lyu","doi":"10.1038/s41598-025-87338-4","DOIUrl":null,"url":null,"abstract":"<p><p>Climate change and human activities affect the biomass of different algal and the succession of dominant species. In the past, phytoplankton phyla inversion has been focused on oceanic and continental shelf waters, while phytoplankton phyla inversion in inland lakes and reservoirs is still in the initial and exploratory stage, and the research results are relatively few. Especially for mid-to-high latitude lakes, the research is even more blank. Therefore, this study proposes a machine learning method based on OLCI/Sentinel-3 satellite imagery to retrieve algal biomass abundance. Remote sensing models were developed to estimate the biomass abundance of three major algal groups: Cyanophyta, Chlorophyta, and Bacillariophyta. This study compared and evaluated 6 commonly used machine learning models, including extreme gradient boosting (XGBoost), support vector regression (SVR), backpropagation neural network (BP), gradient boosting decision tree (GBDT), random forest (RF), and categorical boosting (CatBoost). The results indicated that XGBoost exhibited the highest accuracy (R<sup>2</sup> = 0.92, RMSE = 1.78%, MAPE = 9.96%) in estimating Cyanophyta's biomass abundance. The RF model demonstrated the highest accuracy for estimating Chlorophyta's biomass abundance (R<sup>2</sup> = 0.72, RMSE = 6.57%, MAPE = 50.8%), while the GBDT model exhibited the highest accuracy for estimating Bacillariophyta's biomass abundance (R<sup>2</sup> = 0.9, RMSE = 4.66%, MAPE = 47.87%). The models were subsequently applied to all cloud-free OLCI images from Hulun Lake during the ice-free periods from 2016 to 2023, producing spatiotemporal distribution maps of the different phytoplankton biomass abundance. Cyanophyta dominated the biomass abundance (44.62 ± 3.47%), followed by Bacillariophyta (36.35 ± 2.68%), and Chlorophyta had the lowest proportion (10.42 ± 1.08%). Together, these three algae groups constituted 91.4 ± 1.55% of all phytoplankton in Hulun Lake. Significant annual variations in the biomass abundance of Cyanophyta and Bacillariophyta were observed, whereas those of Chlorophyta remained stable. Additionally, this study examined the effects of climatic factors and water quality parameters on the biomass abundance of algae. The findings suggest that temperature, wind speed, and atmospheric pressure are critical factors influencing the biomass abundance of the different algae groups. This study not only fills the gaps in the related field, but also provides a new method for monitoring algae, as well as a strong support for realizing the goals of sustainable management of water resources and ecological protection.</p>","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"2739"},"PeriodicalIF":3.9000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11751342/pdf/","citationCount":"0","resultStr":"{\"title\":\"Spatiotemporal variation in biomass abundance of different algal species in Lake Hulun using machine learning and Sentinel-3 images.\",\"authors\":\"Zhaojiang Yan, Chong Fang, Kaishan Song, Xiangyu Wang, Zhidan Wen, Yingxin Shang, Hui Tao, Yunfeng Lyu\",\"doi\":\"10.1038/s41598-025-87338-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Climate change and human activities affect the biomass of different algal and the succession of dominant species. In the past, phytoplankton phyla inversion has been focused on oceanic and continental shelf waters, while phytoplankton phyla inversion in inland lakes and reservoirs is still in the initial and exploratory stage, and the research results are relatively few. Especially for mid-to-high latitude lakes, the research is even more blank. Therefore, this study proposes a machine learning method based on OLCI/Sentinel-3 satellite imagery to retrieve algal biomass abundance. Remote sensing models were developed to estimate the biomass abundance of three major algal groups: Cyanophyta, Chlorophyta, and Bacillariophyta. This study compared and evaluated 6 commonly used machine learning models, including extreme gradient boosting (XGBoost), support vector regression (SVR), backpropagation neural network (BP), gradient boosting decision tree (GBDT), random forest (RF), and categorical boosting (CatBoost). The results indicated that XGBoost exhibited the highest accuracy (R<sup>2</sup> = 0.92, RMSE = 1.78%, MAPE = 9.96%) in estimating Cyanophyta's biomass abundance. The RF model demonstrated the highest accuracy for estimating Chlorophyta's biomass abundance (R<sup>2</sup> = 0.72, RMSE = 6.57%, MAPE = 50.8%), while the GBDT model exhibited the highest accuracy for estimating Bacillariophyta's biomass abundance (R<sup>2</sup> = 0.9, RMSE = 4.66%, MAPE = 47.87%). The models were subsequently applied to all cloud-free OLCI images from Hulun Lake during the ice-free periods from 2016 to 2023, producing spatiotemporal distribution maps of the different phytoplankton biomass abundance. Cyanophyta dominated the biomass abundance (44.62 ± 3.47%), followed by Bacillariophyta (36.35 ± 2.68%), and Chlorophyta had the lowest proportion (10.42 ± 1.08%). Together, these three algae groups constituted 91.4 ± 1.55% of all phytoplankton in Hulun Lake. Significant annual variations in the biomass abundance of Cyanophyta and Bacillariophyta were observed, whereas those of Chlorophyta remained stable. Additionally, this study examined the effects of climatic factors and water quality parameters on the biomass abundance of algae. The findings suggest that temperature, wind speed, and atmospheric pressure are critical factors influencing the biomass abundance of the different algae groups. This study not only fills the gaps in the related field, but also provides a new method for monitoring algae, as well as a strong support for realizing the goals of sustainable management of water resources and ecological protection.</p>\",\"PeriodicalId\":21811,\"journal\":{\"name\":\"Scientific Reports\",\"volume\":\"15 1\",\"pages\":\"2739\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11751342/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Reports\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41598-025-87338-4\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-87338-4","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

气候变化和人类活动影响了不同藻类的生物量和优势种的演替。过去,浮游植物门的反演主要集中在海洋和大陆架水域,而内陆湖泊和水库的浮游植物门的反演还处于起步和探索阶段,研究成果相对较少。特别是对于中高纬度湖泊,研究更是空白。因此,本研究提出了一种基于OLCI/Sentinel-3卫星图像的机器学习方法来检索藻类生物量丰度。利用遥感模型估算了三种主要藻类群:蓝藻、绿藻和硅藻的生物量丰度。本研究比较并评估了6种常用的机器学习模型,包括极端梯度增强(XGBoost)、支持向量回归(SVR)、反向传播神经网络(BP)、梯度增强决策树(GBDT)、随机森林(RF)和分类增强(CatBoost)。结果表明,XGBoost对蓝藻生物量丰度的估计精度最高(R2 = 0.92, RMSE = 1.78%, MAPE = 9.96%)。RF模型对绿藻生物量丰度的估计精度最高(R2 = 0.72, RMSE = 6.57%, MAPE = 50.8%), GBDT模型对硅藻生物量丰度的估计精度最高(R2 = 0.9, RMSE = 4.66%, MAPE = 47.87%)。随后,将该模型应用于2016 - 2023年无冰期呼伦湖所有无云OLCI图像,得到不同浮游植物生物量丰度的时空分布图。藻门生物量丰度最高(44.62±3.47%),硅藻门次之(36.35±2.68%),绿藻门最低(10.42±1.08%)。这3类藻类合计占呼伦湖浮游植物总数的91.4±1.55%。蓝藻和硅藻的生物量丰度呈显著的年变化,而绿藻的生物量丰度保持稳定。此外,本研究还考察了气候因素和水质参数对藻类生物量丰度的影响。研究结果表明,温度、风速和大气压力是影响不同藻类种群生物量丰度的关键因素。本研究不仅填补了相关领域的空白,而且为藻类监测提供了一种新的方法,也为实现水资源可持续管理和生态保护的目标提供了有力支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Spatiotemporal variation in biomass abundance of different algal species in Lake Hulun using machine learning and Sentinel-3 images.

Climate change and human activities affect the biomass of different algal and the succession of dominant species. In the past, phytoplankton phyla inversion has been focused on oceanic and continental shelf waters, while phytoplankton phyla inversion in inland lakes and reservoirs is still in the initial and exploratory stage, and the research results are relatively few. Especially for mid-to-high latitude lakes, the research is even more blank. Therefore, this study proposes a machine learning method based on OLCI/Sentinel-3 satellite imagery to retrieve algal biomass abundance. Remote sensing models were developed to estimate the biomass abundance of three major algal groups: Cyanophyta, Chlorophyta, and Bacillariophyta. This study compared and evaluated 6 commonly used machine learning models, including extreme gradient boosting (XGBoost), support vector regression (SVR), backpropagation neural network (BP), gradient boosting decision tree (GBDT), random forest (RF), and categorical boosting (CatBoost). The results indicated that XGBoost exhibited the highest accuracy (R2 = 0.92, RMSE = 1.78%, MAPE = 9.96%) in estimating Cyanophyta's biomass abundance. The RF model demonstrated the highest accuracy for estimating Chlorophyta's biomass abundance (R2 = 0.72, RMSE = 6.57%, MAPE = 50.8%), while the GBDT model exhibited the highest accuracy for estimating Bacillariophyta's biomass abundance (R2 = 0.9, RMSE = 4.66%, MAPE = 47.87%). The models were subsequently applied to all cloud-free OLCI images from Hulun Lake during the ice-free periods from 2016 to 2023, producing spatiotemporal distribution maps of the different phytoplankton biomass abundance. Cyanophyta dominated the biomass abundance (44.62 ± 3.47%), followed by Bacillariophyta (36.35 ± 2.68%), and Chlorophyta had the lowest proportion (10.42 ± 1.08%). Together, these three algae groups constituted 91.4 ± 1.55% of all phytoplankton in Hulun Lake. Significant annual variations in the biomass abundance of Cyanophyta and Bacillariophyta were observed, whereas those of Chlorophyta remained stable. Additionally, this study examined the effects of climatic factors and water quality parameters on the biomass abundance of algae. The findings suggest that temperature, wind speed, and atmospheric pressure are critical factors influencing the biomass abundance of the different algae groups. This study not only fills the gaps in the related field, but also provides a new method for monitoring algae, as well as a strong support for realizing the goals of sustainable management of water resources and ecological protection.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Scientific Reports
Scientific Reports Natural Science Disciplines-
CiteScore
7.50
自引率
4.30%
发文量
19567
审稿时长
3.9 months
期刊介绍: We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections. Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021). •Engineering Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live. •Physical sciences Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics. •Earth and environmental sciences Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems. •Biological sciences Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants. •Health sciences The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信