Comparative assessment of machine learning algorithms for retrieving colored dissolved organic matter (CDOM) from Sentinel-2/MSI images in the coastal waters of the Persian Gulf

IF 5.8 2区 环境科学与生态学 Q1 ECOLOGY
Bonyad Ahmadi , Mehdi Gholamalifard , Seyed Mahmoud Ghasempouri , Tiit Kutser
{"title":"Comparative assessment of machine learning algorithms for retrieving colored dissolved organic matter (CDOM) from Sentinel-2/MSI images in the coastal waters of the Persian Gulf","authors":"Bonyad Ahmadi ,&nbsp;Mehdi Gholamalifard ,&nbsp;Seyed Mahmoud Ghasempouri ,&nbsp;Tiit Kutser","doi":"10.1016/j.ecoinf.2025.103171","DOIUrl":null,"url":null,"abstract":"<div><div>Colored Dissolved Organic Matter, a pivotal component of aquatic biogeochemical cycles, plays a critical role in regulating water quality and ecosystem functionality. This study provides the first comprehensive assessment of CDOM dynamics in the Persian Gulf's industrialized coastal waters, focusing on the Pars Special Economic Energy Zone (PSEEZ)—a global energy epicenter and the world's largest natural gas reserve. Seasonal field campaigns conducted in 2023 acquired 199 in situ samples stratified across four seasons (Spring: <em>n</em> = 62, Summer: <em>n</em> = 18, Fall: <em>n</em> = 55, Winter: <em>n</em> = 64) using a CTD-integrated Cyclops-7 fluorometer. Sampling intervals were methodologically synchronized with satellite overpasses (±3 h) to minimize temporal discrepancies between ground-truth measurements and remotely sensed data, thereby ensuring spatiotemporal coherence essential for robust algorithm calibration and validation. Contrary to expectations, CDOM concentrations in petrochemical-influenced areas (e.g., stations P7: 0.29 ppb, P13: 0.35 ppb) were markedly lower than in natural mangrove ecosystems (stations N13: 19.61 ppb, NA2: 12.91 ppb), underscoring the antagonistic effects of industrial pollutants on organic matter stability. Initial CDOM retrieval algorithms yielded suboptimal accuracy (MAE = 1.16, RMSLE = 1.2). A regionally tuned band ratio algorithm improved performance by 27 % (MAE = 0.85) and 22 % (RMSLE = 0.94). Machine learning models further enhanced retrievals, with the Mixture Density Network (MDN) emerging as the superior framework. The MDN achieved an RMSLE of 0.47 (17.5 % improvement over MLP, 14.5 % over SVM) and reduced systematic bias (SSPB) by 26.12 units compared to Bayesian Ridge Regression (BRR), outperforming conventional models like SVM (MAE = 0.61, RMSLE = 0.55). While the MDN exhibited marginally higher absolute error (MAE = 0.53) than deterministic models, its probabilistic architecture uniquely addressed the Persian Gulf's optical complexity, characterized by overlapping signals from SGD-driven organics, hydrocarbon plumes, and sediment resuspension. This study establishes MDN as a transformative tool for CDOM retrieval in optically heterogeneous, anthropogenically stressed waters, while advocating for regionally adaptive frameworks to advance precision water quality monitoring in critical marine ecosystems.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"89 ","pages":"Article 103171"},"PeriodicalIF":5.8000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125001803","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Colored Dissolved Organic Matter, a pivotal component of aquatic biogeochemical cycles, plays a critical role in regulating water quality and ecosystem functionality. This study provides the first comprehensive assessment of CDOM dynamics in the Persian Gulf's industrialized coastal waters, focusing on the Pars Special Economic Energy Zone (PSEEZ)—a global energy epicenter and the world's largest natural gas reserve. Seasonal field campaigns conducted in 2023 acquired 199 in situ samples stratified across four seasons (Spring: n = 62, Summer: n = 18, Fall: n = 55, Winter: n = 64) using a CTD-integrated Cyclops-7 fluorometer. Sampling intervals were methodologically synchronized with satellite overpasses (±3 h) to minimize temporal discrepancies between ground-truth measurements and remotely sensed data, thereby ensuring spatiotemporal coherence essential for robust algorithm calibration and validation. Contrary to expectations, CDOM concentrations in petrochemical-influenced areas (e.g., stations P7: 0.29 ppb, P13: 0.35 ppb) were markedly lower than in natural mangrove ecosystems (stations N13: 19.61 ppb, NA2: 12.91 ppb), underscoring the antagonistic effects of industrial pollutants on organic matter stability. Initial CDOM retrieval algorithms yielded suboptimal accuracy (MAE = 1.16, RMSLE = 1.2). A regionally tuned band ratio algorithm improved performance by 27 % (MAE = 0.85) and 22 % (RMSLE = 0.94). Machine learning models further enhanced retrievals, with the Mixture Density Network (MDN) emerging as the superior framework. The MDN achieved an RMSLE of 0.47 (17.5 % improvement over MLP, 14.5 % over SVM) and reduced systematic bias (SSPB) by 26.12 units compared to Bayesian Ridge Regression (BRR), outperforming conventional models like SVM (MAE = 0.61, RMSLE = 0.55). While the MDN exhibited marginally higher absolute error (MAE = 0.53) than deterministic models, its probabilistic architecture uniquely addressed the Persian Gulf's optical complexity, characterized by overlapping signals from SGD-driven organics, hydrocarbon plumes, and sediment resuspension. This study establishes MDN as a transformative tool for CDOM retrieval in optically heterogeneous, anthropogenically stressed waters, while advocating for regionally adaptive frameworks to advance precision water quality monitoring in critical marine ecosystems.
从波斯湾沿海水域的Sentinel-2/MSI图像中检索彩色溶解有机物(CDOM)的机器学习算法的比较评估
有色溶解有机质是水生生物地球化学循环的重要组成部分,在调节水质和生态系统功能方面发挥着重要作用。该研究首次对波斯湾工业化沿海水域的CDOM动态进行了全面评估,重点关注Pars经济能源特区(PSEEZ)——全球能源中心和世界上最大的天然气储量。在2023年进行的季节性野外活动中,使用集成ctd的Cyclops-7荧光计获得了199个原位样品,分四季分层(春季:n = 62,夏季:n = 18,秋季:n = 55,冬季:n = 64)。采样间隔在方法上与卫星立交桥同步(±3小时),以尽量减少地面真实测量值与遥感数据之间的时间差异,从而确保对鲁棒算法校准和验证至关重要的时空一致性。与预期相反,受石化影响地区(例如,P7站:0.29 ppb, P13站:0.35 ppb)的CDOM浓度明显低于天然红树林生态系统(N13站:19.61 ppb, NA2站:12.91 ppb),强调了工业污染物对有机物稳定性的拮抗作用。初始的CDOM检索算法产生了次优的准确性(MAE = 1.16, RMSLE = 1.2)。区域调谐带比算法提高了27% (MAE = 0.85)和22% (RMSLE = 0.94)的性能。机器学习模型进一步增强了检索,混合密度网络(MDN)成为优越的框架。与贝叶斯岭回归(BRR)相比,MDN的RMSLE为0.47(比MLP提高17.5%,比SVM提高14.5%),减少了26.12个单位的系统偏差(SSPB),优于SVM等传统模型(MAE = 0.61, RMSLE = 0.55)。虽然MDN的绝对误差(MAE = 0.53)略高于确定性模型,但其概率结构独特地解决了波斯湾的光学复杂性,其特征是sgd驱动的有机物,碳氢化合物烟雾和沉积物再悬浮的重叠信号。本研究将MDN作为一种变革性的工具,用于在光学异质性、人为压力的水域中检索CDOM,同时倡导区域自适应框架,以推进关键海洋生态系统的精确水质监测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Ecological Informatics
Ecological Informatics 环境科学-生态学
CiteScore
8.30
自引率
11.80%
发文量
346
审稿时长
46 days
期刊介绍: The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change. The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信