A machine learning algorithm to retrieve the red peak of phytoplankton absorption spectra from ocean-colour remote sensing

IF 4.5 Q2 ENVIRONMENTAL SCIENCES
Mohammad Ashphaq, Shovonlal Roy
{"title":"A machine learning algorithm to retrieve the red peak of phytoplankton absorption spectra from ocean-colour remote sensing","authors":"Mohammad Ashphaq,&nbsp;Shovonlal Roy","doi":"10.1016/j.rsase.2025.101702","DOIUrl":null,"url":null,"abstract":"<div><div>Light absorption by microscopic phytoplankton in marine ecosystems is a crucial process underpinning biological production and global biogeochemical cycles. Accurate estimation of phytoplankton absorption coefficients, an inherent optical property of ocean water, can improve remote sensing applications including spectral photosynthesis models and assessments of ocean health, biodiversity, and climate change impacts. However, considerable uncertainty exists in current satellite retrievals of phytoplankton absorption coefficients, particularly for <em>ɑ</em><sub><em>ph</em></sub>(676) - the phytoplankton absorption peak at red wavelengths near 676 nm - which is an input to several novel and advanced satellite algorithms. This uncertainty hinders operational use of algorithms for assessing phytoplankton physiology, size structure and oceanic carbon pools from space. We aimed to improve satellite-based estimation of <em>ɑ</em><sub><em>ph</em></sub> (676) using advanced machine learning (ML) techniques. We compiled a comprehensive <em>in situ</em> dataset (n = 1576) of <em>ɑ</em><sub><em>ph</em></sub>(676) from published databases and matched with remote-sensing reflectance <em>Rrs</em> at six wavelengths (412, 443, 490, 510, 560, and 665 nm) from the Ocean Colour Climate Change Initiative. We extensively evaluated multiple base ML algorithms: Random Forest (RF), Gradient Boosting Machines, and Linear Regression; and implemented ensemble ML models: RF with Grid Search Cross-Validation, eXtreme Gradient Boosting Ensembled Model, Ensemble Forecast, Stacked Voting, Optimised Ensemble and Meta Stacking, integrating the base models through cross-validated hyperparameter tuning. Meta Stacking outperformed individual ML models in predictive accuracy across temporal resolutions, showing best results with daily composites. Our study addresses key limitations of previous models, including small training datasets, inconsistent performances, and lack of ensemble comparisons. We present a robust, extensively trained and validated ensemble ML model that significantly improves <em>ɑ</em><sub><em>ph</em></sub>(676) estimation and opens the possibility of routinely using in ocean colour remote sensing.</div></div>","PeriodicalId":53227,"journal":{"name":"Remote Sensing Applications-Society and Environment","volume":"39 ","pages":"Article 101702"},"PeriodicalIF":4.5000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing Applications-Society and Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352938525002551","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Light absorption by microscopic phytoplankton in marine ecosystems is a crucial process underpinning biological production and global biogeochemical cycles. Accurate estimation of phytoplankton absorption coefficients, an inherent optical property of ocean water, can improve remote sensing applications including spectral photosynthesis models and assessments of ocean health, biodiversity, and climate change impacts. However, considerable uncertainty exists in current satellite retrievals of phytoplankton absorption coefficients, particularly for ɑph(676) - the phytoplankton absorption peak at red wavelengths near 676 nm - which is an input to several novel and advanced satellite algorithms. This uncertainty hinders operational use of algorithms for assessing phytoplankton physiology, size structure and oceanic carbon pools from space. We aimed to improve satellite-based estimation of ɑph (676) using advanced machine learning (ML) techniques. We compiled a comprehensive in situ dataset (n = 1576) of ɑph(676) from published databases and matched with remote-sensing reflectance Rrs at six wavelengths (412, 443, 490, 510, 560, and 665 nm) from the Ocean Colour Climate Change Initiative. We extensively evaluated multiple base ML algorithms: Random Forest (RF), Gradient Boosting Machines, and Linear Regression; and implemented ensemble ML models: RF with Grid Search Cross-Validation, eXtreme Gradient Boosting Ensembled Model, Ensemble Forecast, Stacked Voting, Optimised Ensemble and Meta Stacking, integrating the base models through cross-validated hyperparameter tuning. Meta Stacking outperformed individual ML models in predictive accuracy across temporal resolutions, showing best results with daily composites. Our study addresses key limitations of previous models, including small training datasets, inconsistent performances, and lack of ensemble comparisons. We present a robust, extensively trained and validated ensemble ML model that significantly improves ɑph(676) estimation and opens the possibility of routinely using in ocean colour remote sensing.
海洋色彩遥感中浮游植物吸收光谱红色峰的机器学习算法
海洋生态系统中微小浮游植物的光吸收是支撑生物生产和全球生物地球化学循环的重要过程。准确估计浮游植物吸收系数(海水固有的光学特性)可以改善遥感应用,包括光谱光合作用模型和海洋健康、生物多样性和气候变化影响评估。然而,目前浮游植物吸收系数的卫星反演存在相当大的不确定性,特别是浮游植物吸收峰在676 nm附近的红色波长,这是几个新颖和先进的卫星算法的输入。这种不确定性阻碍了从空间评估浮游植物生理、大小结构和海洋碳库的算法的实际使用。我们的目标是使用先进的机器学习(ML)技术改进基于卫星的估算。我们从已发表的数据库中编译了一个完整的原位数据集(n = 1576),并与来自海洋颜色气候变化倡议的6个波长(412、443、490、510、560和665 nm)的遥感反射率Rrs进行了匹配。我们广泛评估了多种基本机器学习算法:随机森林(RF)、梯度增强机和线性回归;并实现了集成ML模型:具有网格搜索交叉验证的RF,极端梯度增强集成模型,集成预测,堆叠投票,优化集成和Meta堆叠,通过交叉验证的超参数调优集成基本模型。Meta Stacking在跨时间分辨率的预测准确性方面优于单个ML模型,在日常复合材料中显示出最佳结果。我们的研究解决了以前模型的主要局限性,包括小的训练数据集、不一致的性能和缺乏整体比较。我们提出了一个鲁棒的,经过广泛训练和验证的集成ML模型,该模型显着提高了ph(676)估计,并打开了在海洋颜色遥感中常规使用的可能性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.00
自引率
8.50%
发文量
204
审稿时长
65 days
期刊介绍: The journal ''Remote Sensing Applications: Society and Environment'' (RSASE) focuses on remote sensing studies that address specific topics with an emphasis on environmental and societal issues - regional / local studies with global significance. Subjects are encouraged to have an interdisciplinary approach and include, but are not limited by: " -Global and climate change studies addressing the impact of increasing concentrations of greenhouse gases, CO2 emission, carbon balance and carbon mitigation, energy system on social and environmental systems -Ecological and environmental issues including biodiversity, ecosystem dynamics, land degradation, atmospheric and water pollution, urban footprint, ecosystem management and natural hazards (e.g. earthquakes, typhoons, floods, landslides) -Natural resource studies including land-use in general, biomass estimation, forests, agricultural land, plantation, soils, coral reefs, wetland and water resources -Agriculture, food production systems and food security outcomes -Socio-economic issues including urban systems, urban growth, public health, epidemics, land-use transition and land use conflicts -Oceanography and coastal zone studies, including sea level rise projections, coastlines changes and the ocean-land interface -Regional challenges for remote sensing application techniques, monitoring and analysis, such as cloud screening and atmospheric correction for tropical regions -Interdisciplinary studies combining remote sensing, household survey data, field measurements and models to address environmental, societal and sustainability issues -Quantitative and qualitative analysis that documents the impact of using remote sensing studies in social, political, environmental or economic systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信