{"title":"Using Machine Learning to Uncover Ecological Mechanisms Controlling Abundance of Phytoplankton Size Classes From Large-Scale Observations","authors":"Sandupal Dutta, Anand Gnanadesikan","doi":"10.1029/2025GB009036","DOIUrl":null,"url":null,"abstract":"<p>Phytoplankton size classes (PSCs) and Phytoplankton functional types (PFTs) determine many fundamental biogeochemical processes including nutrient uptake, energy transfer through marine food webs, ocean carbon export, and gas exchange with the atmosphere. Discerning the causes of spatio-temporal variability of PSCs is a scientific priority for understanding the ocean's role in and response to climate change. This study intends to decipher the relationships between the abundance of PSCs and environmental predictors using explainable machine learning (XAI) techniques. The target variables were PSCs obtained using three different satellite products: size-resolved phytoplankton carbon from Kostadinov, Milutinović, et al. (2016), https://doi.org/10.5194/os-12-561-2016, chlorophyll from MODIS divided according to the algorithm of Hirata et al. (2011), https://doi.org/10.5194/bg-8-311-2011, and a third product from the Copernicus Marine Service. The environmental predictors were nutrients, light, mixed layer depth, salinity, sea surface temperature (sst), and upwelling. The ML algorithm used was the Random Forest Regressor (RFR). XAI techniques were used to discern the relationship between predictors and PSCs abundance. About 85%–95% of the variability of the size classes in the observational data sets was accounted for by environmental variables known to influence phytoplankton biomass. Although different size classes responded similarly to the environmental drivers (with the exception of Copernicus picoplankton) their scale of response varied. The dominant predictors were found to be shortwave radiation, ammonia, dissolved iron and sea surface temperature. The different satellite products show sensitivity to iron, shortwave radiation, sst and ammonia across the same range of values, but with different magnitudes. Copernicus picoplankton is the only product which is positively related to sst.</p>","PeriodicalId":12729,"journal":{"name":"Global Biogeochemical Cycles","volume":"40 4","pages":""},"PeriodicalIF":5.5000,"publicationDate":"2026-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2025GB009036","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Global Biogeochemical Cycles","FirstCategoryId":"89","ListUrlMain":"https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2025GB009036","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Phytoplankton size classes (PSCs) and Phytoplankton functional types (PFTs) determine many fundamental biogeochemical processes including nutrient uptake, energy transfer through marine food webs, ocean carbon export, and gas exchange with the atmosphere. Discerning the causes of spatio-temporal variability of PSCs is a scientific priority for understanding the ocean's role in and response to climate change. This study intends to decipher the relationships between the abundance of PSCs and environmental predictors using explainable machine learning (XAI) techniques. The target variables were PSCs obtained using three different satellite products: size-resolved phytoplankton carbon from Kostadinov, Milutinović, et al. (2016), https://doi.org/10.5194/os-12-561-2016, chlorophyll from MODIS divided according to the algorithm of Hirata et al. (2011), https://doi.org/10.5194/bg-8-311-2011, and a third product from the Copernicus Marine Service. The environmental predictors were nutrients, light, mixed layer depth, salinity, sea surface temperature (sst), and upwelling. The ML algorithm used was the Random Forest Regressor (RFR). XAI techniques were used to discern the relationship between predictors and PSCs abundance. About 85%–95% of the variability of the size classes in the observational data sets was accounted for by environmental variables known to influence phytoplankton biomass. Although different size classes responded similarly to the environmental drivers (with the exception of Copernicus picoplankton) their scale of response varied. The dominant predictors were found to be shortwave radiation, ammonia, dissolved iron and sea surface temperature. The different satellite products show sensitivity to iron, shortwave radiation, sst and ammonia across the same range of values, but with different magnitudes. Copernicus picoplankton is the only product which is positively related to sst.
期刊介绍:
Global Biogeochemical Cycles (GBC) features research on regional to global biogeochemical interactions, as well as more local studies that demonstrate fundamental implications for biogeochemical processing at regional or global scales. Published papers draw on a wide array of methods and knowledge and extend in time from the deep geologic past to recent historical and potential future interactions. This broad scope includes studies that elucidate human activities as interactive components of biogeochemical cycles and physical Earth Systems including climate. Authors are required to make their work accessible to a broad interdisciplinary range of scientists.