{"title":"基于多源数据方法估算季节性湖泊总氮浓度的新方法","authors":"Xianqiang Xia , Jiayi Pan , Jintao Pei","doi":"10.1016/j.ecoinf.2024.102807","DOIUrl":null,"url":null,"abstract":"<div><p>Nitrogen, a key limiter in lake eutrophication, presents serious threats to both human health and ecological balance. Despite its non-optically active nature, this study introduces an advanced retrieval approach for total nitrogen, utilizing a synthesis of multi-source data and sophisticated machine learning algorithms to markedly boost estimation precision. This innovative method integrates environmental variables, such as water temperature, depth, and flow rate with spectral reflectance, significantly enhancing the predictive accuracy of our machine learning models with high stability. The models tested, including Support Vector Machine (SVM), Random Forest (RF), Extreme Gradient Boosting (XGB), Multilayer Perceptron (MLP), and Convolutional Neural Networks (CNN), with XGB outperforming others by achieving robust metrics: an R<sup>2</sup> of 0.78, a Mean Absolute Error (MAE) of 0.21 mg/L, and a Mean Absolute Percentage Error (MAPE) of 16.04 %. Applying the optimized XGB model, we documented fluctuations in nitrogen concentrations within Poyang Lake across different hydrological phases in 2021, revealing the lowest nitrogen levels during the flood season and the highest in low water periods, with high concentrations at the inlets of the North Branch of the Ganjiang River and the Raohe River estuaries. Monte Carlo simulations reveal that the model is not much sensitive to input feature errors, validating its stability. The approach proposed in this study may help more precise total nitrogen retrieval in other similar lake waters.</p></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":null,"pages":null},"PeriodicalIF":5.8000,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1574954124003492/pdfft?md5=3fbee532adeef089e743a635d2cf4108&pid=1-s2.0-S1574954124003492-main.pdf","citationCount":"0","resultStr":"{\"title\":\"A new approach to estimate total nitrogen concentration in a seasonal lake based on multi-source data methodology\",\"authors\":\"Xianqiang Xia , Jiayi Pan , Jintao Pei\",\"doi\":\"10.1016/j.ecoinf.2024.102807\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Nitrogen, a key limiter in lake eutrophication, presents serious threats to both human health and ecological balance. Despite its non-optically active nature, this study introduces an advanced retrieval approach for total nitrogen, utilizing a synthesis of multi-source data and sophisticated machine learning algorithms to markedly boost estimation precision. This innovative method integrates environmental variables, such as water temperature, depth, and flow rate with spectral reflectance, significantly enhancing the predictive accuracy of our machine learning models with high stability. The models tested, including Support Vector Machine (SVM), Random Forest (RF), Extreme Gradient Boosting (XGB), Multilayer Perceptron (MLP), and Convolutional Neural Networks (CNN), with XGB outperforming others by achieving robust metrics: an R<sup>2</sup> of 0.78, a Mean Absolute Error (MAE) of 0.21 mg/L, and a Mean Absolute Percentage Error (MAPE) of 16.04 %. Applying the optimized XGB model, we documented fluctuations in nitrogen concentrations within Poyang Lake across different hydrological phases in 2021, revealing the lowest nitrogen levels during the flood season and the highest in low water periods, with high concentrations at the inlets of the North Branch of the Ganjiang River and the Raohe River estuaries. Monte Carlo simulations reveal that the model is not much sensitive to input feature errors, validating its stability. The approach proposed in this study may help more precise total nitrogen retrieval in other similar lake waters.</p></div>\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2024-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S1574954124003492/pdfft?md5=3fbee532adeef089e743a635d2cf4108&pid=1-s2.0-S1574954124003492-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574954124003492\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954124003492","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
A new approach to estimate total nitrogen concentration in a seasonal lake based on multi-source data methodology
Nitrogen, a key limiter in lake eutrophication, presents serious threats to both human health and ecological balance. Despite its non-optically active nature, this study introduces an advanced retrieval approach for total nitrogen, utilizing a synthesis of multi-source data and sophisticated machine learning algorithms to markedly boost estimation precision. This innovative method integrates environmental variables, such as water temperature, depth, and flow rate with spectral reflectance, significantly enhancing the predictive accuracy of our machine learning models with high stability. The models tested, including Support Vector Machine (SVM), Random Forest (RF), Extreme Gradient Boosting (XGB), Multilayer Perceptron (MLP), and Convolutional Neural Networks (CNN), with XGB outperforming others by achieving robust metrics: an R2 of 0.78, a Mean Absolute Error (MAE) of 0.21 mg/L, and a Mean Absolute Percentage Error (MAPE) of 16.04 %. Applying the optimized XGB model, we documented fluctuations in nitrogen concentrations within Poyang Lake across different hydrological phases in 2021, revealing the lowest nitrogen levels during the flood season and the highest in low water periods, with high concentrations at the inlets of the North Branch of the Ganjiang River and the Raohe River estuaries. Monte Carlo simulations reveal that the model is not much sensitive to input feature errors, validating its stability. The approach proposed in this study may help more precise total nitrogen retrieval in other similar lake waters.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.