Information extraction of seasonal dissolved oxygen in urban water bodies based on machine learning using sentinel-2 imagery: An open access application in Baiyangdian Lake
Leilei Shi, Chen Gao, Tuo Wang, Lixiang Liu, Yue Wu, Xiaogang You
{"title":"Information extraction of seasonal dissolved oxygen in urban water bodies based on machine learning using sentinel-2 imagery: An open access application in Baiyangdian Lake","authors":"Leilei Shi, Chen Gao, Tuo Wang, Lixiang Liu, Yue Wu, Xiaogang You","doi":"10.1016/j.ecoinf.2024.102782","DOIUrl":null,"url":null,"abstract":"Water bodies are crucial components of urban ecology. The development of rapid and timely water-quality assessment tools using easily measured variables is essential for the health management of urban water bodies. In this study, we focused on the dissolved oxygen (DO) of Baiyangdian Lake using 251 sets of empirically measured water quality data and corresponding Sentinel-2 satellite images. Nine machine learning algorithms were then used to develop a rapid detection algorithm for the spatial distribution of the DO concentration in Baiyangdian Lake. This study successfully applied these methods to invert the DO concentration in Baiyangdian Lake during spring, summer, and autumn. The results indicated that extra tree regression (ETR) provided the most accurate and stable results for inverting the DO concentration among the nine machine learning methods. In contrast, AdaBoost regression (ABR), Bayesian ridge regression (BRR), and support vector machines (SVM) exhibit relatively poor regression performance and lack sensitivity to DO concentrations. Moreover, the DO concentration in Baiyangdian Lake ranged from approximately 0 to 12 mg/L, with notable spatiotemporal variations. The highest overall DO concentration was observed in the spring, particularly in the southern region. The DO concentration significantly decreased during summer compared to that in spring, with higher values in the southwestern area and lower values in the northern region. The DO concentration reached its lowest value in autumn, with slightly higher values in the southern region. This study focused on the estimation and inversion of DO concentrations in the water bodies of Baiyangdian Lake. By introducing and comparing the performances of commonly used machine learning models, a rapid estimation of the DO concentration was achieved, thereby overcoming the limitations of traditional water quality monitoring methods in DO inversion. It not only intuitively explained the temporal and spatial variation patterns of DO concentration but also laid a foundation for further in-depth exploration of the interactions between DO and other water quality parameters.","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"25 1","pages":""},"PeriodicalIF":5.8000,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.ecoinf.2024.102782","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Water bodies are crucial components of urban ecology. The development of rapid and timely water-quality assessment tools using easily measured variables is essential for the health management of urban water bodies. In this study, we focused on the dissolved oxygen (DO) of Baiyangdian Lake using 251 sets of empirically measured water quality data and corresponding Sentinel-2 satellite images. Nine machine learning algorithms were then used to develop a rapid detection algorithm for the spatial distribution of the DO concentration in Baiyangdian Lake. This study successfully applied these methods to invert the DO concentration in Baiyangdian Lake during spring, summer, and autumn. The results indicated that extra tree regression (ETR) provided the most accurate and stable results for inverting the DO concentration among the nine machine learning methods. In contrast, AdaBoost regression (ABR), Bayesian ridge regression (BRR), and support vector machines (SVM) exhibit relatively poor regression performance and lack sensitivity to DO concentrations. Moreover, the DO concentration in Baiyangdian Lake ranged from approximately 0 to 12 mg/L, with notable spatiotemporal variations. The highest overall DO concentration was observed in the spring, particularly in the southern region. The DO concentration significantly decreased during summer compared to that in spring, with higher values in the southwestern area and lower values in the northern region. The DO concentration reached its lowest value in autumn, with slightly higher values in the southern region. This study focused on the estimation and inversion of DO concentrations in the water bodies of Baiyangdian Lake. By introducing and comparing the performances of commonly used machine learning models, a rapid estimation of the DO concentration was achieved, thereby overcoming the limitations of traditional water quality monitoring methods in DO inversion. It not only intuitively explained the temporal and spatial variation patterns of DO concentration but also laid a foundation for further in-depth exploration of the interactions between DO and other water quality parameters.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.