Zhangmu Jing , Yi Zhang , Xiaoling Liu , Qingqian Li , Yanji Hao , Yeqing Li , Hongjie Gao
{"title":"基于微生物群落测序和源分类器机器学习的水污染人类活动识别","authors":"Zhangmu Jing , Yi Zhang , Xiaoling Liu , Qingqian Li , Yanji Hao , Yeqing Li , Hongjie Gao","doi":"10.1016/j.envint.2024.109240","DOIUrl":null,"url":null,"abstract":"<div><div>Identifying and differentiating human activities is crucial for effectively preventing the threats posed by environmental pollution to aquatic ecosystems and human health. Machine learning (ML) is a powerful analytical tool for tracking human impacts on river ecosystems based on high-through datasets. This study employed an ML framework and 16S rRNA sequencing data to reveal microbial dynamics and trace human activities across China. The results revealed that the microbial assembly was mainly dominated by deterministic factors (environmental factors and interactions between species), and the metacommunity partition was significantly associated with human activities in both water and sediment (Chi-square test<sub>w</sub> <em>P</em> = 1.93 × 10<sup>-22</sup>; Chi-square test<sub>s</sub> <em>P</em> = 6.00 × 10<sup>-6</sup>). Human activities increased the vulnerability of interspecific occurrence networks and the influence of environmental factors on the OTUs similarity and phylogenetic distance. Combined of microbiological indices (MBIs), microbial relative abundance (MRA), and environmental and geographical indices (EGIs), the source classifier machine learning (SCML) algorithm was used to categorize five human activities (i.e., low human-impact, agricultural inputs, domestic inputs, industrial inputs, and dam construction). The SCML optimal configuration is (MBIs + MRA + EGIs) exhibited strong performance with Test<sub>W</sub> R<sup>2</sup> of 0.882 and Test<sub>S</sub> R<sup>2</sup> of 0.924. This study provides valuable insights for improving ecosystem management, supporting sustainable water resource management and advancing pollution mitigation efforts.</div></div>","PeriodicalId":308,"journal":{"name":"Environment International","volume":"195 ","pages":"Article 109240"},"PeriodicalIF":9.7000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying human activities causing water pollution based on microbial community sequencing and source classifier machine learning\",\"authors\":\"Zhangmu Jing , Yi Zhang , Xiaoling Liu , Qingqian Li , Yanji Hao , Yeqing Li , Hongjie Gao\",\"doi\":\"10.1016/j.envint.2024.109240\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Identifying and differentiating human activities is crucial for effectively preventing the threats posed by environmental pollution to aquatic ecosystems and human health. Machine learning (ML) is a powerful analytical tool for tracking human impacts on river ecosystems based on high-through datasets. This study employed an ML framework and 16S rRNA sequencing data to reveal microbial dynamics and trace human activities across China. The results revealed that the microbial assembly was mainly dominated by deterministic factors (environmental factors and interactions between species), and the metacommunity partition was significantly associated with human activities in both water and sediment (Chi-square test<sub>w</sub> <em>P</em> = 1.93 × 10<sup>-22</sup>; Chi-square test<sub>s</sub> <em>P</em> = 6.00 × 10<sup>-6</sup>). Human activities increased the vulnerability of interspecific occurrence networks and the influence of environmental factors on the OTUs similarity and phylogenetic distance. Combined of microbiological indices (MBIs), microbial relative abundance (MRA), and environmental and geographical indices (EGIs), the source classifier machine learning (SCML) algorithm was used to categorize five human activities (i.e., low human-impact, agricultural inputs, domestic inputs, industrial inputs, and dam construction). The SCML optimal configuration is (MBIs + MRA + EGIs) exhibited strong performance with Test<sub>W</sub> R<sup>2</sup> of 0.882 and Test<sub>S</sub> R<sup>2</sup> of 0.924. This study provides valuable insights for improving ecosystem management, supporting sustainable water resource management and advancing pollution mitigation efforts.</div></div>\",\"PeriodicalId\":308,\"journal\":{\"name\":\"Environment International\",\"volume\":\"195 \",\"pages\":\"Article 109240\"},\"PeriodicalIF\":9.7000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environment International\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0160412024008274\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environment International","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0160412024008274","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Identifying human activities causing water pollution based on microbial community sequencing and source classifier machine learning
Identifying and differentiating human activities is crucial for effectively preventing the threats posed by environmental pollution to aquatic ecosystems and human health. Machine learning (ML) is a powerful analytical tool for tracking human impacts on river ecosystems based on high-through datasets. This study employed an ML framework and 16S rRNA sequencing data to reveal microbial dynamics and trace human activities across China. The results revealed that the microbial assembly was mainly dominated by deterministic factors (environmental factors and interactions between species), and the metacommunity partition was significantly associated with human activities in both water and sediment (Chi-square testwP = 1.93 × 10-22; Chi-square testsP = 6.00 × 10-6). Human activities increased the vulnerability of interspecific occurrence networks and the influence of environmental factors on the OTUs similarity and phylogenetic distance. Combined of microbiological indices (MBIs), microbial relative abundance (MRA), and environmental and geographical indices (EGIs), the source classifier machine learning (SCML) algorithm was used to categorize five human activities (i.e., low human-impact, agricultural inputs, domestic inputs, industrial inputs, and dam construction). The SCML optimal configuration is (MBIs + MRA + EGIs) exhibited strong performance with TestW R2 of 0.882 and TestS R2 of 0.924. This study provides valuable insights for improving ecosystem management, supporting sustainable water resource management and advancing pollution mitigation efforts.
期刊介绍:
Environmental Health publishes manuscripts focusing on critical aspects of environmental and occupational medicine, including studies in toxicology and epidemiology, to illuminate the human health implications of exposure to environmental hazards. The journal adopts an open-access model and practices open peer review.
It caters to scientists and practitioners across all environmental science domains, directly or indirectly impacting human health and well-being. With a commitment to enhancing the prevention of environmentally-related health risks, Environmental Health serves as a public health journal for the community and scientists engaged in matters of public health significance concerning the environment.