Huijuan Hao, Panpan Li, Ke Li, Yongping Shan, Feng Liu, Naiwen Hu, Bo Zhang, Man Li, Xudong Sang, Xiaotong Xu, Yuntao Lv, Wanming Chen, Wentao Jiao
{"title":"以图表示学习为驱动的重金属浓度新型预测方法。","authors":"Huijuan Hao, Panpan Li, Ke Li, Yongping Shan, Feng Liu, Naiwen Hu, Bo Zhang, Man Li, Xudong Sang, Xiaotong Xu, Yuntao Lv, Wanming Chen, Wentao Jiao","doi":"10.1016/j.scitotenv.2024.174713","DOIUrl":null,"url":null,"abstract":"<p><p>The potential risk of heavy metals (HMs) to public health is an issue of great concern. Early prediction is an effective means to reduce the accumulation of HMs. The current prediction methods rarely take internal correlations between environmental factors into consideration, which negatively affects the accuracy of the prediction model and the interpretability of intrinsic mechanisms. Graph representation learning (GraRL) can simultaneously learn the attribute relationships between environmental factors and graph structural information. Herein, we developed the GraRL-HM method to predict the HM concentrations in soil-rice systems. The method consists of two modules, which are PeTPG and GCN-HM. In PeTPG, a graphic structure was generated using graph representation and communitization technology to explore the correlations and transmission paths of different environmental factors. Subsequently, the GCN-HM model based on the graph convolutional neural network (GCN) was used to predict the HM concentrations. The GraRL-HM method was validated by 2295 sets of data covering 21 environmental factors. The results indicated that the PeTPG model simplified correlation paths between factor nodes from 396 to 184, reducing by 53.5 % graph scale by eliminating the invalid paths. The concise and efficient graph structure enhanced the learning efficiency and representation accuracy of downstream prediction models. The GCN-HM model was superior to the four benchmark models in predicting the HM concentration in the crop, improving R<sup>2</sup> by 36.1 %. This study develops a novel approach to improve the prediction accuracy of pollutant accumulation and provides valuable insights into intelligent regulation and planting guidance for heavy metal pollution control.</p>","PeriodicalId":422,"journal":{"name":"Science of the Total Environment","volume":" ","pages":"174713"},"PeriodicalIF":8.0000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel prediction approach driven by graph representation learning for heavy metal concentrations.\",\"authors\":\"Huijuan Hao, Panpan Li, Ke Li, Yongping Shan, Feng Liu, Naiwen Hu, Bo Zhang, Man Li, Xudong Sang, Xiaotong Xu, Yuntao Lv, Wanming Chen, Wentao Jiao\",\"doi\":\"10.1016/j.scitotenv.2024.174713\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The potential risk of heavy metals (HMs) to public health is an issue of great concern. Early prediction is an effective means to reduce the accumulation of HMs. The current prediction methods rarely take internal correlations between environmental factors into consideration, which negatively affects the accuracy of the prediction model and the interpretability of intrinsic mechanisms. Graph representation learning (GraRL) can simultaneously learn the attribute relationships between environmental factors and graph structural information. Herein, we developed the GraRL-HM method to predict the HM concentrations in soil-rice systems. The method consists of two modules, which are PeTPG and GCN-HM. In PeTPG, a graphic structure was generated using graph representation and communitization technology to explore the correlations and transmission paths of different environmental factors. Subsequently, the GCN-HM model based on the graph convolutional neural network (GCN) was used to predict the HM concentrations. The GraRL-HM method was validated by 2295 sets of data covering 21 environmental factors. The results indicated that the PeTPG model simplified correlation paths between factor nodes from 396 to 184, reducing by 53.5 % graph scale by eliminating the invalid paths. The concise and efficient graph structure enhanced the learning efficiency and representation accuracy of downstream prediction models. The GCN-HM model was superior to the four benchmark models in predicting the HM concentration in the crop, improving R<sup>2</sup> by 36.1 %. This study develops a novel approach to improve the prediction accuracy of pollutant accumulation and provides valuable insights into intelligent regulation and planting guidance for heavy metal pollution control.</p>\",\"PeriodicalId\":422,\"journal\":{\"name\":\"Science of the Total Environment\",\"volume\":\" \",\"pages\":\"174713\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2024-10-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Science of the Total Environment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1016/j.scitotenv.2024.174713\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/7/10 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science of the Total Environment","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.scitotenv.2024.174713","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/10 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
A novel prediction approach driven by graph representation learning for heavy metal concentrations.
The potential risk of heavy metals (HMs) to public health is an issue of great concern. Early prediction is an effective means to reduce the accumulation of HMs. The current prediction methods rarely take internal correlations between environmental factors into consideration, which negatively affects the accuracy of the prediction model and the interpretability of intrinsic mechanisms. Graph representation learning (GraRL) can simultaneously learn the attribute relationships between environmental factors and graph structural information. Herein, we developed the GraRL-HM method to predict the HM concentrations in soil-rice systems. The method consists of two modules, which are PeTPG and GCN-HM. In PeTPG, a graphic structure was generated using graph representation and communitization technology to explore the correlations and transmission paths of different environmental factors. Subsequently, the GCN-HM model based on the graph convolutional neural network (GCN) was used to predict the HM concentrations. The GraRL-HM method was validated by 2295 sets of data covering 21 environmental factors. The results indicated that the PeTPG model simplified correlation paths between factor nodes from 396 to 184, reducing by 53.5 % graph scale by eliminating the invalid paths. The concise and efficient graph structure enhanced the learning efficiency and representation accuracy of downstream prediction models. The GCN-HM model was superior to the four benchmark models in predicting the HM concentration in the crop, improving R2 by 36.1 %. This study develops a novel approach to improve the prediction accuracy of pollutant accumulation and provides valuable insights into intelligent regulation and planting guidance for heavy metal pollution control.
期刊介绍:
The Science of the Total Environment is an international journal dedicated to scientific research on the environment and its interaction with humanity. It covers a wide range of disciplines and seeks to publish innovative, hypothesis-driven, and impactful research that explores the entire environment, including the atmosphere, lithosphere, hydrosphere, biosphere, and anthroposphere.
The journal's updated Aims & Scope emphasizes the importance of interdisciplinary environmental research with broad impact. Priority is given to studies that advance fundamental understanding and explore the interconnectedness of multiple environmental spheres. Field studies are preferred, while laboratory experiments must demonstrate significant methodological advancements or mechanistic insights with direct relevance to the environment.