Development, Evaluation, and Application of Machine Learning Models for Accurate Prediction of Root Uptake of Per- and Polyfluoroalkyl Substances
Authors: Lei Xiang, Jing Qiu, Qian-Qi Chen, Peng-Fei Yu, Bai-Lin Liu, Hai-Ming Zhao, Yan-Wen Li, Nai-Xian Feng, Quan-Ying Cai, Ce-Hui Mo* and Qing X. Li
Journal: Environmental Science & Technology, vol. 57, no. 46, pp. 18317–18328 (impact factor 11.3; JCR Q1, Engineering, Environmental)
Published: 2023-05-15
DOI: 10.1021/acs.est.2c09788
URL: https://pubs.acs.org/doi/10.1021/acs.est.2c09788
Citations: 3
Abstract
Machine learning (ML) models were developed to understand the root uptake of per- and polyfluoroalkyl substances (PFASs) under complex PFAS-crop-soil interactions. Three hundred root concentration factor (RCF) data points and 26 features describing PFAS structures, crop properties, soil properties, and cultivation conditions were used for model development. The optimal ML model, obtained by stratified sampling, Bayesian optimization, and 5-fold cross-validation, was interpreted using permutation feature importance, individual conditional expectation plots, and 3D interaction plots. Soil organic carbon content, pH, chemical logP, soil PFAS concentration, root protein content, and exposure time strongly affected the root uptake of PFASs, with relative importances of 0.43, 0.25, 0.10, 0.05, 0.05, and 0.05, respectively. These factors also showed key threshold ranges that favored PFAS uptake. Based on extended connectivity fingerprints, carbon-chain length was identified as the critical molecular structure affecting root uptake of PFASs, with a relative importance of 0.12. A user-friendly model was established by symbolic regression to accurately predict RCF values of PFASs (including branched PFAS isomers). The study provides a novel approach for gaining insight into PFAS uptake by crops under complex PFAS-crop-soil interactions, with the aim of ensuring food safety and human health.
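The permutation feature importance mentioned in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' code: the data are synthetic, the three feature names are hypothetical stand-ins, the "model" is simply the known generating function, and normalizing the scores so they sum to 1 is an assumption about how relative importances might be reported. The idea is to shuffle one feature column at a time and record how much the model's R² drops.

```python
import random
import statistics


def r2_score(y_true, y_pred):
    """Coefficient of determination (R^2)."""
    mean_y = statistics.fmean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def permutation_importance(predict, X, y, n_repeats=10, seed=0):
    """Mean drop in R^2 when a single feature column is shuffled."""
    rng = random.Random(seed)
    baseline = r2_score(y, [predict(row) for row in X])
    importances = []
    for j in range(len(X[0])):
        drops = []
        for _ in range(n_repeats):
            column = [row[j] for row in X]
            rng.shuffle(column)  # break the link between feature j and y
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, column)]
            permuted = r2_score(y, [predict(row) for row in X_perm])
            drops.append(baseline - permuted)
        importances.append(statistics.fmean(drops))
    return importances


# Synthetic stand-in data (NOT the study's data); feature names are hypothetical.
FEATURES = ["soil_organic_carbon", "soil_pH", "irrelevant_noise"]
data_rng = random.Random(42)
X = [[data_rng.random() for _ in FEATURES] for _ in range(300)]
y = [-3.0 * a + 1.0 * b + data_rng.gauss(0, 0.05) for a, b, _ in X]


def predict(row):
    # Hypothetical "trained" model: here just the known generating function.
    return -3.0 * row[0] + 1.0 * row[1]


raw = permutation_importance(predict, X, y)
# Assumed normalization: clip negatives to zero and rescale to sum to 1.
total = sum(max(v, 0.0) for v in raw)
relative = {name: max(v, 0.0) / total for name, v in zip(FEATURES, raw)}

for name, share in sorted(relative.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {share:.2f}")
```

In this toy setup the feature with the largest coefficient receives the largest share and the unused feature receives exactly zero, because shuffling a column the model ignores cannot change its predictions. With a real fitted model, the same procedure would normally be run on held-out folds rather than on the training data.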
About the journal:
Environmental Science & Technology (ES&T) is an academic and technical journal co-sponsored by the Hubei Provincial Environmental Protection Bureau and the Hubei Provincial Academy of Environmental Sciences.
ES&T is listed as a Chinese core journal and as a source journal for Chinese scientific papers, the Chinese Science Citation Database, and the Chinese Academic Journal Comprehensive Evaluation Database. It focuses on the academic field of environmental protection and publishes articles on environmental protection and related technical advances.