{"title":"通过生成式机器学习模型探索离子液体溶解二氧化碳的化学空间","authors":"Xiuxian Chen, Guzhong Chen, Kunchi Xie, Jie Cheng, Jiahui Chen, Zhen Song, Zhiwen Qi","doi":"10.1016/j.gce.2024.06.005","DOIUrl":null,"url":null,"abstract":"<div><div>For discovering uncharted chemical space of ionic liquids (ILs) for CO<sub>2</sub> dissolution, a reliable generative framework combining re-balanced variational autoencoder (VAE), artificial neural network (ANN), and particle swarm optimization (PSO) is developed based on a comprehensive experimental solubility database from literature. The re-balanced VAE transforms the chemical space of ILs into continuous latent space, which is demonstrated by t-distributed stochastic neighbor embedding (t-SNE) visualization and sampled ions of the latent space. ANN is connected with the re-balanced VAE to predict the CO<sub>2</sub> solubility and the resultant VAE-ANN model achieves a low mean absolute error (MAE) of 0.022 on the test set. Lastly, the PSO algorithm is employed to search the latent space for optimal IL structures with the highest predicted solubility. A total of 5120 ILs are generated and optimized through 10 parallel runs of PSO. Their CO<sub>2</sub> solubilities are predicted and compared to those of the 3735 ILs combined with the already-known cations and anions in the CO<sub>2</sub> solubility database under 298.15 K and 100 kPa. The results demonstrate a notably larger distribution of higher CO<sub>2</sub> solubility in optimized ILs after PSO, which effectively points out the significance and directions for exploring the wide IL chemical space.</div></div>","PeriodicalId":66474,"journal":{"name":"Green Chemical Engineering","volume":"6 3","pages":"Pages 335-343"},"PeriodicalIF":7.6000,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring the chemical space of ionic liquids for CO2 dissolution through generative machine learning models\",\"authors\":\"Xiuxian Chen, Guzhong Chen, Kunchi Xie, Jie Cheng, Jiahui Chen, Zhen Song, Zhiwen Qi\",\"doi\":\"10.1016/j.gce.2024.06.005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>For discovering uncharted chemical space of ionic liquids (ILs) for CO<sub>2</sub> dissolution, a reliable generative framework combining re-balanced variational autoencoder (VAE), artificial neural network (ANN), and particle swarm optimization (PSO) is developed based on a comprehensive experimental solubility database from literature. The re-balanced VAE transforms the chemical space of ILs into continuous latent space, which is demonstrated by t-distributed stochastic neighbor embedding (t-SNE) visualization and sampled ions of the latent space. ANN is connected with the re-balanced VAE to predict the CO<sub>2</sub> solubility and the resultant VAE-ANN model achieves a low mean absolute error (MAE) of 0.022 on the test set. Lastly, the PSO algorithm is employed to search the latent space for optimal IL structures with the highest predicted solubility. A total of 5120 ILs are generated and optimized through 10 parallel runs of PSO. Their CO<sub>2</sub> solubilities are predicted and compared to those of the 3735 ILs combined with the already-known cations and anions in the CO<sub>2</sub> solubility database under 298.15 K and 100 kPa. The results demonstrate a notably larger distribution of higher CO<sub>2</sub> solubility in optimized ILs after PSO, which effectively points out the significance and directions for exploring the wide IL chemical space.</div></div>\",\"PeriodicalId\":66474,\"journal\":{\"name\":\"Green Chemical Engineering\",\"volume\":\"6 3\",\"pages\":\"Pages 335-343\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2024-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Green Chemical Engineering\",\"FirstCategoryId\":\"1089\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666952824000414\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CHEMICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Green Chemical Engineering","FirstCategoryId":"1089","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666952824000414","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
Exploring the chemical space of ionic liquids for CO2 dissolution through generative machine learning models
For discovering uncharted chemical space of ionic liquids (ILs) for CO2 dissolution, a reliable generative framework combining re-balanced variational autoencoder (VAE), artificial neural network (ANN), and particle swarm optimization (PSO) is developed based on a comprehensive experimental solubility database from literature. The re-balanced VAE transforms the chemical space of ILs into continuous latent space, which is demonstrated by t-distributed stochastic neighbor embedding (t-SNE) visualization and sampled ions of the latent space. ANN is connected with the re-balanced VAE to predict the CO2 solubility and the resultant VAE-ANN model achieves a low mean absolute error (MAE) of 0.022 on the test set. Lastly, the PSO algorithm is employed to search the latent space for optimal IL structures with the highest predicted solubility. A total of 5120 ILs are generated and optimized through 10 parallel runs of PSO. Their CO2 solubilities are predicted and compared to those of the 3735 ILs combined with the already-known cations and anions in the CO2 solubility database under 298.15 K and 100 kPa. The results demonstrate a notably larger distribution of higher CO2 solubility in optimized ILs after PSO, which effectively points out the significance and directions for exploring the wide IL chemical space.