{"title":"Exploring the solubility potential of anti-cancer and supportive agents in supercritical CO2 through advanced computational intelligence techniques","authors":"Reza Soleimani , Mandana Moradi Kouchi , Ziba Behtouei , Zahra Ghasemi , Alireza Baghban","doi":"10.1016/j.jcou.2025.103227","DOIUrl":null,"url":null,"abstract":"<div><div>The accurate prediction of solid drug solubility in supercritical carbon dioxide (SC-CO₂) is critical for optimizing pharmaceutical processes, especially in environmentally sustainable drug formulation and purification. This study develops a machine learning (ML) framework for predicting solubility in SC-CO₂ using 744 experimental data points (520 training, 112 validation, 112 testing). Four features—melting point, molecular weight, pressure, and temperature—were used as model inputs. A comparative assessment was performed between conventional regression methods (Linear, Ridge, Lasso, Elastic Net) and advanced ML algorithms, including Support Vector Machine, K-Nearest Neighbors, Decision Tree, Random Forest, Gradient Boosting, XGBoost, LightGBM, CatBoost, Gaussian Process Regression, Artificial Neural Networks, and Convolutional Neural Networks (CNN). The results show that tree-based ensembles and deep learning approaches significantly outperform linear models. Notably, the CNN model achieved the best test performance with R² = 0.9839 and MSE = 0.0800, followed by CatBoost (R² = 0.9795) and Gaussian Process Regression (R² = 0.9751). Feature importance analysis using SHAP revealed molecular weight as the most influential variable, followed by pressure, temperature, and melting point. Overall, this study highlights the potential of ML in improving solubility prediction and supports its application in early-stage drug development and green pharmaceutical processing.</div></div>","PeriodicalId":350,"journal":{"name":"Journal of CO2 Utilization","volume":"102 ","pages":"Article 103227"},"PeriodicalIF":8.4000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of CO2 Utilization","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212982025002112","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The accurate prediction of solid drug solubility in supercritical carbon dioxide (SC-CO₂) is critical for optimizing pharmaceutical processes, especially in environmentally sustainable drug formulation and purification. This study develops a machine learning (ML) framework for predicting solubility in SC-CO₂ using 744 experimental data points (520 training, 112 validation, 112 testing). Four features—melting point, molecular weight, pressure, and temperature—were used as model inputs. A comparative assessment was performed between conventional regression methods (Linear, Ridge, Lasso, Elastic Net) and advanced ML algorithms, including Support Vector Machine, K-Nearest Neighbors, Decision Tree, Random Forest, Gradient Boosting, XGBoost, LightGBM, CatBoost, Gaussian Process Regression, Artificial Neural Networks, and Convolutional Neural Networks (CNN). The results show that tree-based ensembles and deep learning approaches significantly outperform linear models. Notably, the CNN model achieved the best test performance with R² = 0.9839 and MSE = 0.0800, followed by CatBoost (R² = 0.9795) and Gaussian Process Regression (R² = 0.9751). Feature importance analysis using SHAP revealed molecular weight as the most influential variable, followed by pressure, temperature, and melting point. Overall, this study highlights the potential of ML in improving solubility prediction and supports its application in early-stage drug development and green pharmaceutical processing.
期刊介绍:
The Journal of CO2 Utilization offers a single, multi-disciplinary, scholarly platform for the exchange of novel research in the field of CO2 re-use for scientists and engineers in chemicals, fuels and materials.
The emphasis is on the dissemination of leading-edge research from basic science to the development of new processes, technologies and applications.
The Journal of CO2 Utilization publishes original peer-reviewed research papers, reviews, and short communications, including experimental and theoretical work, and analytical models and simulations.