{"title":"利用先进的计算模型高保真地预测药物在超临界CO₂中的溶解度。","authors":"Hashem O. Alsaab , Saeed Shirazian","doi":"10.1016/j.ejps.2025.107321","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately estimating the solubility of drugs in supercritical carbon dioxide (SC<img>CO₂) still represents a major difficulty in drug formulation, separation processes, and green technologies. Traditional empirical and semi-empirical methods usually have a hard time representing the complex non-linear interactions that determine solubility under different thermodynamic conditions (e.g., T and P), which, in turn, restricts their applicability and predictive consistency. This study presents an ensemble framework that combines three machine learning regressors, namely, Extreme Gradient Boosting Regression (XGBR), Light Gradient Boosting Regression (LGBR), and CatBoost Regression (CATr), facilitated by two bio-inspired optimization algorithms, the Artificial Protozoa Optimizer (APO) and the Hippopotamus Optimization Algorithm (HOA) for estimation of pharmaceutical solubility in supercritical CO<sub>2</sub>. A dataset of 110 experimental samples reflecting the temperature, pressure, molecular weight (MW), and melting point (MP) of four drugs (Rifampin, Sirolimus, Tacrolimus, and Teriflunomide) was used to model their solubility. Model robustness was ensured through k-fold cross-validation, and interpretability was assessed via SHAP and FAST sensitivity analysis. Additionally, prediction intervals were generated using bootstrapping, enhancing reliability for real-world applications. The ensemble (XGBR + LGBR + CATr optimized by HOA) achieved predictive accuracy (R² = 0.9920, RMSE = 0.08878). The results highlight the potential of optimized ensemble learning in capturing non-linear solubility behaviors, offering a reliable computational framework for pharmaceutical engineering and green drug processing.</div></div>","PeriodicalId":12018,"journal":{"name":"European Journal of Pharmaceutical Sciences","volume":"215 ","pages":"Article 107321"},"PeriodicalIF":4.7000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"High-fidelity prediction of drug solubility in supercritical CO₂ for pharmaceutical applications using advanced computational modeling\",\"authors\":\"Hashem O. Alsaab , Saeed Shirazian\",\"doi\":\"10.1016/j.ejps.2025.107321\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurately estimating the solubility of drugs in supercritical carbon dioxide (SC<img>CO₂) still represents a major difficulty in drug formulation, separation processes, and green technologies. Traditional empirical and semi-empirical methods usually have a hard time representing the complex non-linear interactions that determine solubility under different thermodynamic conditions (e.g., T and P), which, in turn, restricts their applicability and predictive consistency. This study presents an ensemble framework that combines three machine learning regressors, namely, Extreme Gradient Boosting Regression (XGBR), Light Gradient Boosting Regression (LGBR), and CatBoost Regression (CATr), facilitated by two bio-inspired optimization algorithms, the Artificial Protozoa Optimizer (APO) and the Hippopotamus Optimization Algorithm (HOA) for estimation of pharmaceutical solubility in supercritical CO<sub>2</sub>. A dataset of 110 experimental samples reflecting the temperature, pressure, molecular weight (MW), and melting point (MP) of four drugs (Rifampin, Sirolimus, Tacrolimus, and Teriflunomide) was used to model their solubility. Model robustness was ensured through k-fold cross-validation, and interpretability was assessed via SHAP and FAST sensitivity analysis. Additionally, prediction intervals were generated using bootstrapping, enhancing reliability for real-world applications. The ensemble (XGBR + LGBR + CATr optimized by HOA) achieved predictive accuracy (R² = 0.9920, RMSE = 0.08878). The results highlight the potential of optimized ensemble learning in capturing non-linear solubility behaviors, offering a reliable computational framework for pharmaceutical engineering and green drug processing.</div></div>\",\"PeriodicalId\":12018,\"journal\":{\"name\":\"European Journal of Pharmaceutical Sciences\",\"volume\":\"215 \",\"pages\":\"Article 107321\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Pharmaceutical Sciences\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0928098725003197\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Pharmaceutical Sciences","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0928098725003197","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
High-fidelity prediction of drug solubility in supercritical CO₂ for pharmaceutical applications using advanced computational modeling
Accurately estimating the solubility of drugs in supercritical carbon dioxide (SCCO₂) still represents a major difficulty in drug formulation, separation processes, and green technologies. Traditional empirical and semi-empirical methods usually have a hard time representing the complex non-linear interactions that determine solubility under different thermodynamic conditions (e.g., T and P), which, in turn, restricts their applicability and predictive consistency. This study presents an ensemble framework that combines three machine learning regressors, namely, Extreme Gradient Boosting Regression (XGBR), Light Gradient Boosting Regression (LGBR), and CatBoost Regression (CATr), facilitated by two bio-inspired optimization algorithms, the Artificial Protozoa Optimizer (APO) and the Hippopotamus Optimization Algorithm (HOA) for estimation of pharmaceutical solubility in supercritical CO2. A dataset of 110 experimental samples reflecting the temperature, pressure, molecular weight (MW), and melting point (MP) of four drugs (Rifampin, Sirolimus, Tacrolimus, and Teriflunomide) was used to model their solubility. Model robustness was ensured through k-fold cross-validation, and interpretability was assessed via SHAP and FAST sensitivity analysis. Additionally, prediction intervals were generated using bootstrapping, enhancing reliability for real-world applications. The ensemble (XGBR + LGBR + CATr optimized by HOA) achieved predictive accuracy (R² = 0.9920, RMSE = 0.08878). The results highlight the potential of optimized ensemble learning in capturing non-linear solubility behaviors, offering a reliable computational framework for pharmaceutical engineering and green drug processing.
期刊介绍:
The journal publishes research articles, review articles and scientific commentaries on all aspects of the pharmaceutical sciences with emphasis on conceptual novelty and scientific quality. The Editors welcome articles in this multidisciplinary field, with a focus on topics relevant for drug discovery and development.
More specifically, the Journal publishes reports on medicinal chemistry, pharmacology, drug absorption and metabolism, pharmacokinetics and pharmacodynamics, pharmaceutical and biomedical analysis, drug delivery (including gene delivery), drug targeting, pharmaceutical technology, pharmaceutical biotechnology and clinical drug evaluation. The journal will typically not give priority to manuscripts focusing primarily on organic synthesis, natural products, adaptation of analytical approaches, or discussions pertaining to drug policy making.
Scientific commentaries and review articles are generally by invitation only or by consent of the Editors. Proceedings of scientific meetings may be published as special issues or supplements to the Journal.