{"title":"Predicting the solubility of drugs in supercritical carbon dioxide using machine learning and atomic contribution","authors":"Ahmadreza Roosta , Feridun Esmaeilzadeh , Reza Haghbakhsh","doi":"10.1016/j.ejpb.2025.114720","DOIUrl":null,"url":null,"abstract":"<div><div>The pharmaceutical sector is aware of supercritical CO<sub>2</sub> (SC-CO<sub>2</sub>) as a possible replacement for problematic organic solvents. Using a novel artificial intelligence (AI) strategy to predict drug solubility using the SC-CO<sub>2</sub> system mathematically has been deemed an intriguing approach. In this work, the atomic contribution (AC) method and machine learning (ML) models are combined to develop hybrid machine learning models to compute the solubility of several drugs, including anticoagulants, anti-cancers, calcium channel blockers, immunosuppressives, antihistamines, and others. The novelty of the approach lies in using the AC concept to capture molecular details at the atomic level. This enables the model to account for the specific contributions of individual atoms and to provide more precise input features for machine learning. The integration of these molecular insights with ML techniques results in significantly improved predictive performance over traditional ML methods. Throughout the modeling procedure, temperature, pressure, the density of SC-CO<sub>2</sub>, and the effect of constituent atoms of the drugs are the input variables, while the solubility of drugs is the output. This study looks into predicting the solubility of these drugs in SC-CO<sub>2</sub> using the least square support vector machine (LSSVM) with radial basis function kernel (RBF) and multilayer perceptron artificial neural network (MLPANN). These models were developed using a database including 2358 experimental solubility data points from 86 solid drugs. The solubility of solid drugs in supercritical CO<sub>2</sub> spans a remarkably wide range in this study, from as high as 3.9 × 10<sup>-2</sup> to as low as 1 × 10<sup>-7</sup>. The results demonstrated that this innovative approach could estimate solid drug solubility in SC-CO<sub>2</sub> with <em>AARD%</em> and <em>R<sup>2</sup></em> values of 7.20 and 0.99, respectively, under different pressure and temperature conditions. The ability of the models to capture a wide range of solubilities in SC-CO<sub>2</sub> showcases their effectiveness in dealing with both highly and poorly soluble compounds. The developed models, considering their global prediction, accuracy, and being user-friendly, are the best options to be used by researchers for incorporating into software for enabling more efficient design of supercritical extraction processes and reducing the need for trial-and-error experimentation in manufacturing.</div></div>","PeriodicalId":12024,"journal":{"name":"European Journal of Pharmaceutics and Biopharmaceutics","volume":"211 ","pages":"Article 114720"},"PeriodicalIF":4.4000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Pharmaceutics and Biopharmaceutics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0939641125000979","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
Abstract
The pharmaceutical sector is aware of supercritical CO2 (SC-CO2) as a possible replacement for problematic organic solvents. Using a novel artificial intelligence (AI) strategy to predict drug solubility using the SC-CO2 system mathematically has been deemed an intriguing approach. In this work, the atomic contribution (AC) method and machine learning (ML) models are combined to develop hybrid machine learning models to compute the solubility of several drugs, including anticoagulants, anti-cancers, calcium channel blockers, immunosuppressives, antihistamines, and others. The novelty of the approach lies in using the AC concept to capture molecular details at the atomic level. This enables the model to account for the specific contributions of individual atoms and to provide more precise input features for machine learning. The integration of these molecular insights with ML techniques results in significantly improved predictive performance over traditional ML methods. Throughout the modeling procedure, temperature, pressure, the density of SC-CO2, and the effect of constituent atoms of the drugs are the input variables, while the solubility of drugs is the output. This study looks into predicting the solubility of these drugs in SC-CO2 using the least square support vector machine (LSSVM) with radial basis function kernel (RBF) and multilayer perceptron artificial neural network (MLPANN). These models were developed using a database including 2358 experimental solubility data points from 86 solid drugs. The solubility of solid drugs in supercritical CO2 spans a remarkably wide range in this study, from as high as 3.9 × 10-2 to as low as 1 × 10-7. The results demonstrated that this innovative approach could estimate solid drug solubility in SC-CO2 with AARD% and R2 values of 7.20 and 0.99, respectively, under different pressure and temperature conditions. The ability of the models to capture a wide range of solubilities in SC-CO2 showcases their effectiveness in dealing with both highly and poorly soluble compounds. The developed models, considering their global prediction, accuracy, and being user-friendly, are the best options to be used by researchers for incorporating into software for enabling more efficient design of supercritical extraction processes and reducing the need for trial-and-error experimentation in manufacturing.
期刊介绍:
The European Journal of Pharmaceutics and Biopharmaceutics provides a medium for the publication of novel, innovative and hypothesis-driven research from the areas of Pharmaceutics and Biopharmaceutics.
Topics covered include for example:
Design and development of drug delivery systems for pharmaceuticals and biopharmaceuticals (small molecules, proteins, nucleic acids)
Aspects of manufacturing process design
Biomedical aspects of drug product design
Strategies and formulations for controlled drug transport across biological barriers
Physicochemical aspects of drug product development
Novel excipients for drug product design
Drug delivery and controlled release systems for systemic and local applications
Nanomaterials for therapeutic and diagnostic purposes
Advanced therapy medicinal products
Medical devices supporting a distinct pharmacological effect.