Combining High-Throughput Screening and Machine Learning to Predict the Formation of Both Binary and Ternary Amorphous Solid Dispersion Formulations for Early Drug Discovery and Development.

IF 3.5 3区 医学 Q2 CHEMISTRY, MULTIDISCIPLINARY
Pharmaceutical Research Pub Date : 2025-04-01 Epub Date: 2025-04-03 DOI:10.1007/s11095-025-03853-z
Tianshu Lu, Yiyang Wu, Ping Xiong, Hao Zhong, Yang Ding, Haifeng Li, Defang Ouyang
{"title":"Combining High-Throughput Screening and Machine Learning to Predict the Formation of Both Binary and Ternary Amorphous Solid Dispersion Formulations for Early Drug Discovery and Development.","authors":"Tianshu Lu, Yiyang Wu, Ping Xiong, Hao Zhong, Yang Ding, Haifeng Li, Defang Ouyang","doi":"10.1007/s11095-025-03853-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Amorphous solid dispersion (ASD) is widely utilized to enhance the solubility and bioavailability of water-insoluble drugs. However, conventional experimental approaches for ASD development are often resource-intensive and time-consuming. Machine learning (ML) algorithms have great potential to predict ASD formulations but face the challenge of extensive data to construct reliable models. Current study aims to predict the formation of both binary and ternary ASD by combined high-throughput screening (HTS) and ML approaches.</p><p><strong>Methods: </strong>Micro-quantity HTS was conducted to generate 1272 binary and ternary solid dispersions using solvent evaporation method. The Powder X-Ray Diffraction (PXRD) was used to characterize the amorphous state of formulations. The results indicated that 188 formulations successfully formed amorphous solid dispersions (ASDs), while 1084 resulted in crystalline formations. Models development employed nested cross-validation with four algorithms: Light Gradient Boosting Machine (LGBM), Random Forest (RF), Support Vector Machine (SVM), and Multi-Layer Perceptron (MLP).</p><p><strong>Results: </strong>The RF model for ASD formation achieved 96.7% accuracy on the in-house HTS dataset, with a precision of approximately 87.9% and an F1 score of 83.6%. Furthermore, the RF model trained with milligram-scale HTS experimental data could effectively predict the large-scale ASD formulations from the literature, highlighting its promise as a powerful tool for advancing ASD prediction.</p><p><strong>Conclusion: </strong>In summary, the combination of HTS experiments and ML techniques provides a valuable reference framework for ASD development, greatly minimizing both time and material usage in the selection of formulations during the early stages of drug discovery with a limited quantity of API.</p>","PeriodicalId":20027,"journal":{"name":"Pharmaceutical Research","volume":" ","pages":"697-709"},"PeriodicalIF":3.5000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pharmaceutical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11095-025-03853-z","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/3 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: Amorphous solid dispersion (ASD) is widely utilized to enhance the solubility and bioavailability of water-insoluble drugs. However, conventional experimental approaches for ASD development are often resource-intensive and time-consuming. Machine learning (ML) algorithms have great potential to predict ASD formulations but face the challenge of extensive data to construct reliable models. Current study aims to predict the formation of both binary and ternary ASD by combined high-throughput screening (HTS) and ML approaches.

Methods: Micro-quantity HTS was conducted to generate 1272 binary and ternary solid dispersions using solvent evaporation method. The Powder X-Ray Diffraction (PXRD) was used to characterize the amorphous state of formulations. The results indicated that 188 formulations successfully formed amorphous solid dispersions (ASDs), while 1084 resulted in crystalline formations. Models development employed nested cross-validation with four algorithms: Light Gradient Boosting Machine (LGBM), Random Forest (RF), Support Vector Machine (SVM), and Multi-Layer Perceptron (MLP).

Results: The RF model for ASD formation achieved 96.7% accuracy on the in-house HTS dataset, with a precision of approximately 87.9% and an F1 score of 83.6%. Furthermore, the RF model trained with milligram-scale HTS experimental data could effectively predict the large-scale ASD formulations from the literature, highlighting its promise as a powerful tool for advancing ASD prediction.

Conclusion: In summary, the combination of HTS experiments and ML techniques provides a valuable reference framework for ASD development, greatly minimizing both time and material usage in the selection of formulations during the early stages of drug discovery with a limited quantity of API.

结合高通量筛选和机器学习预测二元和三元非晶固体分散配方的形成,用于早期药物发现和开发。
目的:非晶固体分散体(ASD)被广泛应用于提高水不溶性药物的溶解度和生物利用度。然而,传统的ASD发展实验方法往往是资源密集和耗时的。机器学习(ML)算法在预测ASD配方方面具有很大的潜力,但面临着构建可靠模型所需的大量数据的挑战。本研究旨在通过高通量筛选(HTS)和ML相结合的方法预测二元和三元ASD的形成。方法:采用溶剂蒸发法对1272个二、三元固体分散体进行微量热液相色谱制备。采用粉末x射线衍射(PXRD)表征了配方的非晶态。结果表明,有188种配方成功形成非晶固体分散体(ASDs), 1084种配方成功形成晶体。模型开发采用四种算法嵌套交叉验证:光梯度增强机(LGBM)、随机森林(RF)、支持向量机(SVM)和多层感知机(MLP)。结果:ASD地层的RF模型在内部HTS数据集上达到96.7%的准确率,精度约为87.9%,F1得分为83.6%。此外,使用毫克级HTS实验数据训练的射频模型可以有效地预测文献中的大规模ASD配方,这突显了其作为推进ASD预测的有力工具的前景。结论:综上所述,HTS实验和ML技术的结合为ASD的开发提供了一个有价值的参考框架,在药物发现的早期阶段,在API数量有限的情况下,极大地减少了配方选择的时间和材料使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Pharmaceutical Research
Pharmaceutical Research 医学-化学综合
CiteScore
6.60
自引率
5.40%
发文量
276
审稿时长
3.4 months
期刊介绍: Pharmaceutical Research, an official journal of the American Association of Pharmaceutical Scientists, is committed to publishing novel research that is mechanism-based, hypothesis-driven and addresses significant issues in drug discovery, development and regulation. Current areas of interest include, but are not limited to: -(pre)formulation engineering and processing- computational biopharmaceutics- drug delivery and targeting- molecular biopharmaceutics and drug disposition (including cellular and molecular pharmacology)- pharmacokinetics, pharmacodynamics and pharmacogenetics. Research may involve nonclinical and clinical studies, and utilize both in vitro and in vivo approaches. Studies on small drug molecules, pharmaceutical solid materials (including biomaterials, polymers and nanoparticles) biotechnology products (including genes, peptides, proteins and vaccines), and genetically engineered cells are welcome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信