Enabling Data-Driven Solubility Modeling at GSK: Enhancing Purge Predictions for Mutagenic Impurities

IF 3.1 3区 化学 Q2 CHEMISTRY, APPLIED
Luigi Da Vià, Matthias Depoortere, Robert D. Willacy, Alastair J. Roberts, Pandian Sokkar, Mathieu Fossépré, Andrew Ruba, Magdalena A. Zwierzyna
{"title":"Enabling Data-Driven Solubility Modeling at GSK: Enhancing Purge Predictions for Mutagenic Impurities","authors":"Luigi Da Vià, Matthias Depoortere, Robert D. Willacy, Alastair J. Roberts, Pandian Sokkar, Mathieu Fossépré, Andrew Ruba, Magdalena A. Zwierzyna","doi":"10.1021/acs.oprd.4c00384","DOIUrl":null,"url":null,"abstract":"In the pharmaceutical industry, solubility is a critical parameter influencing various stages of drug development, from early discovery to commercial manufacturing. This work showcases a high-throughput solubility screening workflow and describes the steps required to standardize and curate data suitably to allow automated data flow. Using the high-quality data, we developed a quantitative structure–property relationship model using gradient boosting and molecular descriptors, requiring only a 2D molecular structure to generate predictions. The accuracy of the model is competitive with alternative approaches where additional physical data is not required. A key use case for solubility predictions made in this way is in developing control strategies for mutagenic impurities, allowing for a data-driven and consistent method for calculating the solubility contribution to purge calculations. Further perspective is given on the future of the application of the model as a solubility prediction algorithm and on the approach to data-driven methodologies supporting drug development in general, highlighting the potential for federated learning approaches which use technological approaches to overcome the barrier to cross-industry data sharing.","PeriodicalId":55,"journal":{"name":"Organic Process Research & Development","volume":"13 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Organic Process Research & Development","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.oprd.4c00384","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

In the pharmaceutical industry, solubility is a critical parameter influencing various stages of drug development, from early discovery to commercial manufacturing. This work showcases a high-throughput solubility screening workflow and describes the steps required to standardize and curate data suitably to allow automated data flow. Using the high-quality data, we developed a quantitative structure–property relationship model using gradient boosting and molecular descriptors, requiring only a 2D molecular structure to generate predictions. The accuracy of the model is competitive with alternative approaches where additional physical data is not required. A key use case for solubility predictions made in this way is in developing control strategies for mutagenic impurities, allowing for a data-driven and consistent method for calculating the solubility contribution to purge calculations. Further perspective is given on the future of the application of the model as a solubility prediction algorithm and on the approach to data-driven methodologies supporting drug development in general, highlighting the potential for federated learning approaches which use technological approaches to overcome the barrier to cross-industry data sharing.

Abstract Image

在葛兰素史克公司实现数据驱动的溶解度建模:加强对突变杂质的净化预测
在制药行业,从早期发现到商业化生产,溶解度是影响药物开发各个阶段的关键参数。这项工作展示了高通量溶解度筛选工作流程,并介绍了为实现自动数据流而对数据进行标准化和适当整理所需的步骤。利用高质量数据,我们使用梯度提升和分子描述符开发了一个定量结构-性质关系模型,只需二维分子结构即可生成预测结果。在不需要额外物理数据的情况下,该模型的准确性可与其他方法媲美。用这种方法进行溶解度预测的一个主要用途是制定诱变杂质的控制策略,从而采用数据驱动和一致的方法计算溶解度对净化计算的贡献。报告还进一步展望了该模型作为溶解度预测算法的应用前景,以及支持药物开发的数据驱动方法,强调了联合学习方法的潜力,该方法利用技术手段克服了跨行业数据共享的障碍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.90
自引率
14.70%
发文量
251
审稿时长
2 months
期刊介绍: The journal Organic Process Research & Development serves as a communication tool between industrial chemists and chemists working in universities and research institutes. As such, it reports original work from the broad field of industrial process chemistry but also presents academic results that are relevant, or potentially relevant, to industrial applications. Process chemistry is the science that enables the safe, environmentally benign and ultimately economical manufacturing of organic compounds that are required in larger amounts to help address the needs of society. Consequently, the Journal encompasses every aspect of organic chemistry, including all aspects of catalysis, synthetic methodology development and synthetic strategy exploration, but also includes aspects from analytical and solid-state chemistry and chemical engineering, such as work-up tools,process safety, or flow-chemistry. The goal of development and optimization of chemical reactions and processes is their transfer to a larger scale; original work describing such studies and the actual implementation on scale is highly relevant to the journal. However, studies on new developments from either industry, research institutes or academia that have not yet been demonstrated on scale, but where an industrial utility can be expected and where the study has addressed important prerequisites for a scale-up and has given confidence into the reliability and practicality of the chemistry, also serve the mission of OPR&D as a communication tool between the different contributors to the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信