Recurrent Neural Networks Predict Future Peptide Aggregation for Drug Development.

IF 4.5 2区医学 Q2 MEDICINE, RESEARCH & EXPERIMENTAL

Molecular Pharmaceutics Pub Date : 2025-10-15 DOI:10.1021/acs.molpharmaceut.5c00314

Prageeth R Wijewardhane, Katelyn Smith, Jonathan Fine, Jameson R Bothe, Peter Wuelfing, Yong Liu, Gaurav Chopra

{"title":"Recurrent Neural Networks Predict Future Peptide Aggregation for Drug Development.","authors":"Prageeth R Wijewardhane, Katelyn Smith, Jonathan Fine, Jameson R Bothe, Peter Wuelfing, Yong Liu, Gaurav Chopra","doi":"10.1021/acs.molpharmaceut.5c00314","DOIUrl":null,"url":null,"abstract":"<p><p>Physical stability of an active pharmaceutical ingredient (API) is a key consideration in the development of a pharmaceutical drug. Solution conditions such as pH, excipient concentrations, and storage temperatures can impact the physical stability of a therapeutic peptide in formulation. Optimizing these conditions is a critical activity in achieving a higher stability of a therapeutic peptide product. A Thioflavin T (ThioT) fluorescent reporter assay is widely used to measure the aggregation of peptide products. ThioT kinetic assays are used to predict the propensity of fibril formation by using ThioT curves for a peptide stored in a solution. However, there is no analytical relationship that can be used to relate the physical stability for different formulation conditions, resulting in execution of large-scale stability assays that require significant resources for pharmaceutical companies. Therefore, there is a need to develop new artificial intelligence (AI) methods to predict future ThioT curves in a fast and cost-effective manner. Here, we combined an experimental measure of time-varying conformational states from ThioT assays with AI models to predict peptide aggregation in different formulation conditions during drug development. We formulated the peptide aggregation problem as \"language translation\" in natural language processing, wherein the sequence of aggregation states at earlier time points was used to predict (or \"translate\") the aggregation states for future time points. We developed a new sequence-to-sequence long short-term memory (LSTM)-based recurrent neural network (RNN) model to predict entire ThioT curves at future time points (6 and 12 months) using data sets from initial and 1 month ThioT curves for different conditions. We achieved an excellent average mean absolute error (MAE) of 2.04 for the model, which was used to predict and experimentally validate ThioT curves for a 6 month time point. In contrast to the LSTM, the multilayer perceptron (MLP) baseline model showed a higher MAE of 5.17. However, at the 12 month time point, with limited training data, both models achieved comparable results with average MAEs of 4.25 and 4.45 for LSTM and MLP, respectively. Therefore, we conclude that LSTM models can be used to predict future ThioT curves only using the initial and 1 month ThioT curves as input. We believe that the use of recurrent neural network models will benefit the pharmaceutical industry to predict and explore the formulation landscape for future physical stability measurements of APIs based on short-term stability data.</p>","PeriodicalId":52,"journal":{"name":"Molecular Pharmaceutics","volume":" ","pages":""},"PeriodicalIF":4.5000,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Pharmaceutics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1021/acs.molpharmaceut.5c00314","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Physical stability of an active pharmaceutical ingredient (API) is a key consideration in the development of a pharmaceutical drug. Solution conditions such as pH, excipient concentrations, and storage temperatures can impact the physical stability of a therapeutic peptide in formulation. Optimizing these conditions is a critical activity in achieving a higher stability of a therapeutic peptide product. A Thioflavin T (ThioT) fluorescent reporter assay is widely used to measure the aggregation of peptide products. ThioT kinetic assays are used to predict the propensity of fibril formation by using ThioT curves for a peptide stored in a solution. However, there is no analytical relationship that can be used to relate the physical stability for different formulation conditions, resulting in execution of large-scale stability assays that require significant resources for pharmaceutical companies. Therefore, there is a need to develop new artificial intelligence (AI) methods to predict future ThioT curves in a fast and cost-effective manner. Here, we combined an experimental measure of time-varying conformational states from ThioT assays with AI models to predict peptide aggregation in different formulation conditions during drug development. We formulated the peptide aggregation problem as "language translation" in natural language processing, wherein the sequence of aggregation states at earlier time points was used to predict (or "translate") the aggregation states for future time points. We developed a new sequence-to-sequence long short-term memory (LSTM)-based recurrent neural network (RNN) model to predict entire ThioT curves at future time points (6 and 12 months) using data sets from initial and 1 month ThioT curves for different conditions. We achieved an excellent average mean absolute error (MAE) of 2.04 for the model, which was used to predict and experimentally validate ThioT curves for a 6 month time point. In contrast to the LSTM, the multilayer perceptron (MLP) baseline model showed a higher MAE of 5.17. However, at the 12 month time point, with limited training data, both models achieved comparable results with average MAEs of 4.25 and 4.45 for LSTM and MLP, respectively. Therefore, we conclude that LSTM models can be used to predict future ThioT curves only using the initial and 1 month ThioT curves as input. We believe that the use of recurrent neural network models will benefit the pharmaceutical industry to predict and explore the formulation landscape for future physical stability measurements of APIs based on short-term stability data.

查看原文本刊更多论文

递归神经网络预测未来药物开发中的肽聚集。

活性药物成分（API）的物理稳定性是药物开发中的一个关键考虑因素。溶液条件如pH值、赋形剂浓度和储存温度会影响制剂中治疗性肽的物理稳定性。优化这些条件是实现治疗性肽产品更高稳定性的关键活动。硫黄素T （ThioT）荧光报告法被广泛用于测定肽产物的聚集。硫代物动力学分析是用来预测倾向的纤维形成使用硫代物曲线的肽储存在溶液中。然而，没有一种分析关系可用于将不同配方条件下的物理稳定性联系起来，这导致制药公司需要大量资源来执行大规模的稳定性分析。因此，有必要开发新的人工智能（AI）方法，以快速和经济的方式预测未来的物联网曲线。在这里，我们将thot测定的随时间变化的构象状态的实验测量与人工智能模型相结合，以预测药物开发过程中不同配方条件下的肽聚集。我们将多肽聚集问题表述为自然语言处理中的“语言翻译”，其中使用较早时间点的聚集状态序列来预测（或“翻译”）未来时间点的聚集状态。我们开发了一种新的基于序列到序列长短期记忆（LSTM）的递归神经网络（RNN）模型，使用不同条件下初始和1个月ThioT曲线的数据集来预测未来时间点（6个月和12个月）的整个ThioT曲线。该模型的平均绝对误差（MAE）为2.04，可用于预测和实验验证6个月时间点的ThioT曲线。与LSTM相比，多层感知机（MLP）基线模型的MAE更高，为5.17。然而，在12个月的时间点上，在训练数据有限的情况下，两种模型的结果相当，LSTM和MLP的平均MAEs分别为4.25和4.45。因此，我们得出结论，LSTM模型可以仅使用初始和1个月的ThioT曲线作为输入来预测未来的ThioT曲线。我们相信，使用递归神经网络模型将有利于制药行业预测和探索基于短期稳定性数据的原料药未来物理稳定性测量的配方前景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Molecular Pharmaceutics 医学-药学

CiteScore

8.00

自引率

6.10%

发文量

391

审稿时长

2 months

期刊介绍： Molecular Pharmaceutics publishes the results of original research that contributes significantly to the molecular mechanistic understanding of drug delivery and drug delivery systems. The journal encourages contributions describing research at the interface of drug discovery and drug development. Scientific areas within the scope of the journal include physical and pharmaceutical chemistry, biochemistry and biophysics, molecular and cellular biology, and polymer and materials science as they relate to drug and drug delivery system efficacy. Mechanistic Drug Delivery and Drug Targeting research on modulating activity and efficacy of a drug or drug product is within the scope of Molecular Pharmaceutics. Theoretical and experimental peer-reviewed research articles, communications, reviews, and perspectives are welcomed.