HyPepTox-Fuse: An interpretable hybrid framework for accurate peptide toxicity prediction fusing protein language model-based embeddings with conventional descriptors.

IF 8.9

Journal of pharmaceutical analysis Pub Date : 2025-08-01 Epub Date: 2025-07-24 DOI:10.1016/j.jpha.2025.101410

Duong Thanh Tran, Nhat Truong Pham, Nguyen Doan Hieu Nguyen, Leyi Wei, Balachandran Manavalan

{"title":"HyPepTox-Fuse: An interpretable hybrid framework for accurate peptide toxicity prediction fusing protein language model-based embeddings with conventional descriptors.","authors":"Duong Thanh Tran, Nhat Truong Pham, Nguyen Doan Hieu Nguyen, Leyi Wei, Balachandran Manavalan","doi":"10.1016/j.jpha.2025.101410","DOIUrl":null,"url":null,"abstract":"<p><p>Peptide-based therapeutics hold great promise for the treatment of various diseases; however, their clinical application is often hindered by toxicity challenges. The accurate prediction of peptide toxicity is crucial for designing safe peptide-based therapeutics. While traditional experimental approaches are time-consuming and expensive, computational methods have emerged as viable alternatives, including similarity-based and machine learning (ML)-/deep learning (DL)-based methods. However, existing methods often struggle with robustness and generalizability. To address these challenges, we propose HyPepTox-Fuse, a novel framework that fuses protein language model (PLM)-based embeddings with conventional descriptors. HyPepTox-Fuse integrates ensemble PLM-based embeddings to achieve richer peptide representations by leveraging a cross-modal multi-head attention mechanism and Transformer architecture. A robust feature ranking and selection pipeline further refines conventional descriptors, thus enhancing prediction performance. Our framework outperforms state-of-the-art methods in cross-validation and independent evaluations, offering a scalable and reliable tool for peptide toxicity prediction. Moreover, we conducted a case study to validate the robustness and generalizability of HyPepTox-Fuse, highlighting its effectiveness in enhancing model performance. Furthermore, the HyPepTox-Fuse server is freely accessible at https://balalab-skku.org/HyPepTox-Fuse/ and the source code is publicly available at https://github.com/cbbl-skku-org/HyPepTox-Fuse/. The study thus presents an intuitive platform for predicting peptide toxicity and supports reproducibility through openly available datasets.</p>","PeriodicalId":94338,"journal":{"name":"Journal of pharmaceutical analysis","volume":"15 8","pages":"101410"},"PeriodicalIF":8.9000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12446778/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of pharmaceutical analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.jpha.2025.101410","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/24 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Peptide-based therapeutics hold great promise for the treatment of various diseases; however, their clinical application is often hindered by toxicity challenges. The accurate prediction of peptide toxicity is crucial for designing safe peptide-based therapeutics. While traditional experimental approaches are time-consuming and expensive, computational methods have emerged as viable alternatives, including similarity-based and machine learning (ML)-/deep learning (DL)-based methods. However, existing methods often struggle with robustness and generalizability. To address these challenges, we propose HyPepTox-Fuse, a novel framework that fuses protein language model (PLM)-based embeddings with conventional descriptors. HyPepTox-Fuse integrates ensemble PLM-based embeddings to achieve richer peptide representations by leveraging a cross-modal multi-head attention mechanism and Transformer architecture. A robust feature ranking and selection pipeline further refines conventional descriptors, thus enhancing prediction performance. Our framework outperforms state-of-the-art methods in cross-validation and independent evaluations, offering a scalable and reliable tool for peptide toxicity prediction. Moreover, we conducted a case study to validate the robustness and generalizability of HyPepTox-Fuse, highlighting its effectiveness in enhancing model performance. Furthermore, the HyPepTox-Fuse server is freely accessible at https://balalab-skku.org/HyPepTox-Fuse/ and the source code is publicly available at https://github.com/cbbl-skku-org/HyPepTox-Fuse/. The study thus presents an intuitive platform for predicting peptide toxicity and supports reproducibility through openly available datasets.

查看原文本刊更多论文

HyPepTox-Fuse：一个可解释的混合框架，用于准确的肽毒性预测，融合了基于蛋白质语言模型的嵌入和传统描述符。

以肽为基础的治疗方法对治疗各种疾病具有很大的希望；然而，它们的临床应用往往受到毒性挑战的阻碍。肽毒性的准确预测对于设计安全的基于肽的治疗方法至关重要。虽然传统的实验方法既耗时又昂贵，但计算方法已经成为可行的替代方法，包括基于相似性和基于机器学习(ML) /深度学习（DL）的方法。然而，现有的方法往往与鲁棒性和泛化性作斗争。为了解决这些挑战，我们提出了HyPepTox-Fuse，这是一个将基于蛋白质语言模型（PLM）的嵌入与传统描述符融合在一起的新框架。HyPepTox-Fuse集成了基于集成plm的嵌入，通过利用跨模态多头注意机制和Transformer架构实现更丰富的肽表示。鲁棒的特征排序和选择管道进一步细化了传统的描述符，从而提高了预测性能。我们的框架在交叉验证和独立评估方面优于最先进的方法，为肽毒性预测提供了可扩展和可靠的工具。此外，我们进行了一个案例研究，以验证HyPepTox-Fuse的鲁棒性和泛化性，突出其在提高模型性能方面的有效性。此外，HyPepTox-Fuse服务器可以在https://balalab-skku.org/HyPepTox-Fuse/上免费访问，源代码可以在https://github.com/cbbl-skku-org/HyPepTox-Fuse/上公开获取。因此，该研究为预测肽毒性提供了一个直观的平台，并通过公开可用的数据集支持可重复性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of pharmaceutical analysis

自引率

0.00%

发文量