Minjun Kwon , Yong Eun Jang , Ji Su Hwang , Seok Gi Kim , Nimisha Pradeep George , Shaherin Basith , Gwang Lee
{"title":"EnsemPred-ACP:结合机器和深度学习改进抗癌肽预测","authors":"Minjun Kwon , Yong Eun Jang , Ji Su Hwang , Seok Gi Kim , Nimisha Pradeep George , Shaherin Basith , Gwang Lee","doi":"10.1016/j.compbiomed.2025.110668","DOIUrl":null,"url":null,"abstract":"<div><div>Anticancer peptide (ACP) has emerged as potent therapeutic agents owing to its ability to selectively target cancer cells while minimising toxicity to healthy cells. However, the accurate computational prediction of ACP remains challenging because of the complex molecular mechanisms underlying cancer. In this study, we introduce EnsemPred-ACP, an innovative ensemble framework that combines machine learning (ML) and deep learning (DL) approaches to enhance ACP prediction. Our primary innovation is the introduction of binary profile features (BPF) to augment pre-trained protein embeddings, thereby capturing position-specific patterns crucial for ACP identification. The framework used a dual-pipeline architecture; ML models processed handcrafted sequence features and embeddings, whereas DL models handled BPF-enhanced embeddings. Upon evaluation with independent datasets, EnsemPred-ACP achieved an accuracy of 0.863, sensitivity of 0.897, and specificity of 0.830, notably outperforming existing methods. The model demonstrated a strong generalisation performance, achieving an area under the receiver operating characteristic curve of 0.93. Ablation studies on independent datasets further highlighted the substantial impact of BPF, enhancing the prediction accuracy by 2.5 % and 11.1 % when integrated with ESM2 and ProtT5 embeddings, respectively. These results demonstrate the effectiveness of our integrated approach in accurately identifying potential therapeutic peptides, thereby contributing to the advancement of peptide-based cancer therapeutics.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"196 ","pages":"Article 110668"},"PeriodicalIF":7.0000,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"EnsemPred-ACP: Combining machine and deep learning to improve anticancer peptide prediction\",\"authors\":\"Minjun Kwon , Yong Eun Jang , Ji Su Hwang , Seok Gi Kim , Nimisha Pradeep George , Shaherin Basith , Gwang Lee\",\"doi\":\"10.1016/j.compbiomed.2025.110668\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Anticancer peptide (ACP) has emerged as potent therapeutic agents owing to its ability to selectively target cancer cells while minimising toxicity to healthy cells. However, the accurate computational prediction of ACP remains challenging because of the complex molecular mechanisms underlying cancer. In this study, we introduce EnsemPred-ACP, an innovative ensemble framework that combines machine learning (ML) and deep learning (DL) approaches to enhance ACP prediction. Our primary innovation is the introduction of binary profile features (BPF) to augment pre-trained protein embeddings, thereby capturing position-specific patterns crucial for ACP identification. The framework used a dual-pipeline architecture; ML models processed handcrafted sequence features and embeddings, whereas DL models handled BPF-enhanced embeddings. Upon evaluation with independent datasets, EnsemPred-ACP achieved an accuracy of 0.863, sensitivity of 0.897, and specificity of 0.830, notably outperforming existing methods. The model demonstrated a strong generalisation performance, achieving an area under the receiver operating characteristic curve of 0.93. Ablation studies on independent datasets further highlighted the substantial impact of BPF, enhancing the prediction accuracy by 2.5 % and 11.1 % when integrated with ESM2 and ProtT5 embeddings, respectively. These results demonstrate the effectiveness of our integrated approach in accurately identifying potential therapeutic peptides, thereby contributing to the advancement of peptide-based cancer therapeutics.</div></div>\",\"PeriodicalId\":10578,\"journal\":{\"name\":\"Computers in biology and medicine\",\"volume\":\"196 \",\"pages\":\"Article 110668\"},\"PeriodicalIF\":7.0000,\"publicationDate\":\"2025-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in biology and medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0010482525010194\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525010194","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
EnsemPred-ACP: Combining machine and deep learning to improve anticancer peptide prediction
Anticancer peptide (ACP) has emerged as potent therapeutic agents owing to its ability to selectively target cancer cells while minimising toxicity to healthy cells. However, the accurate computational prediction of ACP remains challenging because of the complex molecular mechanisms underlying cancer. In this study, we introduce EnsemPred-ACP, an innovative ensemble framework that combines machine learning (ML) and deep learning (DL) approaches to enhance ACP prediction. Our primary innovation is the introduction of binary profile features (BPF) to augment pre-trained protein embeddings, thereby capturing position-specific patterns crucial for ACP identification. The framework used a dual-pipeline architecture; ML models processed handcrafted sequence features and embeddings, whereas DL models handled BPF-enhanced embeddings. Upon evaluation with independent datasets, EnsemPred-ACP achieved an accuracy of 0.863, sensitivity of 0.897, and specificity of 0.830, notably outperforming existing methods. The model demonstrated a strong generalisation performance, achieving an area under the receiver operating characteristic curve of 0.93. Ablation studies on independent datasets further highlighted the substantial impact of BPF, enhancing the prediction accuracy by 2.5 % and 11.1 % when integrated with ESM2 and ProtT5 embeddings, respectively. These results demonstrate the effectiveness of our integrated approach in accurately identifying potential therapeutic peptides, thereby contributing to the advancement of peptide-based cancer therapeutics.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.