KNU-DTI: KNowledge United Drug-Target Interaction prediction

IF 7 2区医学 Q1 BIOLOGY

Computers in biology and medicine Pub Date : 2025-03-01 DOI:10.1016/j.compbiomed.2025.109927

Ryong Heo , Dahyeon Lee , Byung Ju Kim , Sangmin Seo , Sanghyun Park , Chihyun Park

{"title":"KNU-DTI: KNowledge United Drug-Target Interaction prediction","authors":"Ryong Heo , Dahyeon Lee , Byung Ju Kim , Sangmin Seo , Sanghyun Park , Chihyun Park","doi":"10.1016/j.compbiomed.2025.109927","DOIUrl":null,"url":null,"abstract":"<div><h3>Motivation</h3><div>Accurately predicting drug-target protein interactions (DTI) is a cornerstone of drug discovery, enabling the identification of potential therapeutic compounds. Sequence-based prediction models, despite their simplicity, hold great promise in extracting essential information directly from raw sequences. However, the focus in recent DTI studies has increasingly shifted toward enhancing algorithmic complexity, often at the expense of fully leveraging robust sequence representation learning methods. This shift has led to the underestimation and gradual neglect of methodologies aimed at effectively capturing discriminative features from sequences. Our work seeks to address this oversight by emphasizing the value of well-constructed sequence representation algorithms, demonstrating that even with simple interaction mapping algorithm techniques, accurate DTI models can be achieved. By prioritizing meaningful information extraction over excessive model complexity, we aim to advance the development of practical and generalizable DTI prediction frameworks.</div></div><div><h3>Results</h3><div>We developed the KNowledge Uniting DTI model (KNU-DTI), which retrieves structural information and unites them. Protein structural properties were obtained using structural property sequence (SPS). Extended-connectivity fingerprint (ECFP) was used to estimate the structure-activity relationship in molecules. Including these two features, a total of five latent vectors were derived from protein and molecule via various neural networks and integrated by elemental-wise addition to predict binding interactions or affinity. Using four test concepts to evaluate the model, we show that the model outperforms recently published competitors. Finally, a case study indicated that our model has a competitive edge over existing docking simulations in some cases.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"189 ","pages":"Article 109927"},"PeriodicalIF":7.0000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525002781","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Motivation

Accurately predicting drug-target protein interactions (DTI) is a cornerstone of drug discovery, enabling the identification of potential therapeutic compounds. Sequence-based prediction models, despite their simplicity, hold great promise in extracting essential information directly from raw sequences. However, the focus in recent DTI studies has increasingly shifted toward enhancing algorithmic complexity, often at the expense of fully leveraging robust sequence representation learning methods. This shift has led to the underestimation and gradual neglect of methodologies aimed at effectively capturing discriminative features from sequences. Our work seeks to address this oversight by emphasizing the value of well-constructed sequence representation algorithms, demonstrating that even with simple interaction mapping algorithm techniques, accurate DTI models can be achieved. By prioritizing meaningful information extraction over excessive model complexity, we aim to advance the development of practical and generalizable DTI prediction frameworks.

Results

We developed the KNowledge Uniting DTI model (KNU-DTI), which retrieves structural information and unites them. Protein structural properties were obtained using structural property sequence (SPS). Extended-connectivity fingerprint (ECFP) was used to estimate the structure-activity relationship in molecules. Including these two features, a total of five latent vectors were derived from protein and molecule via various neural networks and integrated by elemental-wise addition to predict binding interactions or affinity. Using four test concepts to evaluate the model, we show that the model outperforms recently published competitors. Finally, a case study indicated that our model has a competitive edge over existing docking simulations in some cases.

Abstract Image

查看原文本刊更多论文

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers in biology and medicine 工程技术-工程：生物医学

CiteScore

11.70

自引率

10.40%

发文量

1086

审稿时长

74 days

期刊介绍： Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.