Incorporating Neighboring Protein Features for Enhanced Drug-Target Interaction Prediction: A Comparative Analysis of Similarity-Based Alignment Methods.
{"title":"Incorporating Neighboring Protein Features for Enhanced Drug-Target Interaction Prediction: A Comparative Analysis of Similarity-Based Alignment Methods.","authors":"Xiaoqing Ru, Chao Zha, Xin Gao","doi":"10.1021/acs.jcim.5c00979","DOIUrl":null,"url":null,"abstract":"<p><p>Drug-target interaction (DTI) prediction is a fundamental computational task in drug discovery. Despite recent advancements, existing approaches often suffer from data sparsity and fail to capture the intricate nature of molecular interactions, limiting predictive performance. To address these challenges, we propose a novel DTI prediction framework that enhances both accuracy and interpretability by incorporating features from highly similar protein neighbors. Our framework extracts chemical and physicochemical features from drug-target binding affinity data and integrates interaction features from highly similar protein neighbors to enrich representation. To identify these neighbors, we employ a range of protein similarity alignment algorithms, including BLAST, MUSCLE, MAFFT, Clustal Omega and Foldseek. Experiments on the Davis and KIBA data sets demonstrate that incorporating features from high-similarity neighbors substantially improves prediction accuracy. Further analysis reveals that top-ranked neighbors contribute the most to performance gains, underscoring the importance of similarity-based feature augmentation. Additionally, comparisons among alignment methods highlight their robustness in neighbor selection, and case studies confirm the biological relevance of shared targets among closely related proteins. Overall, our framework presents a novel solution to data sparsity, improves predictive performance, and enhances model interpretability. This work lays a solid foundation for precise DTI prediction and provides valuable insights for advancing computational methods in drug discovery.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"7701-7711"},"PeriodicalIF":5.3000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.5c00979","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0
Abstract
Drug-target interaction (DTI) prediction is a fundamental computational task in drug discovery. Despite recent advancements, existing approaches often suffer from data sparsity and fail to capture the intricate nature of molecular interactions, limiting predictive performance. To address these challenges, we propose a novel DTI prediction framework that enhances both accuracy and interpretability by incorporating features from highly similar protein neighbors. Our framework extracts chemical and physicochemical features from drug-target binding affinity data and integrates interaction features from highly similar protein neighbors to enrich representation. To identify these neighbors, we employ a range of protein similarity alignment algorithms, including BLAST, MUSCLE, MAFFT, Clustal Omega and Foldseek. Experiments on the Davis and KIBA data sets demonstrate that incorporating features from high-similarity neighbors substantially improves prediction accuracy. Further analysis reveals that top-ranked neighbors contribute the most to performance gains, underscoring the importance of similarity-based feature augmentation. Additionally, comparisons among alignment methods highlight their robustness in neighbor selection, and case studies confirm the biological relevance of shared targets among closely related proteins. Overall, our framework presents a novel solution to data sparsity, improves predictive performance, and enhances model interpretability. This work lays a solid foundation for precise DTI prediction and provides valuable insights for advancing computational methods in drug discovery.
期刊介绍:
The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery.
Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field.
As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.