通过整合深度q网络进行结构分析的ai驱动蛋白口袋检测。

IF 3.1 3区生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY

Journal of Computer-Aided Molecular Design Pub Date : 2025-10-06 DOI:10.1007/s10822-025-00669-2

Prashanth Choppara, Lokesh Bommareddy

{"title":"通过整合深度q网络进行结构分析的ai驱动蛋白口袋检测。","authors":"Prashanth Choppara, Lokesh Bommareddy","doi":"10.1007/s10822-025-00669-2","DOIUrl":null,"url":null,"abstract":"Protein pockets, or small cavities on the protein surface, are critical sites for enzymatic catalysis, molecular recognition, and drug binding. Accurately identifying these pockets is crucial for understanding protein function and designing therapeutic interventions. Traditional computational methods such as molecular docking, surface grid mapping, and molecular dynamics simulations are hampered by the use of fixed protein structures, and therefore it is challenging to identify cryptic pockets when they appear under physiological conditions. We propose a deep reinforcement learning (DRL) technique based on deep Q-networks (DQN) to identify precise protein pockets. Our strategy to improve the prediction of functional binding sites incorporates important molecular descriptors such as spatial coordinates, solvent-accessible surface area (SASA), hydrophobicity, and electrostatic charge. We pre-process protein structure data from the protein data bank (PDB) through feature extraction and selection methods, including variance threshold filtering and dimensionality reduction using an autoencoder. The sparse feature representation enables efficient training of a DQN agent, which navigates protein surfaces and iteratively optimizes pocket predictions. By using reinforcement learning concepts, the model adapts its pocket detection strategy according to the learned reward signals, increasing sensitivity and specificity. The method is tested on benchmark datasets and is found to exhibit superior performance in detecting well-defined and cryptic pockets over traditional computational methods. Experimental evidence suggests that our model successfully identifies binding sites in various protein families, with significant implications for drug discovery and protein-ligand interaction studies. Moreover, the model's ability to incorporate geometric and biochemical features allows for a better understanding of pocket functionality. The scalability of our method makes it an important tool for large-scale virtual screening and personalized medicine. By using deep reinforcement learning, this research provides a new and effective framework for protein pocket prediction, opening up opportunities for developing new tools in structural bioinformatics, drug design, and molecular biology research.","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"39 1","pages":"90"},"PeriodicalIF":3.1000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AI-driven protein pocket detection through integrating deep Q-networks for structural analysis.\",\"authors\":\"Prashanth Choppara, Lokesh Bommareddy\",\"doi\":\"10.1007/s10822-025-00669-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Protein pockets, or small cavities on the protein surface, are critical sites for enzymatic catalysis, molecular recognition, and drug binding. Accurately identifying these pockets is crucial for understanding protein function and designing therapeutic interventions. Traditional computational methods such as molecular docking, surface grid mapping, and molecular dynamics simulations are hampered by the use of fixed protein structures, and therefore it is challenging to identify cryptic pockets when they appear under physiological conditions. We propose a deep reinforcement learning (DRL) technique based on deep Q-networks (DQN) to identify precise protein pockets. Our strategy to improve the prediction of functional binding sites incorporates important molecular descriptors such as spatial coordinates, solvent-accessible surface area (SASA), hydrophobicity, and electrostatic charge. We pre-process protein structure data from the protein data bank (PDB) through feature extraction and selection methods, including variance threshold filtering and dimensionality reduction using an autoencoder. The sparse feature representation enables efficient training of a DQN agent, which navigates protein surfaces and iteratively optimizes pocket predictions. By using reinforcement learning concepts, the model adapts its pocket detection strategy according to the learned reward signals, increasing sensitivity and specificity. The method is tested on benchmark datasets and is found to exhibit superior performance in detecting well-defined and cryptic pockets over traditional computational methods. Experimental evidence suggests that our model successfully identifies binding sites in various protein families, with significant implications for drug discovery and protein-ligand interaction studies. Moreover, the model's ability to incorporate geometric and biochemical features allows for a better understanding of pocket functionality. The scalability of our method makes it an important tool for large-scale virtual screening and personalized medicine. By using deep reinforcement learning, this research provides a new and effective framework for protein pocket prediction, opening up opportunities for developing new tools in structural bioinformatics, drug design, and molecular biology research.\",\"PeriodicalId\":621,\"journal\":{\"name\":\"Journal of Computer-Aided Molecular Design\",\"volume\":\"39 1\",\"pages\":\"90\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computer-Aided Molecular Design\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1007/s10822-025-00669-2\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer-Aided Molecular Design","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s10822-025-00669-2","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

蛋白质口袋，或蛋白质表面的小空腔，是酶催化、分子识别和药物结合的关键位点。准确识别这些口袋对于理解蛋白质功能和设计治疗干预措施至关重要。传统的计算方法，如分子对接、表面网格映射和分子动力学模拟，由于固定蛋白质结构的使用而受到阻碍，因此在生理条件下识别隐藏口袋是具有挑战性的。我们提出了一种基于深度q网络（DQN）的深度强化学习（DRL）技术来精确识别蛋白质口袋。我们改进功能结合位点预测的策略结合了重要的分子描述符，如空间坐标、溶剂可及表面积（SASA）、疏水性和静电荷。我们通过特征提取和选择方法，包括方差阈值滤波和自编码器降维，对蛋白质数据库中的蛋白质结构数据进行预处理。稀疏特征表示能够有效地训练DQN代理，该代理可以导航蛋白质表面并迭代优化口袋预测。该模型利用强化学习概念，根据学习到的奖励信号调整口袋检测策略，提高了灵敏度和特异性。该方法在基准数据集上进行了测试，发现在检测定义良好的和隐藏的口袋方面比传统的计算方法表现出更好的性能。实验证据表明，我们的模型成功地识别了各种蛋白质家族的结合位点，这对药物发现和蛋白质-配体相互作用研究具有重要意义。此外，该模型结合几何和生化特征的能力可以更好地理解口袋功能。该方法的可扩展性使其成为大规模虚拟筛查和个性化医疗的重要工具。通过使用深度强化学习，本研究为蛋白质口袋预测提供了一个新的有效框架，为结构生物信息学、药物设计和分子生物学研究开发新工具提供了机会。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

AI-driven protein pocket detection through integrating deep Q-networks for structural analysis.

Protein pockets, or small cavities on the protein surface, are critical sites for enzymatic catalysis, molecular recognition, and drug binding. Accurately identifying these pockets is crucial for understanding protein function and designing therapeutic interventions. Traditional computational methods such as molecular docking, surface grid mapping, and molecular dynamics simulations are hampered by the use of fixed protein structures, and therefore it is challenging to identify cryptic pockets when they appear under physiological conditions. We propose a deep reinforcement learning (DRL) technique based on deep Q-networks (DQN) to identify precise protein pockets. Our strategy to improve the prediction of functional binding sites incorporates important molecular descriptors such as spatial coordinates, solvent-accessible surface area (SASA), hydrophobicity, and electrostatic charge. We pre-process protein structure data from the protein data bank (PDB) through feature extraction and selection methods, including variance threshold filtering and dimensionality reduction using an autoencoder. The sparse feature representation enables efficient training of a DQN agent, which navigates protein surfaces and iteratively optimizes pocket predictions. By using reinforcement learning concepts, the model adapts its pocket detection strategy according to the learned reward signals, increasing sensitivity and specificity. The method is tested on benchmark datasets and is found to exhibit superior performance in detecting well-defined and cryptic pockets over traditional computational methods. Experimental evidence suggests that our model successfully identifies binding sites in various protein families, with significant implications for drug discovery and protein-ligand interaction studies. Moreover, the model's ability to incorporate geometric and biochemical features allows for a better understanding of pocket functionality. The scalability of our method makes it an important tool for large-scale virtual screening and personalized medicine. By using deep reinforcement learning, this research provides a new and effective framework for protein pocket prediction, opening up opportunities for developing new tools in structural bioinformatics, drug design, and molecular biology research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Computer-Aided Molecular Design 生物-计算机：跨学科应用

CiteScore

8.00

自引率

8.60%

发文量

审稿时长

3 months

期刊介绍： The Journal of Computer-Aided Molecular Design provides a form for disseminating information on both the theory and the application of computer-based methods in the analysis and design of molecules. The scope of the journal encompasses papers which report new and original research and applications in the following areas: - theoretical chemistry; - computational chemistry; - computer and molecular graphics; - molecular modeling; - protein engineering; - drug design; - expert systems; - general structure-property relationships; - molecular dynamics; - chemical database development and usage.