arXiv - QuanBio - Biomolecules最新文献_第10页

Generative Model for Small Molecules with Latent Space RL Fine-Tuning to Protein Targets 利用潜空间 RL 微调蛋白质靶点的小分子生成模型

arXiv - QuanBio - Biomolecules Pub Date : 2024-07-02 DOI: arxiv-2407.13780

Ulrich A. Mbou Sob, Qiulin Li, Miguel Arbesú, Oliver Bent, Andries P. Smit, Arnu Pretorius

{"title":"Generative Model for Small Molecules with Latent Space RL Fine-Tuning to Protein Targets","authors":"Ulrich A. Mbou Sob, Qiulin Li, Miguel Arbesú, Oliver Bent, Andries P. Smit, Arnu Pretorius","doi":"arxiv-2407.13780","DOIUrl":"https://doi.org/arxiv-2407.13780","url":null,"abstract":"A specific challenge with deep learning approaches for molecule generation is\u0000generating both syntactically valid and chemically plausible molecular string\u0000representations. To address this, we propose a novel generative latent-variable\u0000transformer model for small molecules that leverages a recently proposed\u0000molecular string representation called SAFE. We introduce a modification to\u0000SAFE to reduce the number of invalid fragmented molecules generated during\u0000training and use this to train our model. Our experiments show that our model\u0000can generate novel molecules with a validity rate > 90% and a fragmentation\u0000rate < 1% by sampling from a latent space. By fine-tuning the model using\u0000reinforcement learning to improve molecular docking, we significantly increase\u0000the number of hit candidates for five specific protein targets compared to the\u0000pre-trained model, nearly doubling this number for certain targets.\u0000Additionally, our top 5% mean docking scores are comparable to the current\u0000state-of-the-art (SOTA), and we marginally outperform SOTA on three of the five\u0000targets.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Leveraging Latent Evolutionary Optimization for Targeted Molecule Generation 利用潜在进化优化技术生成靶向分子

arXiv - QuanBio - Biomolecules Pub Date : 2024-07-02 DOI: arxiv-2407.13779

Siddartha Reddy N, Sai Prakash MV, Varun V, Vishal Vaddina, Saisubramaniam Gopalakrishnan

{"title":"Leveraging Latent Evolutionary Optimization for Targeted Molecule Generation","authors":"Siddartha Reddy N, Sai Prakash MV, Varun V, Vishal Vaddina, Saisubramaniam Gopalakrishnan","doi":"arxiv-2407.13779","DOIUrl":"https://doi.org/arxiv-2407.13779","url":null,"abstract":"Lead optimization is a pivotal task in the drug design phase within the drug\u0000discovery lifecycle. The primary objective is to refine the lead compound to\u0000meet specific molecular properties for progression to the subsequent phase of\u0000development. In this work, we present an innovative approach, Latent\u0000Evolutionary Optimization for Molecule Generation (LEOMol), a generative\u0000modeling framework for the efficient generation of optimized molecules. LEOMol\u0000leverages Evolutionary Algorithms, such as Genetic Algorithm and Differential\u0000Evolution, to search the latent space of a Variational AutoEncoder (VAE). This\u0000search facilitates the identification of the target molecule distribution\u0000within the latent space. Our approach consistently demonstrates superior\u0000performance compared to previous state-of-the-art models across a range of\u0000constrained molecule generation tasks, outperforming existing models in all\u0000four sub-tasks related to property targeting. Additionally, we suggest the\u0000importance of including toxicity in the evaluation of generative models.\u0000Furthermore, an ablation study underscores the improvements that our approach\u0000provides over gradient-based latent space optimization methods. This\u0000underscores the effectiveness and superiority of LEOMol in addressing the\u0000inherent challenges in constrained molecule generation while emphasizing its\u0000potential to propel advancements in drug discovery.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"57 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DrugCLIP: Contrastive Drug-Disease Interaction For Drug Repurposing DrugCLIP：用于药物再设计的药物-疾病对比相互作用

arXiv - QuanBio - Biomolecules Pub Date : 2024-07-02 DOI: arxiv-2407.02265

Yingzhou Lu, Yaojun Hu, Chenhao Li

引用次数: 0

AI-driven Alternative Medicine: A Novel Approach to Drug Discovery and Repurposing 人工智能驱动的替代医学：药物发现和再利用的新方法

arXiv - QuanBio - Biomolecules Pub Date : 2024-07-02 DOI: arxiv-2407.02126

Oleksandr Bilokon, Nataliya Bilokon, Paul Bilokon

引用次数: 0

FreeCG: Free the Design Space of Clebsch-Gordan Transform for machine learning force field FreeCG：为机器学习力场释放克莱布什-戈尔丹变换的设计空间

arXiv - QuanBio - Biomolecules Pub Date : 2024-07-02 DOI: arxiv-2407.02263

Shihao Shao, Haoran Geng, Qinghua Cui

{"title":"FreeCG: Free the Design Space of Clebsch-Gordan Transform for machine learning force field","authors":"Shihao Shao, Haoran Geng, Qinghua Cui","doi":"arxiv-2407.02263","DOIUrl":"https://doi.org/arxiv-2407.02263","url":null,"abstract":"The Clebsch-Gordan Transform (CG transform) effectively encodes many-body\u0000interactions. Many studies have proven its accuracy in depicting atomic\u0000environments, although this comes with high computational needs. The\u0000computational burden of this challenge is hard to reduce due to the need for\u0000permutation equivariance, which limits the design space of the CG transform\u0000layer. We show that, implementing the CG transform layer on\u0000permutation-invariant inputs allows complete freedom in the design of this\u0000layer without affecting symmetry. Developing further on this premise, our idea\u0000is to create a CG transform layer that operates on permutation-invariant\u0000abstract edges generated from real edge information. We bring in group CG\u0000transform with sparse path, abstract edges shuffling, and attention enhancer to\u0000form a powerful and efficient CG transform layer. Our method, known as FreeCG,\u0000achieves State-of-The-Art (SoTA) results in force prediction for MD17, rMD17,\u0000MD22, and property prediction in QM9 datasets with notable enhancement. It\u0000introduces a novel paradigm for carrying out efficient and expressive CG\u0000transform in future geometric neural network designs.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"209 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141525289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

ZeroDDI: A Zero-Shot Drug-Drug Interaction Event Prediction Method with Semantic Enhanced Learning and Dual-Modal Uniform Alignment ZeroDDI：采用语义增强学习和双模式统一配准的零镜头药物相互作用事件预测方法

arXiv - QuanBio - Biomolecules Pub Date : 2024-07-01 DOI: arxiv-2407.00891

Ziyan Wang, Zhankun Xiong, Feng Huang, Xuan Liu, Wen Zhang

{"title":"ZeroDDI: A Zero-Shot Drug-Drug Interaction Event Prediction Method with Semantic Enhanced Learning and Dual-Modal Uniform Alignment","authors":"Ziyan Wang, Zhankun Xiong, Feng Huang, Xuan Liu, Wen Zhang","doi":"arxiv-2407.00891","DOIUrl":"https://doi.org/arxiv-2407.00891","url":null,"abstract":"Drug-drug interactions (DDIs) can result in various pharmacological changes,\u0000which can be categorized into different classes known as DDI events (DDIEs). In\u0000recent years, previously unobserved/unseen DDIEs have been emerging, posing a\u0000new classification task when unseen classes have no labelled instances in the\u0000training stage, which is formulated as a zero-shot DDIE prediction (ZS-DDIE)\u0000task. However, existing computational methods are not directly applicable to\u0000ZS-DDIE, which has two primary challenges: obtaining suitable DDIE\u0000representations and handling the class imbalance issue. To overcome these\u0000challenges, we propose a novel method named ZeroDDI for the ZS-DDIE task.\u0000Specifically, we design a biological semantic enhanced DDIE representation\u0000learning module, which emphasizes the key biological semantics and distills\u0000discriminative molecular substructure-related semantics for DDIE representation\u0000learning. Furthermore, we propose a dual-modal uniform alignment strategy to\u0000distribute drug pair representations and DDIE semantic representations\u0000uniformly in a unit sphere and align the matched ones, which can mitigate the\u0000issue of class imbalance. Extensive experiments showed that ZeroDDI surpasses\u0000the baselines and indicate that it is a promising tool for detecting unseen\u0000DDIEs. Our code has been released in https://github.com/wzy-Sarah/ZeroDDI.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"33 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141525288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Impact of Co-Excipient Selection on Hydrophobic Polymer Folding: Insights for Optimal Formulation Design 助剂选择对疏水性聚合物折叠的影响：优化配方设计的启示

arXiv - QuanBio - Biomolecules Pub Date : 2024-07-01 DOI: arxiv-2407.00885

Jonathan W. P. Zajac, Praveen Muralikrishnan, Caryn L. Heldt, Sarah L. Perry, Sapna Sarupria

{"title":"Impact of Co-Excipient Selection on Hydrophobic Polymer Folding: Insights for Optimal Formulation Design","authors":"Jonathan W. P. Zajac, Praveen Muralikrishnan, Caryn L. Heldt, Sarah L. Perry, Sapna Sarupria","doi":"arxiv-2407.00885","DOIUrl":"https://doi.org/arxiv-2407.00885","url":null,"abstract":"The stabilization of liquid biological products is a complex task that\u0000depends on the chemical composition of both the active ingredient and any\u0000excipients in solution. Frequently, a large number of unique excipients are\u0000required to stabilize biologics, though it is not well-known how these\u0000excipients interact with one another. To probe these excipient-excipient\u0000interactions, we performed molecular dynamics simulations of arginine -- a\u0000widely used excipient with unique properties -- in solution either alone or\u0000with equimolar lysine or glutamate. We studied the effects of these mixtures on\u0000a hydrophobic polymer model to isolate excipient mechanisms on hydrophobic\u0000interactions, relevant to both protein folding and biomolecular self-assembly.\u0000We observed that arginine is the most effective single excipient in stabilizing\u0000hydrophobic polymer collapse, and its effectiveness can be augmented by lysine\u0000or glutamate addition. We utilized a decomposition of the potential of mean\u0000force to identify that the key source of arginine-lysine and arginine-glutamate\u0000synergy on polymer collapse is a reduction in attractive polymer-excipient\u0000direct interactions. Further, we applied principles from network theory to\u0000characterize the local solvent network that embeds the hydrophobic polymer.\u0000Through this approach, we found that arginine enables a more highly connected\u0000and stable network than in pure water, lysine, or glutamate solutions.\u0000Importantly, these network properties are preserved when lysine or glutamate\u0000are added to arginine solutions. Overall, we highlight the importance of\u0000identifying key molecular consequences of co-excipient selection, aiding in the\u0000establishment of rational formulation design rules.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"111 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141525290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Frontiers in integrative structural biology: modeling disordered proteins and utilizing in situ data 综合结构生物学前沿：无序蛋白质建模和利用原位数据

arXiv - QuanBio - Biomolecules Pub Date : 2024-06-30 DOI: arxiv-2407.00566

Kartik Majila, Shreyas Arvindekar, Muskaan Jindal, Shruthi Viswanath

{"title":"Frontiers in integrative structural biology: modeling disordered proteins and utilizing in situ data","authors":"Kartik Majila, Shreyas Arvindekar, Muskaan Jindal, Shruthi Viswanath","doi":"arxiv-2407.00566","DOIUrl":"https://doi.org/arxiv-2407.00566","url":null,"abstract":"Integrative modeling enables structure determination for large macromolecular\u0000assemblies by combining data from multiple sources of experiment data with\u0000theoretical and computational predictions. Recent advancements in AI-based\u0000structure prediction and electron cryo-microscopy have sparked renewed\u0000enthusiasm for integrative modeling; structures from AI-based methods can be\u0000integrated with in situ maps to characterize large assemblies. This approach\u0000previously allowed us and others to determine the architectures of diverse\u0000macromolecular assemblies, such as nuclear pore complexes, chromatin\u0000remodelers, and cell-cell junctions. Experimental data spanning several scales\u0000was used in these studies, ranging from high-resolution data, such as X-ray\u0000crystallography and Alphafold structures, to low-resolution data, such as\u0000cryo-electron tomography maps and data from co-immunoprecipitation experiments.\u0000Two recurrent modeling challenges emerged across a range of studies. First,\u0000modeling disordered regions, which constituted a significant portion of these\u0000assemblies, necessitated the development of new methods. Second, methods needed\u0000to be developed to utilize the information from cryo-electron tomography, a\u0000timely challenge as structural biology is increasingly moving towards in situ\u0000characterization. Here, we recapitulate recent developments in the modeling of\u0000disordered proteins and the analysis of cryo-electron tomography data and\u0000highlight opportunities for method development in the context of integrative\u0000modeling.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"26 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141509046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DCI: An Accurate Quality Assessment Criteria for Protein Complex Structure Models DCI：蛋白质复杂结构模型的精确质量评估标准

arXiv - QuanBio - Biomolecules Pub Date : 2024-06-30 DOI: arxiv-2407.00560

Wenda Wang, Jiaqi Zhai, He Huang, Xinqi Gong

{"title":"DCI: An Accurate Quality Assessment Criteria for Protein Complex Structure Models","authors":"Wenda Wang, Jiaqi Zhai, He Huang, Xinqi Gong","doi":"arxiv-2407.00560","DOIUrl":"https://doi.org/arxiv-2407.00560","url":null,"abstract":"The structure of proteins is the basis for studying protein function and drug\u0000design. The emergence of AlphaFold 2 has greatly promoted the prediction of\u0000protein 3D structures, and it is of great significance to give an overall and\u0000accurate evaluation of the predicted models, especially the complex models.\u0000Among the existing methods for evaluating multimer structures, DockQ is the\u0000most commonly used. However, as a more suitable metric for complex docking,\u0000DockQ cannot provide a unique and accurate evaluation in the non-docking\u0000situation. Therefore, it is necessary to propose an evaluation strategy that\u0000can directly evaluate the whole complex without limitation and achieve good\u0000results. In this work, we proposed DCI score, a new evaluation strategy for\u0000protein complex structure models, which only bases on distance map and CI\u0000(contact-interface) map, DCI focuses on the prediction accuracy of the contact\u0000interface based on the overall evaluation of complex structure, is not inferior\u0000to DockQ in the evaluation accuracy according to CAPRI classification, and is\u0000able to handle the non-docking situation better than DockQ. Besides, we\u0000calculated DCI score on CASP datasets and compared it with CASP official\u0000assessment, which obtained good results. In addition, we found that DCI can\u0000better evaluate the overall structure deviation caused by interface prediction\u0000errors in the case of multi-chains. Our DCI is available at\u0000url{https://gitee.com/WendaWang/DCI-score.git}, and the online-server is\u0000available at url{http://mialab.ruc.edu.cn/DCIServer/}.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141525287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

T- Hop: A framework for studying the importance path information in molecular graphs for chemical property prediction T- Hop：研究分子图中重要路径信息以预测化学性质的框架

arXiv - QuanBio - Biomolecules Pub Date : 2024-06-29 DOI: arxiv-2407.14270

Abdulrahman Ibraheem, Narsis Kiani, Jesper Tegner

{"title":"T- Hop: A framework for studying the importance path information in molecular graphs for chemical property prediction","authors":"Abdulrahman Ibraheem, Narsis Kiani, Jesper Tegner","doi":"arxiv-2407.14270","DOIUrl":"https://doi.org/arxiv-2407.14270","url":null,"abstract":"This paper studies the usefulness of incorporating path information in\u0000predicting chemical properties from molecular graphs, in the domain of QSAR\u0000(Quantitative Structure-Activity Relationship). Towards this, we developed a\u0000GNN-style model which can be toggled to operate in one of two modes: a\u0000non-degenerate mode which incorporates path information, and a degenerate mode\u0000which leaves out path information. Thus, by comparing the performance of the\u0000non-degenerate mode versus the degenerate mode on relevant QSAR datasets, we\u0000were able to directly assess the significance of path information on those\u0000datasets. Our results corroborate previous works, by suggesting that the\u0000usefulness of path information is datasetdependent. Unlike previous studies\u0000however, we took the very first steps towards building a model that could\u0000predict upfront whether or not path information would be useful for a given\u0000dataset at hand. Moreover, we also found that, albeit its simplicity, the\u0000degenerate mode of our model yielded rather surprising results, which\u0000outperformed more sophisticated SOTA models in certain cases.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"129 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0