Decomposed Direct Preference Optimization for Structure-Based Drug Design

arXiv - QuanBio - Biomolecules Pub Date : 2024-07-19 DOI:arxiv-2407.13981

Xiwei Cheng, Xiangxin Zhou, Yuwei Yang, Yu Bao, Quanquan Gu

{"title":"Decomposed Direct Preference Optimization for Structure-Based Drug Design","authors":"Xiwei Cheng, Xiangxin Zhou, Yuwei Yang, Yu Bao, Quanquan Gu","doi":"arxiv-2407.13981","DOIUrl":null,"url":null,"abstract":"Diffusion models have achieved promising results for Structure-Based Drug\nDesign (SBDD). Nevertheless, high-quality protein subpocket and ligand data are\nrelatively scarce, which hinders the models' generation capabilities. Recently,\nDirect Preference Optimization (DPO) has emerged as a pivotal tool for the\nalignment of generative models such as large language models and diffusion\nmodels, providing greater flexibility and accuracy by directly aligning model\noutputs with human preferences. Building on this advancement, we introduce DPO\nto SBDD in this paper. We tailor diffusion models to pharmaceutical needs by\naligning them with elaborately designed chemical score functions. We propose a\nnew structure-based molecular optimization method called DecompDPO, which\ndecomposes the molecule into arms and scaffolds and performs preference\noptimization at both local substructure and global molecule levels, allowing\nfor more precise control with fine-grained preferences. Notably, DecompDPO can\nbe effectively used for two main purposes: (1) fine-tuning pretrained diffusion\nmodels for molecule generation across various protein families, and (2)\nmolecular optimization given a specific protein subpocket after generation.\nExtensive experiments on the CrossDocked2020 benchmark show that DecompDPO\nsignificantly improves model performance in both molecule generation and\noptimization, with up to 100% Median High Affinity and a 54.9% Success Rate.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"47 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.13981","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Diffusion models have achieved promising results for Structure-Based Drug Design (SBDD). Nevertheless, high-quality protein subpocket and ligand data are relatively scarce, which hinders the models' generation capabilities. Recently, Direct Preference Optimization (DPO) has emerged as a pivotal tool for the alignment of generative models such as large language models and diffusion models, providing greater flexibility and accuracy by directly aligning model outputs with human preferences. Building on this advancement, we introduce DPO to SBDD in this paper. We tailor diffusion models to pharmaceutical needs by aligning them with elaborately designed chemical score functions. We propose a new structure-based molecular optimization method called DecompDPO, which decomposes the molecule into arms and scaffolds and performs preference optimization at both local substructure and global molecule levels, allowing for more precise control with fine-grained preferences. Notably, DecompDPO can be effectively used for two main purposes: (1) fine-tuning pretrained diffusion models for molecule generation across various protein families, and (2) molecular optimization given a specific protein subpocket after generation. Extensive experiments on the CrossDocked2020 benchmark show that DecompDPO significantly improves model performance in both molecule generation and optimization, with up to 100% Median High Affinity and a 54.9% Success Rate.

查看原文本刊更多论文

基于结构的药物设计的分解直接偏好优化

扩散模型在基于结构的药物设计（SBDD）中取得了可喜的成果。然而，高质量的蛋白质子口袋和配体数据相对稀缺，阻碍了模型的生成能力。最近，直接偏好优化（Direct Preference Optimization，DPO）已成为大语言模型和扩散模型等生成模型对齐的关键工具，通过直接将模型输出与人类偏好对齐，提供了更大的灵活性和准确性。基于这一进步，我们在本文中将 DPO 引入 SBDD。我们通过将扩散模型与精心设计的化学评分函数相匹配，使其符合制药需求。我们提出了一种新的基于结构的分子优化方法--DecompDPO，它能将分子分解为臂和支架，并在局部子结构和全局分子水平上执行偏好优化，从而实现更精确的细粒度偏好控制。值得注意的是，DecompDPO 可有效用于两个主要目的：（1）微调预训练扩散模型，用于生成不同蛋白质家族的分子；（2）生成后对特定蛋白质子口袋进行分子优化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - QuanBio - Biomolecules

自引率

0.00%

发文量