Decomposed Direct Preference Optimization for Structure-Based Drug Design

Xiwei Cheng, Xiangxin Zhou, Yuwei Yang, Yu Bao, Quanquan Gu
Published in: arXiv - QuanBio - Biomolecules, 2024-07-19. DOI: arxiv-2407.13981 (https://doi.org/arxiv-2407.13981)

Abstract

Diffusion models have achieved promising results for Structure-Based Drug Design (SBDD). Nevertheless, high-quality protein subpocket and ligand data are relatively scarce, which hinders the models' generation capabilities. Recently, Direct Preference Optimization (DPO) has emerged as a pivotal tool for the alignment of generative models such as large language models and diffusion models, providing greater flexibility and accuracy by directly aligning model outputs with human preferences. Building on this advancement, we introduce DPO to SBDD in this paper. We tailor diffusion models to pharmaceutical needs by aligning them with elaborately designed chemical score functions. We propose a new structure-based molecular optimization method called DecompDPO, which decomposes the molecule into arms and scaffolds and performs preference optimization at both local substructure and global molecule levels, allowing for more precise control with fine-grained preferences. Notably, DecompDPO can be effectively used for two main purposes: (1) fine-tuning pretrained diffusion models for molecule generation across various protein families, and (2) molecular optimization given a specific protein subpocket after generation. Extensive experiments on the CrossDocked2020 benchmark show that DecompDPO significantly improves model performance in both molecule generation and optimization, with up to 100% Median High Affinity and a 54.9% Success Rate.
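The combination of local and global preferences described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' objective: it uses the standard DPO loss on log-likelihood ratios, whereas a diffusion model would need a tractable surrogate for those likelihoods. The `Part` class, the `score` field, the summation used to form a molecule-level preference, and the `local_weight` parameter are all hypothetical names introduced here, and both molecules are assumed to have the same number of matched, non-empty substructures.

```python
import math
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Part:
    """One substructure (an arm or the scaffold) of a generated molecule."""
    logp: float      # log-likelihood under the model being fine-tuned
    ref_logp: float  # log-likelihood under the frozen reference model
    score: float     # hypothetical chemical score, higher is better


def log_sigmoid(x: float) -> float:
    # Numerically stable log(1 / (1 + exp(-x))).
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def dpo_loss(win: Part, lose: Part, beta: float) -> float:
    # Standard DPO term: push up the preferred sample's likelihood ratio
    # (policy vs. reference) relative to the rejected sample's.
    margin = (win.logp - win.ref_logp) - (lose.logp - lose.ref_logp)
    return -log_sigmoid(beta * margin)


def merge(parts: List[Part]) -> Part:
    # Assumption: molecule-level quantities are sums over substructures.
    return Part(
        logp=sum(p.logp for p in parts),
        ref_logp=sum(p.ref_logp for p in parts),
        score=sum(p.score for p in parts),
    )


def ordered(a: Part, b: Part) -> Tuple[Part, Part]:
    # Return (preferred, rejected) according to the score.
    return (a, b) if a.score >= b.score else (b, a)


def decomposed_dpo_loss(
    mol_a: List[Part],
    mol_b: List[Part],
    beta: float = 0.1,
    local_weight: float = 1.0,
) -> float:
    # Global term: one preference over the two whole molecules.
    global_term = dpo_loss(*ordered(merge(mol_a), merge(mol_b)), beta)
    # Local terms: one preference per matched pair of substructures, so the
    # locally preferred part may come from the globally rejected molecule.
    local_terms = [dpo_loss(*ordered(pa, pb), beta) for pa, pb in zip(mol_a, mol_b)]
    return global_term + local_weight * sum(local_terms) / len(local_terms)
```

When the fine-tuned model equals the reference model, every margin is zero and each term reduces to log 2, which gives a convenient sanity check for an implementation along these lines.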