{"title":"Decomposed Direct Preference Optimization for Structure-Based Drug Design","authors":"Xiwei Cheng, Xiangxin Zhou, Yuwei Yang, Yu Bao, Quanquan Gu","doi":"arxiv-2407.13981","DOIUrl":null,"url":null,"abstract":"Diffusion models have achieved promising results for Structure-Based Drug\nDesign (SBDD). Nevertheless, high-quality protein subpocket and ligand data are\nrelatively scarce, which hinders the models' generation capabilities. Recently,\nDirect Preference Optimization (DPO) has emerged as a pivotal tool for the\nalignment of generative models such as large language models and diffusion\nmodels, providing greater flexibility and accuracy by directly aligning model\noutputs with human preferences. Building on this advancement, we introduce DPO\nto SBDD in this paper. We tailor diffusion models to pharmaceutical needs by\naligning them with elaborately designed chemical score functions. We propose a\nnew structure-based molecular optimization method called DecompDPO, which\ndecomposes the molecule into arms and scaffolds and performs preference\noptimization at both local substructure and global molecule levels, allowing\nfor more precise control with fine-grained preferences. Notably, DecompDPO can\nbe effectively used for two main purposes: (1) fine-tuning pretrained diffusion\nmodels for molecule generation across various protein families, and (2)\nmolecular optimization given a specific protein subpocket after generation.\nExtensive experiments on the CrossDocked2020 benchmark show that DecompDPO\nsignificantly improves model performance in both molecule generation and\noptimization, with up to 100% Median High Affinity and a 54.9% Success Rate.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"47 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.13981","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Diffusion models have achieved promising results for Structure-Based Drug
Design (SBDD). Nevertheless, high-quality protein subpocket and ligand data are
relatively scarce, which hinders the models' generation capabilities. Recently,
Direct Preference Optimization (DPO) has emerged as a pivotal tool for the
alignment of generative models such as large language models and diffusion
models, providing greater flexibility and accuracy by directly aligning model
outputs with human preferences. Building on this advancement, we introduce DPO
to SBDD in this paper. We tailor diffusion models to pharmaceutical needs by
aligning them with elaborately designed chemical score functions. We propose a
new structure-based molecular optimization method called DecompDPO, which
decomposes the molecule into arms and scaffolds and performs preference
optimization at both local substructure and global molecule levels, allowing
for more precise control with fine-grained preferences. Notably, DecompDPO can
be effectively used for two main purposes: (1) fine-tuning pretrained diffusion
models for molecule generation across various protein families, and (2)
molecular optimization given a specific protein subpocket after generation.
Extensive experiments on the CrossDocked2020 benchmark show that DecompDPO
significantly improves model performance in both molecule generation and
optimization, with up to 100% Median High Affinity and a 54.9% Success Rate.