{"title":"神经震荡:神经微分方程中的几何约束。","authors":"Justin S. Diamond, Markus A. Lill","doi":"10.1186/s13321-025-01053-w","DOIUrl":null,"url":null,"abstract":"<p>Generating accurate molecular conformations hinges on sampling effectively from a high-dimensional space of atomic arrangements, which grows exponentially with system size. To ensure physically valid geometries and increase the likelihood of reaching low-energy conformations, it is us ful to incorporate prior physicsbased information by recasting them as geometric constraints that naturally arise as nonlinear constraint satisfaction problems. In this work, we propose an approach to embed these strict constraints into neural differential equations, leveraging the denoising diffusion framework. By projecting the stochastic generative dynamics onto a manifold defined by constraint sets, our method enforces exact feasibility at each step, unlike alternative approaches that merely impose soft constraints through probabilistic guidance. This technique generates lower-energy molecular conformations, enables more efficient subspace exploration, and formally subsumes classifier-guidance-type methods by treating geometric constraints as strict algebraic conditions within the diffusion process.</p><p>Neural SHAKE formulates exact manifold‑projected score‑based diffusion : each reverse-SDEincrement is orthogonally projected, via a Lagrange-multiplier solve, onto the constraint surfaceσₐ(x)=0 for a = 1,…, A, with A the number of independent constraints and thus the manifold’scodimension . This projection preserves global SE(3) symmetry and enforces constraints tosolver tolerance. It induces a well-posed surface Fokker–Planck flow on the (3 N − A)-dimensional manifold, while a coarea/Fixman Jacobian carries the ambient 3 N-dimensionaldensity to a normalized density on that manifold, preserving probability mass after the dimensionality reduction.</p>","PeriodicalId":617,"journal":{"name":"Journal of Cheminformatics","volume":"17 1","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://jcheminf.biomedcentral.com/counter/pdf/10.1186/s13321-025-01053-w","citationCount":"0","resultStr":"{\"title\":\"Neural SHAKE: geometric constraints in neural differential equations\",\"authors\":\"Justin S. Diamond, Markus A. Lill\",\"doi\":\"10.1186/s13321-025-01053-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Generating accurate molecular conformations hinges on sampling effectively from a high-dimensional space of atomic arrangements, which grows exponentially with system size. To ensure physically valid geometries and increase the likelihood of reaching low-energy conformations, it is us ful to incorporate prior physicsbased information by recasting them as geometric constraints that naturally arise as nonlinear constraint satisfaction problems. In this work, we propose an approach to embed these strict constraints into neural differential equations, leveraging the denoising diffusion framework. By projecting the stochastic generative dynamics onto a manifold defined by constraint sets, our method enforces exact feasibility at each step, unlike alternative approaches that merely impose soft constraints through probabilistic guidance. This technique generates lower-energy molecular conformations, enables more efficient subspace exploration, and formally subsumes classifier-guidance-type methods by treating geometric constraints as strict algebraic conditions within the diffusion process.</p><p>Neural SHAKE formulates exact manifold‑projected score‑based diffusion : each reverse-SDEincrement is orthogonally projected, via a Lagrange-multiplier solve, onto the constraint surfaceσₐ(x)=0 for a = 1,…, A, with A the number of independent constraints and thus the manifold’scodimension . This projection preserves global SE(3) symmetry and enforces constraints tosolver tolerance. It induces a well-posed surface Fokker–Planck flow on the (3 N − A)-dimensional manifold, while a coarea/Fixman Jacobian carries the ambient 3 N-dimensionaldensity to a normalized density on that manifold, preserving probability mass after the dimensionality reduction.</p>\",\"PeriodicalId\":617,\"journal\":{\"name\":\"Journal of Cheminformatics\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-08-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://jcheminf.biomedcentral.com/counter/pdf/10.1186/s13321-025-01053-w\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Cheminformatics\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://link.springer.com/article/10.1186/s13321-025-01053-w\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cheminformatics","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1186/s13321-025-01053-w","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
生成精确的分子构象取决于从原子排列的高维空间有效地采样,原子排列随系统大小呈指数增长。为了确保物理上有效的几何形状并增加达到低能量构象的可能性,我们可以通过将先前的基于物理的信息重新转换为自然出现的非线性约束满足问题的几何约束来合并它们。在这项工作中,我们提出了一种利用去噪扩散框架将这些严格约束嵌入神经微分方程的方法。通过将随机生成动力学投射到约束集定义的流形上,我们的方法在每一步都强制执行精确的可行性,而不像其他方法仅仅通过概率指导施加软约束。该技术产生低能分子构象,实现更有效的子空间探索,并通过将几何约束作为扩散过程中的严格代数条件正式纳入分类器引导型方法。科学贡献:Neural SHAKE制定了精确的流形投影分数为基础的扩散:每个反向sdeincrement通过拉格朗日乘子解正交投影到约束表面σ (x)=0, a = 1,…,a,其中a是独立约束的数量,因此是流形的scodimension。该投影保留了全局SE(3)对称性,并对求解容忍度施加了约束。它在(3n - a)维流形上诱导了一个适定的表面Fokker-Planck流,而coarea/Fixman雅可比矩阵将周围的3n维密度携带到该流形上的标准化密度,在降维后保留了概率质量。
Neural SHAKE: geometric constraints in neural differential equations
Generating accurate molecular conformations hinges on sampling effectively from a high-dimensional space of atomic arrangements, which grows exponentially with system size. To ensure physically valid geometries and increase the likelihood of reaching low-energy conformations, it is us ful to incorporate prior physicsbased information by recasting them as geometric constraints that naturally arise as nonlinear constraint satisfaction problems. In this work, we propose an approach to embed these strict constraints into neural differential equations, leveraging the denoising diffusion framework. By projecting the stochastic generative dynamics onto a manifold defined by constraint sets, our method enforces exact feasibility at each step, unlike alternative approaches that merely impose soft constraints through probabilistic guidance. This technique generates lower-energy molecular conformations, enables more efficient subspace exploration, and formally subsumes classifier-guidance-type methods by treating geometric constraints as strict algebraic conditions within the diffusion process.
Neural SHAKE formulates exact manifold‑projected score‑based diffusion : each reverse-SDEincrement is orthogonally projected, via a Lagrange-multiplier solve, onto the constraint surfaceσₐ(x)=0 for a = 1,…, A, with A the number of independent constraints and thus the manifold’scodimension . This projection preserves global SE(3) symmetry and enforces constraints tosolver tolerance. It induces a well-posed surface Fokker–Planck flow on the (3 N − A)-dimensional manifold, while a coarea/Fixman Jacobian carries the ambient 3 N-dimensionaldensity to a normalized density on that manifold, preserving probability mass after the dimensionality reduction.
期刊介绍:
Journal of Cheminformatics is an open access journal publishing original peer-reviewed research in all aspects of cheminformatics and molecular modelling.
Coverage includes, but is not limited to:
chemical information systems, software and databases, and molecular modelling,
chemical structure representations and their use in structure, substructure, and similarity searching of chemical substance and chemical reaction databases,
computer and molecular graphics, computer-aided molecular design, expert systems, QSAR, and data mining techniques.