Combining Hammett σ constants for Δ-machine learning and catalyst discovery†

Impact factor: 6.2; JCR Q1 (Chemistry, Multidisciplinary)
V. Diana Rakotonirina, Marco Bragato, Stefan Heinen and O. Anatole von Lilienfeld
Digital Discovery, 12, pp. 2487-2496. Published 2024-10-23. DOI: 10.1039/D4DD00228H
Article: https://pubs.rsc.org/en/content/articlelanding/2024/dd/d4dd00228h
PDF: https://pubs.rsc.org/en/content/articlepdf/2024/dd/d4dd00228h?page=search

Abstract

We study the applicability of the Hammett-inspired product (HIP) Ansatz to modeling relative substrate binding in homogeneous organometallic catalysis, assigning σ to ligands and ρ to metals. An additive combination (c) rule yields σ constants for any ligand pair, and the resulting cHIP model improves data efficiency in computational ligand tuning. We show its usefulness (i) as a baseline for Δ-machine learning (ML) and (ii) for identifying novel catalyst candidates via volcano plots. After testing the combination rule on Hammett constants previously published in the literature, we generate numerical evidence for the Suzuki–Miyaura (SM) C–C cross-coupling reaction using two synthetic datasets of metallic catalysts, comprising the group 10 metals Ni, Pd, and Pt, the group 11 metals Cu, Ag, and Au, and 96 ligands such as N-heterocyclic carbenes, phosphines, and pyridines. With cHIP as the baseline, Δ-ML prediction errors for relative binding decrease systematically with training set size and reach chemical accuracy (∼1 kcal mol⁻¹) at 20k training instances. Using the individual ligand constants obtained from cHIP, we report relative substrate binding for a novel dataset of 720 catalysts (not part of the training data), of which 145 fall into the most promising range of the volcano plot, which accounts for the oxidative addition, transmetalation, and reductive elimination steps. These promising candidates include multiple Ni-based catalysts, e.g. Aphos-Ni-P(t-Bu)₃, potentially offering dramatic cost savings in experimental applications.
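
The abstract does not spell out the functional form, so the following is only a minimal sketch of how the pieces could fit together. It assumes that the product Ansatz reads "relative binding ≈ ρ(metal) × σ(ligands)", that the additive combination rule means the σ of a ligand pair is the sum of the two single-ligand σ values, and that Δ-ML learns the residual between a reference value and this baseline. All names (`SIGMA`, `RHO`, `chip_baseline`, `delta_target`, the ligand labels) and all numbers are hypothetical placeholders, not values or code from the paper.

```python
from typing import Dict

# Hypothetical, made-up constants for illustration only (not from the paper).
SIGMA: Dict[str, float] = {"L_a": 0.5, "L_b": -0.25, "L_c": 1.0}  # one sigma per ligand
RHO: Dict[str, float] = {"Ni": 2.0, "Pd": 4.0}                    # one rho per metal


def sigma_pair(l1: str, l2: str) -> float:
    """Assumed additive combination rule: sigma(l1, l2) = sigma(l1) + sigma(l2)."""
    return SIGMA[l1] + SIGMA[l2]


def chip_baseline(metal: str, l1: str, l2: str) -> float:
    """Assumed product Ansatz: relative binding ~ rho(metal) * sigma(l1, l2)."""
    return RHO[metal] * sigma_pair(l1, l2)


def delta_target(reference: float, metal: str, l1: str, l2: str) -> float:
    """Residual a Delta-ML model would be trained on: reference minus baseline."""
    return reference - chip_baseline(metal, l1, l2)


def delta_predict(ml_correction: float, metal: str, l1: str, l2: str) -> float:
    """Final Delta-ML estimate: baseline plus the learned correction."""
    return chip_baseline(metal, l1, l2) + ml_correction


if __name__ == "__main__":
    # Baseline estimate for a hypothetical Pd catalyst with ligands L_a and L_c.
    print(chip_baseline("Pd", "L_a", "L_c"))
    # Residual left for the ML model if the reference value were 7.0.
    print(delta_target(7.0, "Pd", "L_a", "L_c"))
```

Under these assumptions, n single-ligand constants are enough to cover all n(n+1)/2 unordered ligand pairs, which is one way to read the data-efficiency claim: the baseline needs one parameter per ligand and per metal rather than one per catalyst.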
