Coupling causality and interpretable machine learning to reveal the reaction coordinate of C–N coupling with a supramolecular Cu-calix[8]arene catalyst
R. A. Talmazan, J. Gamper, I. Castillo, T. S. Hofer and M. Podewitz
{"title":"Coupling causality and interpretable machine learning to reveal the reaction coordinate of C–N coupling with a supramolecular Cu-calix[8]arene catalyst","authors":"R. A. Talmazan, J. Gamper, I. Castillo, T. S. Hofer and M. Podewitz","doi":"10.1039/D5DD00216H","DOIUrl":null,"url":null,"abstract":"<p >Supramolecular 3d transition-metal catalysts are large, flexible systems with intricate interactions, resulting in complex reaction coordinates. To capture their dynamic nature, we developed a broadly applicable, high-throughput workflow, that leverages quantum mechanics/molecular mechanics molecular dynamics (QM/MM MD) in explicit solvent, to investigate a Cu(<small>I</small>)-calix[8]arene-catalysed C–N coupling reaction. The system complexity and high amount of data generated from sampling the reaction requires automated analyses. To identify and quantify the reaction coordinate from noisy simulation trajectories, we applied interpretable machine learning techniques (Lasso, Random Forest, Logistic Regression) in a consensus model, alongside dimensionality reduction methods (PCA, LDA, tICA). By employing a Granger Causality model, we move beyond the traditional view of a reaction coordinate, by defining it instead as a sequence of molecular motions leading up to the reaction.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 10","pages":" 2954-2971"},"PeriodicalIF":6.2000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12421827/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital discovery","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/dd/d5dd00216h","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Supramolecular 3d transition-metal catalysts are large, flexible systems with intricate interactions, resulting in complex reaction coordinates. To capture their dynamic nature, we developed a broadly applicable, high-throughput workflow, that leverages quantum mechanics/molecular mechanics molecular dynamics (QM/MM MD) in explicit solvent, to investigate a Cu(I)-calix[8]arene-catalysed C–N coupling reaction. The system complexity and high amount of data generated from sampling the reaction requires automated analyses. To identify and quantify the reaction coordinate from noisy simulation trajectories, we applied interpretable machine learning techniques (Lasso, Random Forest, Logistic Regression) in a consensus model, alongside dimensionality reduction methods (PCA, LDA, tICA). By employing a Granger Causality model, we move beyond the traditional view of a reaction coordinate, by defining it instead as a sequence of molecular motions leading up to the reaction.