Blake E. Ocampo, , , Bilal Altundas, , , Matthew J. Bock, , , Sara Feiz, , and , Scott E. Denmark*,
{"title":"Data-Driven Prediction of Enantioselectivity for the Sharpless Asymmetric Dihydroxylation: Model Development and Experimental Validation","authors":"Blake E. Ocampo, , , Bilal Altundas, , , Matthew J. Bock, , , Sara Feiz, , and , Scott E. Denmark*, ","doi":"10.1021/acscentsci.5c00900","DOIUrl":null,"url":null,"abstract":"<p >The Sharpless asymmetric dihydroxylation remains a key transformation in chemical synthesis, yet its success hides unexpected cases of lower selectivity. A chemoinformatic workflow was developed to allow data-driven analysis of the reaction. A database of 1007 reactions employing AD-mix α and β was curated from the literature, and an alignment-dependent, fragment-based featurization of alkenes was implemented for modeling. This platform converged on machine learning models capable of predicting the magnitude of enantioselectivity for multiple alkene classes, achieving <i>Q</i><sup>2</sup><sub>F3</sub> values ≥ 0.8, test <i>r</i><sup>2</sup> values ≥ 0.7 and mean absolute errors (MAE) ≤ 0.3 kcal/mol. The features of alkenes contributing to model performance were assessed with SHapley Additive exPlanations (SHAP) analysis to gather insight into factors underlying predictions. Experimental validation demonstrated that the models could achieve meaningful predictions on out-of-sample alkenes.</p><p >A data-driven approach was designed to analyze the Sharpless Asymmetric Dihydroxylation for insight into factors driving enantioselectivity and high-performing models were experimentally validated</p>","PeriodicalId":10,"journal":{"name":"ACS Central Science","volume":"11 9","pages":"1640–1650"},"PeriodicalIF":10.4000,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/pdf/10.1021/acscentsci.5c00900","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Central Science","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acscentsci.5c00900","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The Sharpless asymmetric dihydroxylation remains a key transformation in chemical synthesis, yet its success hides unexpected cases of lower selectivity. A chemoinformatic workflow was developed to allow data-driven analysis of the reaction. A database of 1007 reactions employing AD-mix α and β was curated from the literature, and an alignment-dependent, fragment-based featurization of alkenes was implemented for modeling. This platform converged on machine learning models capable of predicting the magnitude of enantioselectivity for multiple alkene classes, achieving Q2F3 values ≥ 0.8, test r2 values ≥ 0.7 and mean absolute errors (MAE) ≤ 0.3 kcal/mol. The features of alkenes contributing to model performance were assessed with SHapley Additive exPlanations (SHAP) analysis to gather insight into factors underlying predictions. Experimental validation demonstrated that the models could achieve meaningful predictions on out-of-sample alkenes.
A data-driven approach was designed to analyze the Sharpless Asymmetric Dihydroxylation for insight into factors driving enantioselectivity and high-performing models were experimentally validated
期刊介绍:
ACS Central Science publishes significant primary reports on research in chemistry and allied fields where chemical approaches are pivotal. As the first fully open-access journal by the American Chemical Society, it covers compelling and important contributions to the broad chemistry and scientific community. "Central science," a term popularized nearly 40 years ago, emphasizes chemistry's central role in connecting physical and life sciences, and fundamental sciences with applied disciplines like medicine and engineering. The journal focuses on exceptional quality articles, addressing advances in fundamental chemistry and interdisciplinary research.