{"title":"非对称催化选择性预测的元学习方法","authors":"Sukriti Singh, José Miguel Hernández-Lobato","doi":"10.1038/s41467-025-58854-8","DOIUrl":null,"url":null,"abstract":"<p>Transition metal-catalyzed asymmetric reactions are of high contemporary importance in organic synthesis. Recently, machine learning (ML) has shown promise in accelerating the development of newer catalytic protocols. However, the need for large amount of experimental data can present a bottleneck for implementing ML models. Here, we propose a meta-learning workflow that can harness the literature-derived data to extract shared reaction features and requires only a few examples to predict the outcome of new reactions. Prototypical networks are used as a meta-learning method to predict the enantioselectivity of asymmetric hydrogenation of olefins. This meta-learning model consistently provides significant performance improvement over other popular ML methods such as random forests and graph neural networks. The performance of our meta-model is analyzed with varying sizes of training examples to demonstrate its utility even with limited data. A good model performance on an out-of-sample test set further indicates the general applicability of our approach. We believe this work will provide a leap forward in identifying promising reactions in the early phases of reaction development when minimal data is available.</p>","PeriodicalId":19066,"journal":{"name":"Nature Communications","volume":"17 1","pages":""},"PeriodicalIF":15.7000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A meta-learning approach for selectivity prediction in asymmetric catalysis\",\"authors\":\"Sukriti Singh, José Miguel Hernández-Lobato\",\"doi\":\"10.1038/s41467-025-58854-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Transition metal-catalyzed asymmetric reactions are of high contemporary importance in organic synthesis. Recently, machine learning (ML) has shown promise in accelerating the development of newer catalytic protocols. However, the need for large amount of experimental data can present a bottleneck for implementing ML models. Here, we propose a meta-learning workflow that can harness the literature-derived data to extract shared reaction features and requires only a few examples to predict the outcome of new reactions. Prototypical networks are used as a meta-learning method to predict the enantioselectivity of asymmetric hydrogenation of olefins. This meta-learning model consistently provides significant performance improvement over other popular ML methods such as random forests and graph neural networks. The performance of our meta-model is analyzed with varying sizes of training examples to demonstrate its utility even with limited data. A good model performance on an out-of-sample test set further indicates the general applicability of our approach. We believe this work will provide a leap forward in identifying promising reactions in the early phases of reaction development when minimal data is available.</p>\",\"PeriodicalId\":19066,\"journal\":{\"name\":\"Nature Communications\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":15.7000,\"publicationDate\":\"2025-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature Communications\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41467-025-58854-8\",\"RegionNum\":1,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Communications","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41467-025-58854-8","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
A meta-learning approach for selectivity prediction in asymmetric catalysis
Transition metal-catalyzed asymmetric reactions are of high contemporary importance in organic synthesis. Recently, machine learning (ML) has shown promise in accelerating the development of newer catalytic protocols. However, the need for large amount of experimental data can present a bottleneck for implementing ML models. Here, we propose a meta-learning workflow that can harness the literature-derived data to extract shared reaction features and requires only a few examples to predict the outcome of new reactions. Prototypical networks are used as a meta-learning method to predict the enantioselectivity of asymmetric hydrogenation of olefins. This meta-learning model consistently provides significant performance improvement over other popular ML methods such as random forests and graph neural networks. The performance of our meta-model is analyzed with varying sizes of training examples to demonstrate its utility even with limited data. A good model performance on an out-of-sample test set further indicates the general applicability of our approach. We believe this work will provide a leap forward in identifying promising reactions in the early phases of reaction development when minimal data is available.
期刊介绍:
Nature Communications, an open-access journal, publishes high-quality research spanning all areas of the natural sciences. Papers featured in the journal showcase significant advances relevant to specialists in each respective field. With a 2-year impact factor of 16.6 (2022) and a median time of 8 days from submission to the first editorial decision, Nature Communications is committed to rapid dissemination of research findings. As a multidisciplinary journal, it welcomes contributions from biological, health, physical, chemical, Earth, social, mathematical, applied, and engineering sciences, aiming to highlight important breakthroughs within each domain.