{"title":"Diverging errors: A comparison of DFT and machine-learning predictions of NMR shieldings","authors":"Ema Chaloupecká , Ondřej Socha , Martin Dračínský","doi":"10.1016/j.ssnmr.2025.102019","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate prediction of NMR parameters from first principles is essential for the structural characterization of molecular solids. Recent studies have shown that single-molecule correction schemes—based on hybrid DFT calculations—can significantly improve the accuracy of periodic DFT predictions of nuclear shieldings. Here, we evaluate the performance of this correction approach not only for periodic DFT calculations but also for ShiftML2, a machine-learning model trained on PBE-calculated NMR data. For <sup>13</sup>C nuclei, the application of single-molecule PBE0 corrections to periodic PBE shieldings has reduced the root-mean-square deviation (RMSD) from 2.18 to 1.20 ppm, with negligible improvement observed for <sup>1</sup>H. When applied to ShiftML2 predictions, the corrections have yielded a smaller reduction in <sup>13</sup>C RMSD (from 3.02 to 2.51 ppm); again, they have had minimal impact on <sup>1</sup>H predictions. Residual analysis has revealed weak correlation between DFT and ML errors, suggesting that while some sources of systematic deviation may be shared, others are likely distinct. These results demonstrate that DFT-specific correction schemes do not straightforwardly translate to machine-learning models, highlighting the need for ML-tailored post-processing or retraining strategies. The findings have important implications for the integration of machine learning into high-throughput NMR workflows and the development of more accurate predictive tools for solid-state spectroscopy.</div></div>","PeriodicalId":21937,"journal":{"name":"Solid state nuclear magnetic resonance","volume":"138 ","pages":"Article 102019"},"PeriodicalIF":1.8000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Solid state nuclear magnetic resonance","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926204025000359","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate prediction of NMR parameters from first principles is essential for the structural characterization of molecular solids. Recent studies have shown that single-molecule correction schemes—based on hybrid DFT calculations—can significantly improve the accuracy of periodic DFT predictions of nuclear shieldings. Here, we evaluate the performance of this correction approach not only for periodic DFT calculations but also for ShiftML2, a machine-learning model trained on PBE-calculated NMR data. For 13C nuclei, the application of single-molecule PBE0 corrections to periodic PBE shieldings has reduced the root-mean-square deviation (RMSD) from 2.18 to 1.20 ppm, with negligible improvement observed for 1H. When applied to ShiftML2 predictions, the corrections have yielded a smaller reduction in 13C RMSD (from 3.02 to 2.51 ppm); again, they have had minimal impact on 1H predictions. Residual analysis has revealed weak correlation between DFT and ML errors, suggesting that while some sources of systematic deviation may be shared, others are likely distinct. These results demonstrate that DFT-specific correction schemes do not straightforwardly translate to machine-learning models, highlighting the need for ML-tailored post-processing or retraining strategies. The findings have important implications for the integration of machine learning into high-throughput NMR workflows and the development of more accurate predictive tools for solid-state spectroscopy.
期刊介绍:
The journal Solid State Nuclear Magnetic Resonance publishes original manuscripts of high scientific quality dealing with all experimental and theoretical aspects of solid state NMR. This includes advances in instrumentation, development of new experimental techniques and methodology, new theoretical insights, new data processing and simulation methods, and original applications of established or novel methods to scientific problems.