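Benchmarking Basis Sets for Density Functional Theory Thermochemistry Calculations: Why unpolarised basis sets and the polarised 6-311G family should be avoided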

Samuel J. Pitman, Alicia K. Evans, Robbie T. Ireland, Felix Lempriere, Laura K. McKemmish
arXiv - PHYS - Chemical Physics. Published 2024-09-06. DOI: arxiv-2409.03964 (https://doi.org/arxiv-2409.03964)
Citations: 0

Abstract

Basis sets are a crucial but often overlooked choice when setting up quantum chemistry calculations. The choice of basis set can be critical in determining both the accuracy and the calculation time of a quantum chemistry calculation. Clear recommendations based on thorough benchmarking are essential, but are not currently readily available. This study investigates the relative quality of basis sets for general properties by benchmarking basis set performance for a diverse set of 136 reactions (from the diet-150-GMTKN55 dataset). In our analysis, we find that the distributions of errors are often significantly non-Gaussian, meaning that the joint consideration of median errors, mean absolute errors and outlier statistics is helpful to provide a holistic understanding of basis set performance. Our direct comparison of performance between most modern basis sets provides quantitative evidence for basis set recommendations that broadly align with the established understanding of basis set experts and are evident in the design of modern basis sets. For example, while zeta is a good measure of quality, it is not the only determining factor for an accurate calculation: unpolarised double- and triple-zeta basis sets (like 6-31G and 6-311G) perform very poorly. Appropriate use of polarisation functions (e.g. 6-31G*) is essential to obtain the accuracy offered by double- or triple-zeta basis sets. In our study, the best-performing double- and triple-zeta basis sets are 6-31++G** and pcseg-2, respectively. The polarised 6-311G basis set family is poorly parameterised, which means its performance is more like that of a double-zeta than a triple-zeta basis set. All versions of the 6-311G basis set family should be avoided entirely for valence chemistry calculations moving forward.
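As a rough illustration of why the abstract recommends reading median errors, mean absolute errors and outlier statistics together, the sketch below summarises a small set of reaction-energy errors. The error values are invented for illustration and are not taken from the study. The function name `summarise_errors` and the interquartile-range fence used to flag outliers are assumptions of this sketch, not the authors' definitions.

```python
import statistics


def summarise_errors(errors, outlier_factor=1.5):
    """Summarise signed errors (e.g. in kcal/mol) for one basis set.

    Outliers are flagged with an upper IQR fence on the absolute errors.
    That fence is an assumption of this sketch, not the paper's definition.
    """
    abs_errors = [abs(e) for e in errors]
    q1, _, q3 = statistics.quantiles(abs_errors, n=4)
    upper_fence = q3 + outlier_factor * (q3 - q1)
    return {
        "median_abs_error": statistics.median(abs_errors),
        "mean_abs_error": statistics.fmean(abs_errors),
        "max_abs_error": max(abs_errors),
        "n_outliers": sum(1 for e in abs_errors if e > upper_fence),
    }


# Hypothetical errors: seven well-behaved reactions and one badly described one.
errors = [0.2, -0.4, 0.1, 0.3, -0.2, 0.5, -0.3, 12.0]
summary = summarise_errors(errors)
print(summary)
```

With a heavy-tailed distribution like this one, the single large error pulls the mean absolute error well above the median, so neither number alone describes how the basis set behaves on a typical reaction.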