Anastasia Croitoru, Anmol Kumar, Jean-Christophe Lambry, Jihyeon Lee, Suliman Sharif, Wenbo Yu, Alexander D MacKerell, Alexey Aleksandrov
{"title":"Increasing the Accuracy and Robustness of the CHARMM General Force Field with an Expanded Training Set.","authors":"Anastasia Croitoru, Anmol Kumar, Jean-Christophe Lambry, Jihyeon Lee, Suliman Sharif, Wenbo Yu, Alexander D MacKerell, Alexey Aleksandrov","doi":"10.1021/acs.jctc.5c00046","DOIUrl":null,"url":null,"abstract":"<p><p>Small molecule empirical force fields (FFs), including the CHARMM General Force Field (CGenFF), are designed to have wide coverage of organic molecules and to rapidly assign parameters to molecules not explicitly included in the FF. Assignment of parameters to new molecules in CGenFF is based on a trained bond-angle-dihedral charge increment linear interpolation scheme for the partial atomic charges along with bonded parameters assigned based on analogy using a rules-based penalty score scheme associated with atom types and chemical connectivity. Accordingly, the accuracy of CGenFF is related to the extent of the training set of available parameters. In the present study that training set is extended by 1390 molecules selected to represent connectivities new to CGenFF training compounds. Quantum mechanical (QM) data for optimized geometries, bond, valence angle, and dihedral angle potential energy scans, interactions with water, molecular dipole moments, and electrostatic potentials were used as target data. The resultant bonded parameters and partial atomic charges were used to train a new version of the CGenFF program, v5.0, which was used to generate parameters for a validation set of molecules, including drug-like molecules approved by the FDA, which were then benchmarked against both experimental and QM data. CGenFF v5.0 shows overall improvements with respect to QM intramolecular geometries, vibrations, dihedral potential energy scans, dipole moments and interactions with water. Tests of pure solvent properties of 216 molecules show small improvements versus the previous release of CGenFF v2.5.1 reflecting the high quality of the Lennard-Jones parameters that were explicitly optimized during the initial optimization of both the CGenFF and the CHARMM36 force field. CGenFF v5.0 represents an improvement that is anticipated to more accurately model intramolecular geometries and strain energies as well as noncovalent interactions of drug-like and other organic molecules.</p>","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":" ","pages":"3044-3065"},"PeriodicalIF":5.5000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11938330/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Theory and Computation","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jctc.5c00046","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/3 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Small molecule empirical force fields (FFs), including the CHARMM General Force Field (CGenFF), are designed to have wide coverage of organic molecules and to rapidly assign parameters to molecules not explicitly included in the FF. Assignment of parameters to new molecules in CGenFF is based on a trained bond-angle-dihedral charge increment linear interpolation scheme for the partial atomic charges along with bonded parameters assigned based on analogy using a rules-based penalty score scheme associated with atom types and chemical connectivity. Accordingly, the accuracy of CGenFF is related to the extent of the training set of available parameters. In the present study that training set is extended by 1390 molecules selected to represent connectivities new to CGenFF training compounds. Quantum mechanical (QM) data for optimized geometries, bond, valence angle, and dihedral angle potential energy scans, interactions with water, molecular dipole moments, and electrostatic potentials were used as target data. The resultant bonded parameters and partial atomic charges were used to train a new version of the CGenFF program, v5.0, which was used to generate parameters for a validation set of molecules, including drug-like molecules approved by the FDA, which were then benchmarked against both experimental and QM data. CGenFF v5.0 shows overall improvements with respect to QM intramolecular geometries, vibrations, dihedral potential energy scans, dipole moments and interactions with water. Tests of pure solvent properties of 216 molecules show small improvements versus the previous release of CGenFF v2.5.1 reflecting the high quality of the Lennard-Jones parameters that were explicitly optimized during the initial optimization of both the CGenFF and the CHARMM36 force field. CGenFF v5.0 represents an improvement that is anticipated to more accurately model intramolecular geometries and strain energies as well as noncovalent interactions of drug-like and other organic molecules.
期刊介绍:
The Journal of Chemical Theory and Computation invites new and original contributions with the understanding that, if accepted, they will not be published elsewhere. Papers reporting new theories, methodology, and/or important applications in quantum electronic structure, molecular dynamics, and statistical mechanics are appropriate for submission to this Journal. Specific topics include advances in or applications of ab initio quantum mechanics, density functional theory, design and properties of new materials, surface science, Monte Carlo simulations, solvation models, QM/MM calculations, biomolecular structure prediction, and molecular dynamics in the broadest sense including gas-phase dynamics, ab initio dynamics, biomolecular dynamics, and protein folding. The Journal does not consider papers that are straightforward applications of known methods including DFT and molecular dynamics. The Journal favors submissions that include advances in theory or methodology with applications to compelling problems.