A multiple linear regression approach to the estimation of carboxylic acid ester and lactone alkaline hydrolysis rate constants.

IF 2.3 3区 环境科学与生态学 Q3 CHEMISTRY, MULTIDISCIPLINARY
J Lazare, C Tebes-Stevens, E J Weber
{"title":"A multiple linear regression approach to the estimation of carboxylic acid ester and lactone alkaline hydrolysis rate constants.","authors":"J Lazare, C Tebes-Stevens, E J Weber","doi":"10.1080/1062936X.2023.2188608","DOIUrl":null,"url":null,"abstract":"<p><p>Pesticides, pharmaceuticals, and other organic contaminants often undergo hydrolysis when released into the environment; therefore, measured or estimated hydrolysis rates are needed to assess their environmental persistence. An intuitive multiple linear regression (MLR) approach was used to develop robust QSARs for predicting base-catalyzed rate constants of carboxylic acid esters (CAEs) and lactones. We explored various combinations of independent descriptors, resulting in four primary models (two for lactones and two for CAEs), with a total of 15 and 11 parameters included in the CAE and lactone QSAR models, respectively. The most significant descriptors include p<i>K</i><sub>a</sub>, electronegativity, charge density, and steric parameters. Model performance is assessed using Drug Theoretics and Cheminformatics Laboratory's DTC-QSAR tool, demonstrating high accuracy for both internal validation (<i>r</i><sup>2</sup> = 0.93 and RMSE = 0.41-0.43 for CAEs; <i>r</i><sup>2</sup> = 0.90-0.93 and RMSE = 0.38-0.46 for lactones) and external validation (<i>r</i><sup>2</sup> = 0.93 and RMSE = 0.43-0.45 for CAEs; <i>r</i><sup>2</sup> = 0.94-0.98 and RMSE = 0.33-0.41 for lactones). The developed models require only low-cost computational resources and have substantially improved performance compared to existing hydrolysis rate prediction models (HYDROWIN and SPARC).</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":"34 3","pages":"183-210"},"PeriodicalIF":2.3000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10547131/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SAR and QSAR in Environmental Research","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1080/1062936X.2023.2188608","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/3/23 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Pesticides, pharmaceuticals, and other organic contaminants often undergo hydrolysis when released into the environment; therefore, measured or estimated hydrolysis rates are needed to assess their environmental persistence. An intuitive multiple linear regression (MLR) approach was used to develop robust QSARs for predicting base-catalyzed rate constants of carboxylic acid esters (CAEs) and lactones. We explored various combinations of independent descriptors, resulting in four primary models (two for lactones and two for CAEs), with a total of 15 and 11 parameters included in the CAE and lactone QSAR models, respectively. The most significant descriptors include pKa, electronegativity, charge density, and steric parameters. Model performance is assessed using Drug Theoretics and Cheminformatics Laboratory's DTC-QSAR tool, demonstrating high accuracy for both internal validation (r2 = 0.93 and RMSE = 0.41-0.43 for CAEs; r2 = 0.90-0.93 and RMSE = 0.38-0.46 for lactones) and external validation (r2 = 0.93 and RMSE = 0.43-0.45 for CAEs; r2 = 0.94-0.98 and RMSE = 0.33-0.41 for lactones). The developed models require only low-cost computational resources and have substantially improved performance compared to existing hydrolysis rate prediction models (HYDROWIN and SPARC).

羧酸酯和内酯碱性水解速率常数的多元线性回归估计方法。
农药、药品和其他有机污染物在释放到环境中时经常发生水解;因此,需要测量或估计水解速率来评估其环境持久性。使用直观的多元线性回归(MLR)方法开发了用于预测羧酸酯(CAEs)和内酯的碱催化速率常数的稳健QSAR。我们探索了独立描述符的各种组合,产生了四个主要模型(两个用于内酯,两个用于CAE),CAE和内酯QSAR模型中分别包含总共15个和11个参数。最重要的描述符包括pKa、电负性、电荷密度和空间参数。使用药物理论和化学信息学实验室的DTC-QSAR工具评估模型性能,证明了内部验证(CAE的r2=0.93和RMSE=0.41-0.43;内酯的r2=0.90-0.93和RMSE=0.038-0.46)和外部验证(CAEs的r2=0.73和RMSE=0.43-0.45;内酯的r2=0.94-0.98和RMSE0.33-0.41)的高准确性。所开发的模型只需要低成本的计算资源,并且与现有的水解速率预测模型(HYDROWIN和SPARC)相比具有显著提高的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.20
自引率
20.00%
发文量
78
审稿时长
>24 weeks
期刊介绍: SAR and QSAR in Environmental Research is an international journal welcoming papers on the fundamental and practical aspects of the structure-activity and structure-property relationships in the fields of environmental science, agrochemistry, toxicology, pharmacology and applied chemistry. A unique aspect of the journal is the focus on emerging techniques for the building of SAR and QSAR models in these widely varying fields. The scope of the journal includes, but is not limited to, the topics of topological and physicochemical descriptors, mathematical, statistical and graphical methods for data analysis, computer methods and programs, original applications and comparative studies. In addition to primary scientific papers, the journal contains reviews of books and software and news of conferences. Special issues on topics of current and widespread interest to the SAR and QSAR community will be published from time to time.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信