A comparative QSAR study of aryl-substituted isobenzofuran-1(3H)-ones inhibitors

Z. Rostami, E. Pourbasheer
{"title":"A comparative QSAR study of aryl-substituted isobenzofuran-1(3H)-ones inhibitors","authors":"Z. Rostami, E. Pourbasheer","doi":"10.33945/SAMI/ECC.2019.1.7","DOIUrl":null,"url":null,"abstract":"A comparative workflow, including linear and non-linear QSAR models, was carried out to evaluate the predictive accuracy of models and predict the inhibition activity of a series of aryl-substituted isobenzofuran-1(3H)-ones. The data set consisted of 34 compounds was classified into the training and test sets, randomly. Molecular descriptors were selected using the genetic algorithm (GA) as a feature selection tool. Various linear models based on multiple linear regression (MLR), principle component regression (PCR) and partial least square (PLS) and non-linear models based on artificial neural network (ANN), adaptive network-based fuzzy inference system (ANFIS) and support vector machine (SVM) methods were developed and compared. The accuracy of the models was studied by leave-one-out cross-validation (Q_LOO^2), Y-randomization test and group of compounds as external test set. Six descriptors were selected by GA to develop predictive models. With respect to the linear models, GA-PCR method was more accurate than the reset with statistical results of 〖 R〗_train^2=0.883, R_test^2=0.897,〖 R〗_(adj,train)^2=0.829,〖 R〗_(adj,test)^2=0.849,〖 F〗_train=24.07 and F_test=34.17. In case of non-linear models, GA-SVM (R_train^2=0.992 and R_test^2=0.997) showed high predictive accuracy for the inhibitory activity. It was found that the selected descriptors have the major roles in interpretation of biological activities of the compounds.","PeriodicalId":14624,"journal":{"name":"Iranian Chemical Communication","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2017-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Iranian Chemical Communication","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33945/SAMI/ECC.2019.1.7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

A comparative workflow, including linear and non-linear QSAR models, was carried out to evaluate the predictive accuracy of models and predict the inhibition activity of a series of aryl-substituted isobenzofuran-1(3H)-ones. The data set consisted of 34 compounds was classified into the training and test sets, randomly. Molecular descriptors were selected using the genetic algorithm (GA) as a feature selection tool. Various linear models based on multiple linear regression (MLR), principle component regression (PCR) and partial least square (PLS) and non-linear models based on artificial neural network (ANN), adaptive network-based fuzzy inference system (ANFIS) and support vector machine (SVM) methods were developed and compared. The accuracy of the models was studied by leave-one-out cross-validation (Q_LOO^2), Y-randomization test and group of compounds as external test set. Six descriptors were selected by GA to develop predictive models. With respect to the linear models, GA-PCR method was more accurate than the reset with statistical results of 〖 R〗_train^2=0.883, R_test^2=0.897,〖 R〗_(adj,train)^2=0.829,〖 R〗_(adj,test)^2=0.849,〖 F〗_train=24.07 and F_test=34.17. In case of non-linear models, GA-SVM (R_train^2=0.992 and R_test^2=0.997) showed high predictive accuracy for the inhibitory activity. It was found that the selected descriptors have the major roles in interpretation of biological activities of the compounds.
芳基取代异苯并呋喃-1(3H)- 1抑制剂的QSAR比较研究
通过比较线性和非线性QSAR模型,对模型的预测精度进行了评估,并对一系列芳基取代异苯并呋喃-1(3H)-的抑制活性进行了预测。数据集由34个化合物组成,随机分为训练集和测试集。采用遗传算法(GA)作为特征选择工具来选择分子描述符。建立并比较了基于多元线性回归(MLR)、主成分回归(PCR)和偏最小二乘(PLS)的各种线性模型,以及基于人工神经网络(ANN)、基于自适应网络的模糊推理系统(ANFIS)和支持向量机(SVM)方法的非线性模型。通过留一交叉验证(Q_LOO^2)、y随机化检验和化合物组作为外部测试集,研究了模型的准确性。通过遗传算法选择6个描述符建立预测模型。对于线性模型,GA-PCR法比重置法更准确,统计结果为〖R〗_train^2=0.883, R_test^2=0.897,〖R〗_(adj,train)^2=0.829,〖R〗_(adj,test)^2=0.849,〖F〗_train=24.07, F_test=34.17。在非线性模型下,GA-SVM (R_train^2=0.992和R_test^2=0.997)对抑制活性的预测准确率较高。结果表明,所选择的描述符在解释化合物的生物活性方面具有重要作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Iranian Chemical Communication
Iranian Chemical Communication CHEMISTRY, MULTIDISCIPLINARY-
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信