{"title":"利用支持向量回归算法构建配体与γ-氨基丁酸A型受体之间的QSAR模型","authors":"Shu Cheng, Yanrui Ding","doi":"10.1109/DCABES50732.2020.00060","DOIUrl":null,"url":null,"abstract":"Quantitative structure-activity relationship (QSAR) plays an important role in the prediction of biological activity based on machine learning. According to the characteristics of the binding interface between ligands and the γ-Aminobutyric acid type A (GABAA) receptor, we used random forest feature selection and support vector regression (SVR) to establish three QSAR models. The best QSAR model features include docking ligand molecular descriptors and ligand-receptor interactions. We also used Leave-One-Out-Cross-Validation (LOOCV) to select the appropriate value C = 2, g = 0.0221. The result of cross validation (QLOO2) is 0.8225, R2 of test set is 0.8326, and MSE is 0.0910. In addition, we found that BELm2, BELe2, Mor08v, Mor29m, refRMS and intermol _ energy are key features, which helps to build QSAR model more accurately.","PeriodicalId":351404,"journal":{"name":"2020 19th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Construction of QSAR model between the ligand and γ-Aminobutyric acid type A receptor using support vector regression algorithm\",\"authors\":\"Shu Cheng, Yanrui Ding\",\"doi\":\"10.1109/DCABES50732.2020.00060\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Quantitative structure-activity relationship (QSAR) plays an important role in the prediction of biological activity based on machine learning. According to the characteristics of the binding interface between ligands and the γ-Aminobutyric acid type A (GABAA) receptor, we used random forest feature selection and support vector regression (SVR) to establish three QSAR models. The best QSAR model features include docking ligand molecular descriptors and ligand-receptor interactions. We also used Leave-One-Out-Cross-Validation (LOOCV) to select the appropriate value C = 2, g = 0.0221. The result of cross validation (QLOO2) is 0.8225, R2 of test set is 0.8326, and MSE is 0.0910. In addition, we found that BELm2, BELe2, Mor08v, Mor29m, refRMS and intermol _ energy are key features, which helps to build QSAR model more accurately.\",\"PeriodicalId\":351404,\"journal\":{\"name\":\"2020 19th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 19th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DCABES50732.2020.00060\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 19th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DCABES50732.2020.00060","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
定量构效关系(Quantitative structure-activity relationship, QSAR)在基于机器学习的生物活性预测中发挥着重要作用。根据配体与γ-氨基丁酸A型(GABAA)受体结合界面的特点,采用随机森林特征选择和支持向量回归(SVR)方法建立了三种QSAR模型。最佳的QSAR模型特征包括对接配体分子描述符和配体-受体相互作用。我们还使用leave - one - out交叉验证(LOOCV)来选择合适的值C = 2, g = 0.0221。交叉验证结果(QLOO2)为0.8225,检验集R2为0.8326,MSE为0.0910。此外,我们发现BELm2、BELe2、Mor08v、Mor29m、refRMS和intermol能量是QSAR模型的关键特征,这有助于更准确地建立QSAR模型。
Construction of QSAR model between the ligand and γ-Aminobutyric acid type A receptor using support vector regression algorithm
Quantitative structure-activity relationship (QSAR) plays an important role in the prediction of biological activity based on machine learning. According to the characteristics of the binding interface between ligands and the γ-Aminobutyric acid type A (GABAA) receptor, we used random forest feature selection and support vector regression (SVR) to establish three QSAR models. The best QSAR model features include docking ligand molecular descriptors and ligand-receptor interactions. We also used Leave-One-Out-Cross-Validation (LOOCV) to select the appropriate value C = 2, g = 0.0221. The result of cross validation (QLOO2) is 0.8225, R2 of test set is 0.8326, and MSE is 0.0910. In addition, we found that BELm2, BELe2, Mor08v, Mor29m, refRMS and intermol _ energy are key features, which helps to build QSAR model more accurately.