{"title":"利用机器学习方法对血栓素 A2 合成酶抑制剂进行分类模型和 SAR 分析。","authors":"Y Ji, R Li, Y Tian, G Chen, A Yan","doi":"10.1080/1062936X.2022.2078880","DOIUrl":null,"url":null,"abstract":"<p><p>Thromboxane A<sub>2</sub> synthase (TXS) is a promising drug target for cardiovascular diseases and cancer. In this work, we conducted a structure-activity relationship (SAR) study on 526 TXS inhibitors for bioactivity prediction. Three types of descriptors (MACCS fingerprints, ECFP4 fingerprints, and MOE descriptors) were utilized to characterize inhibitors, 24 classification models were developed by support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and deep neural networks (DNN). Then we reduced the number of fingerprints according to the contribution of descriptors to the models, and constructed 16 extra models on simplified fingerprints. In general, Model_4D built by DNN algorithm and 67 bits MACCS fingerprints performs best. The prediction accuracy of the model on the test set is 0.969, and Matthews correlation coefficient (MCC) is 0.936. The distance between compound and model (d<sub>STD-PRO</sub>) was used to characterize the application domain of the model. In the test set of Model_4D, d<sub>STD-PRO</sub> of 91.5% compounds is lower than the corresponding training set threshold (threshold<sub>0.90</sub> = 0.1055), and the accuracy of these compounds is 0.983. In addition, the important descriptors were summarized and further analyzed. It showed that aromatic nitrogenous heterocyclic groups were beneficial to improve the bioactivity of TXS inhibitors.</p>","PeriodicalId":48388,"journal":{"name":"International Organization","volume":"28 1","pages":"429-462"},"PeriodicalIF":8.2000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classification models and SAR analysis on thromboxane A<sub>2</sub> synthase inhibitors by machine learning methods.\",\"authors\":\"Y Ji, R Li, Y Tian, G Chen, A Yan\",\"doi\":\"10.1080/1062936X.2022.2078880\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Thromboxane A<sub>2</sub> synthase (TXS) is a promising drug target for cardiovascular diseases and cancer. In this work, we conducted a structure-activity relationship (SAR) study on 526 TXS inhibitors for bioactivity prediction. Three types of descriptors (MACCS fingerprints, ECFP4 fingerprints, and MOE descriptors) were utilized to characterize inhibitors, 24 classification models were developed by support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and deep neural networks (DNN). Then we reduced the number of fingerprints according to the contribution of descriptors to the models, and constructed 16 extra models on simplified fingerprints. In general, Model_4D built by DNN algorithm and 67 bits MACCS fingerprints performs best. The prediction accuracy of the model on the test set is 0.969, and Matthews correlation coefficient (MCC) is 0.936. The distance between compound and model (d<sub>STD-PRO</sub>) was used to characterize the application domain of the model. In the test set of Model_4D, d<sub>STD-PRO</sub> of 91.5% compounds is lower than the corresponding training set threshold (threshold<sub>0.90</sub> = 0.1055), and the accuracy of these compounds is 0.983. In addition, the important descriptors were summarized and further analyzed. It showed that aromatic nitrogenous heterocyclic groups were beneficial to improve the bioactivity of TXS inhibitors.</p>\",\"PeriodicalId\":48388,\"journal\":{\"name\":\"International Organization\",\"volume\":\"28 1\",\"pages\":\"429-462\"},\"PeriodicalIF\":8.2000,\"publicationDate\":\"2022-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Organization\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1080/1062936X.2022.2078880\",\"RegionNum\":1,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2022/6/9 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"INTERNATIONAL RELATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Organization","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1080/1062936X.2022.2078880","RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/6/9 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"INTERNATIONAL RELATIONS","Score":null,"Total":0}
Classification models and SAR analysis on thromboxane A2 synthase inhibitors by machine learning methods.
Thromboxane A2 synthase (TXS) is a promising drug target for cardiovascular diseases and cancer. In this work, we conducted a structure-activity relationship (SAR) study on 526 TXS inhibitors for bioactivity prediction. Three types of descriptors (MACCS fingerprints, ECFP4 fingerprints, and MOE descriptors) were utilized to characterize inhibitors, 24 classification models were developed by support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and deep neural networks (DNN). Then we reduced the number of fingerprints according to the contribution of descriptors to the models, and constructed 16 extra models on simplified fingerprints. In general, Model_4D built by DNN algorithm and 67 bits MACCS fingerprints performs best. The prediction accuracy of the model on the test set is 0.969, and Matthews correlation coefficient (MCC) is 0.936. The distance between compound and model (dSTD-PRO) was used to characterize the application domain of the model. In the test set of Model_4D, dSTD-PRO of 91.5% compounds is lower than the corresponding training set threshold (threshold0.90 = 0.1055), and the accuracy of these compounds is 0.983. In addition, the important descriptors were summarized and further analyzed. It showed that aromatic nitrogenous heterocyclic groups were beneficial to improve the bioactivity of TXS inhibitors.
期刊介绍:
International Organization (IO) is a prominent peer-reviewed journal that comprehensively covers the field of international affairs. Its subject areas encompass foreign policies, international relations, political economy, security policies, environmental disputes, regional integration, alliance patterns, conflict resolution, economic development, and international capital movements. Continuously ranked among the top journals in the field, IO does not publish book reviews but instead features high-quality review essays that survey new developments, synthesize important ideas, and address key issues for future scholarship.