Xianquan Wang , Fengjing Nie , Zhan Gao , Guoliang Li , Dengshuai Zhang , Jinfeng Zhang , Peijian Zhang
{"title":"Studies on QSAR models for the anti-virus effect of oseltamivir derivatives targeting H5N1 based on Mix-Kernel support vector machine","authors":"Xianquan Wang , Fengjing Nie , Zhan Gao , Guoliang Li , Dengshuai Zhang , Jinfeng Zhang , Peijian Zhang","doi":"10.1016/j.chemolab.2024.105273","DOIUrl":null,"url":null,"abstract":"<div><div>To assist in design and synthesis of novel oseltamivir derivatives as to help yield more inhibitors with lower time and cost, four quantitative structure-activity relationship (QSAR) models on 83 ODs with methods of random forest (RF), extreme gradient boosting (XGBoost), radial basis kernel function support vector machine (RBF-SVM) and mix kernel function support vector machine (MIX-SVM) were established. In the study, five descriptors of the highest importance were identified by RF. To further examine the robustness and stability of the four models, leave-one-out cross validation was adapted to the models and R, Rwere used to measure the performance. The model built by MIX-SVM performed best on the dataset: R<sup>2</sup> on training set and test set were 0.963 and 0.961, mean squared error (MSE) on training set and test set were 0.055 and 0.054, and R, R were 0.918 and 0.889, respectively. Furthermore, five important descriptors were selected for analysis, and the modelling method with the application of MIX-SVM achieved great prediction performance, which could facilitate the design and synthesis of oseltamivir derivatives.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"261 ","pages":"Article 105273"},"PeriodicalIF":3.7000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemometrics and Intelligent Laboratory Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169743924002132","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Studies on QSAR models for the anti-virus effect of oseltamivir derivatives targeting H5N1 based on Mix-Kernel support vector machine
To assist in design and synthesis of novel oseltamivir derivatives as to help yield more inhibitors with lower time and cost, four quantitative structure-activity relationship (QSAR) models on 83 ODs with methods of random forest (RF), extreme gradient boosting (XGBoost), radial basis kernel function support vector machine (RBF-SVM) and mix kernel function support vector machine (MIX-SVM) were established. In the study, five descriptors of the highest importance were identified by RF. To further examine the robustness and stability of the four models, leave-one-out cross validation was adapted to the models and R, Rwere used to measure the performance. The model built by MIX-SVM performed best on the dataset: R2 on training set and test set were 0.963 and 0.961, mean squared error (MSE) on training set and test set were 0.055 and 0.054, and R, R were 0.918 and 0.889, respectively. Furthermore, five important descriptors were selected for analysis, and the modelling method with the application of MIX-SVM achieved great prediction performance, which could facilitate the design and synthesis of oseltamivir derivatives.
期刊介绍:
Chemometrics and Intelligent Laboratory Systems publishes original research papers, short communications, reviews, tutorials and Original Software Publications reporting on development of novel statistical, mathematical, or computer techniques in Chemistry and related disciplines.
Chemometrics is the chemical discipline that uses mathematical and statistical methods to design or select optimal procedures and experiments, and to provide maximum chemical information by analysing chemical data.
The journal deals with the following topics:
1) Development of new statistical, mathematical and chemometrical methods for Chemistry and related fields (Environmental Chemistry, Biochemistry, Toxicology, System Biology, -Omics, etc.)
2) Novel applications of chemometrics to all branches of Chemistry and related fields (typical domains of interest are: process data analysis, experimental design, data mining, signal processing, supervised modelling, decision making, robust statistics, mixture analysis, multivariate calibration etc.) Routine applications of established chemometrical techniques will not be considered.
3) Development of new software that provides novel tools or truly advances the use of chemometrical methods.
4) Well characterized data sets to test performance for the new methods and software.
The journal complies with International Committee of Medical Journal Editors'' Uniform requirements for manuscripts.