{"title":"Linear support vector machine to classify the vibrational modes for complex chemical systems","authors":"T. Le, T. Tran, Lam Huynh","doi":"10.1145/3184066.3184087","DOIUrl":null,"url":null,"abstract":"Classification of vibrational modes into hindered internal rotation (HIR) and harmonic oscillation modes is important to obtain correct thermodynamic data for a chemical species for a wide range of temperatures. In this study, we propose a multivariate linear support vector machine (SVM) model to solve this challenging binary classification problem. The results of the proposed model were found to be similar to those of logistic regression and 2-5% better than those of the rule-based method. Moreover, the number of features found by linear SVM was also fewer than that of logistic regression (five versus six), which makes it easier to be interpreted by chemists. The detailed explanation of such differences is also presented. The three models were implemented in the GUI of the Multi-Species Multi-Channel Software Suite (Duong et al., Int. J. Chem. Kinet, 2015, 564) to facilitate the determination of HIR modes as well as the calculation of thermodynamic properties for a chemical species of interest.","PeriodicalId":109559,"journal":{"name":"International Conference on Machine Learning and Soft Computing","volume":"79 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Machine Learning and Soft Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3184066.3184087","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Classification of vibrational modes into hindered internal rotation (HIR) and harmonic oscillation modes is important to obtain correct thermodynamic data for a chemical species for a wide range of temperatures. In this study, we propose a multivariate linear support vector machine (SVM) model to solve this challenging binary classification problem. The results of the proposed model were found to be similar to those of logistic regression and 2-5% better than those of the rule-based method. Moreover, the number of features found by linear SVM was also fewer than that of logistic regression (five versus six), which makes it easier to be interpreted by chemists. The detailed explanation of such differences is also presented. The three models were implemented in the GUI of the Multi-Species Multi-Channel Software Suite (Duong et al., Int. J. Chem. Kinet, 2015, 564) to facilitate the determination of HIR modes as well as the calculation of thermodynamic properties for a chemical species of interest.