Hitoshi Yamano, Hiroaki Shimizu, S. Kanaya, Tomoyuki Miyao, Aki Hirai, N. Ono
{"title":"Predicting and considering properties of general polymers using incomplete dataset","authors":"Hitoshi Yamano, Hiroaki Shimizu, S. Kanaya, Tomoyuki Miyao, Aki Hirai, N. Ono","doi":"10.1109/ISSM51728.2020.9377497","DOIUrl":null,"url":null,"abstract":"Polymer properties are usually more difficult to predict than those of small molecules due to them forming superstructures. In this work, we aimed at finding a versatile approach to predict multiple polymer properties using imperfect data with missing values. The dataset was hierarchically clustered on the basis of two independent factors: polymer properties and polymer structures. In polymer property-based clustering, visualizing relations of polymers was found to be an effective way of estimating the difficulty of polymer property prediction. In polymer structure-based clustering, each cluster could be formed based on the structural features. Thus, the clustering contributed to understanding structural characteristics of monomer unit structures. In addition to analyzing the data set in an unsupervised manner, we constructed polymer properties prediction models based solely on the information of monomer unit structures. Partial least squared (PLS) regression models could predict density, glass transition temperature and dissolution parameter with high accuracy. We also propose approach to evaluate obtained model using data already prepared.","PeriodicalId":270309,"journal":{"name":"2020 International Symposium on Semiconductor Manufacturing (ISSM)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Symposium on Semiconductor Manufacturing (ISSM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSM51728.2020.9377497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Polymer properties are usually more difficult to predict than those of small molecules due to them forming superstructures. In this work, we aimed at finding a versatile approach to predict multiple polymer properties using imperfect data with missing values. The dataset was hierarchically clustered on the basis of two independent factors: polymer properties and polymer structures. In polymer property-based clustering, visualizing relations of polymers was found to be an effective way of estimating the difficulty of polymer property prediction. In polymer structure-based clustering, each cluster could be formed based on the structural features. Thus, the clustering contributed to understanding structural characteristics of monomer unit structures. In addition to analyzing the data set in an unsupervised manner, we constructed polymer properties prediction models based solely on the information of monomer unit structures. Partial least squared (PLS) regression models could predict density, glass transition temperature and dissolution parameter with high accuracy. We also propose approach to evaluate obtained model using data already prepared.