Tzu-Hsuan Chao, Alexander Foncerrada, Patrick J. Shamberger and Daniel P. Tabor
{"title":"Low-data machine learning models for predicting thermodynamic properties of solid–solid phase transformations in plastic crystals†","authors":"Tzu-Hsuan Chao, Alexander Foncerrada, Patrick J. Shamberger and Daniel P. Tabor","doi":"10.1039/D5SM00353A","DOIUrl":null,"url":null,"abstract":"<p >Plastic crystals, many of which are globular small molecules that exhibit transitions between rotationally ordered and rotationally disordered states, represent an important subclass of colossal barocaloric effect materials. The known set of plastic crystals is notably sparse, which presents a challenge to developing predictive thermodynamic models to describe new molecular structures. To predict the transformation entropy of plastic crystals, we developed a comprehensive database of tetrahedral plastic crystal molecules (neopentane analogs) and used several types of features, including chemical functional groups, molecular symmetry, DFT-calculated vibrational entropy, and energy decomposition analysis to train a machine learning model. To select the most relevant features, we used a correlation matrix to screen out highly correlated features and ran sure independence screening and sparsifying operator (SISSO) regression on the remaining features. The SISSO regression samples over combinatorial spaces, including operations and features, to find the relationship between material properties. Using a dataset of 49 plastic crystals and 37 non-plastic crystals based on a common tetrahedral geometry, we have demonstrated the effectiveness of this strategy. Furthermore, we applied this strategy to develop a regression model to predict transition entropy and enthalpy. The top 100 models from the operation space showed that the overall distribution of performance became narrower, sacrificing the top-performing model but avoiding the worst models. Using this approach, we identified the top-performing descriptors to further clarify the underlying mechanisms of the plastic crystal transformation.</p>","PeriodicalId":103,"journal":{"name":"Soft Matter","volume":" 29","pages":" 5957-5968"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/sm/d5sm00353a?page=search","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Matter","FirstCategoryId":"92","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/sm/d5sm00353a","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Plastic crystals, many of which are globular small molecules that exhibit transitions between rotationally ordered and rotationally disordered states, represent an important subclass of colossal barocaloric effect materials. The known set of plastic crystals is notably sparse, which presents a challenge to developing predictive thermodynamic models to describe new molecular structures. To predict the transformation entropy of plastic crystals, we developed a comprehensive database of tetrahedral plastic crystal molecules (neopentane analogs) and used several types of features, including chemical functional groups, molecular symmetry, DFT-calculated vibrational entropy, and energy decomposition analysis to train a machine learning model. To select the most relevant features, we used a correlation matrix to screen out highly correlated features and ran sure independence screening and sparsifying operator (SISSO) regression on the remaining features. The SISSO regression samples over combinatorial spaces, including operations and features, to find the relationship between material properties. Using a dataset of 49 plastic crystals and 37 non-plastic crystals based on a common tetrahedral geometry, we have demonstrated the effectiveness of this strategy. Furthermore, we applied this strategy to develop a regression model to predict transition entropy and enthalpy. The top 100 models from the operation space showed that the overall distribution of performance became narrower, sacrificing the top-performing model but avoiding the worst models. Using this approach, we identified the top-performing descriptors to further clarify the underlying mechanisms of the plastic crystal transformation.
期刊介绍:
Soft Matter is an international journal published by the Royal Society of Chemistry using Engineering-Materials Science: A Synthesis as its research focus. It publishes original research articles, review articles, and synthesis articles related to this field, reporting the latest discoveries in the relevant theoretical, practical, and applied disciplines in a timely manner, and aims to promote the rapid exchange of scientific information in this subject area. The journal is an open access journal. The journal is an open access journal and has not been placed on the alert list in the last three years.