Low-data machine learning models for predicting thermodynamic properties of solid–solid phase transformations in plastic crystals†

IF 2.9 3区 化学 Q3 CHEMISTRY, PHYSICAL
Soft Matter Pub Date : 2025-06-25 DOI:10.1039/D5SM00353A
Tzu-Hsuan Chao, Alexander Foncerrada, Patrick J. Shamberger and Daniel P. Tabor
{"title":"Low-data machine learning models for predicting thermodynamic properties of solid–solid phase transformations in plastic crystals†","authors":"Tzu-Hsuan Chao, Alexander Foncerrada, Patrick J. Shamberger and Daniel P. Tabor","doi":"10.1039/D5SM00353A","DOIUrl":null,"url":null,"abstract":"<p >Plastic crystals, many of which are globular small molecules that exhibit transitions between rotationally ordered and rotationally disordered states, represent an important subclass of colossal barocaloric effect materials. The known set of plastic crystals is notably sparse, which presents a challenge to developing predictive thermodynamic models to describe new molecular structures. To predict the transformation entropy of plastic crystals, we developed a comprehensive database of tetrahedral plastic crystal molecules (neopentane analogs) and used several types of features, including chemical functional groups, molecular symmetry, DFT-calculated vibrational entropy, and energy decomposition analysis to train a machine learning model. To select the most relevant features, we used a correlation matrix to screen out highly correlated features and ran sure independence screening and sparsifying operator (SISSO) regression on the remaining features. The SISSO regression samples over combinatorial spaces, including operations and features, to find the relationship between material properties. Using a dataset of 49 plastic crystals and 37 non-plastic crystals based on a common tetrahedral geometry, we have demonstrated the effectiveness of this strategy. Furthermore, we applied this strategy to develop a regression model to predict transition entropy and enthalpy. The top 100 models from the operation space showed that the overall distribution of performance became narrower, sacrificing the top-performing model but avoiding the worst models. Using this approach, we identified the top-performing descriptors to further clarify the underlying mechanisms of the plastic crystal transformation.</p>","PeriodicalId":103,"journal":{"name":"Soft Matter","volume":" 29","pages":" 5957-5968"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/sm/d5sm00353a?page=search","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Matter","FirstCategoryId":"92","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/sm/d5sm00353a","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Plastic crystals, many of which are globular small molecules that exhibit transitions between rotationally ordered and rotationally disordered states, represent an important subclass of colossal barocaloric effect materials. The known set of plastic crystals is notably sparse, which presents a challenge to developing predictive thermodynamic models to describe new molecular structures. To predict the transformation entropy of plastic crystals, we developed a comprehensive database of tetrahedral plastic crystal molecules (neopentane analogs) and used several types of features, including chemical functional groups, molecular symmetry, DFT-calculated vibrational entropy, and energy decomposition analysis to train a machine learning model. To select the most relevant features, we used a correlation matrix to screen out highly correlated features and ran sure independence screening and sparsifying operator (SISSO) regression on the remaining features. The SISSO regression samples over combinatorial spaces, including operations and features, to find the relationship between material properties. Using a dataset of 49 plastic crystals and 37 non-plastic crystals based on a common tetrahedral geometry, we have demonstrated the effectiveness of this strategy. Furthermore, we applied this strategy to develop a regression model to predict transition entropy and enthalpy. The top 100 models from the operation space showed that the overall distribution of performance became narrower, sacrificing the top-performing model but avoiding the worst models. Using this approach, we identified the top-performing descriptors to further clarify the underlying mechanisms of the plastic crystal transformation.

Abstract Image

预测塑料晶体中固-固相变热力学性质的低数据机器学习模型。
塑料晶体,其中许多是球形小分子,表现出在旋转有序和旋转无序状态之间的转变,代表了巨大的压热效应材料的一个重要子类。已知的塑料晶体非常稀疏,这对开发预测热力学模型来描述新的分子结构提出了挑战。为了预测塑料晶体的转化熵,我们开发了一个四面体塑料晶体分子(新戊烷类似物)的综合数据库,并使用几种类型的特征,包括化学官能团、分子对称性、dft计算的振动熵和能量分解分析来训练机器学习模型。为了选择最相关的特征,我们使用相关矩阵筛选出高度相关的特征,并对剩余的特征进行独立筛选和稀疏化算子(SISSO)回归。SISSO在组合空间上进行回归样本,包括操作和特征,以找到材料属性之间的关系。使用基于普通四面体几何的49个塑料晶体和37个非塑料晶体的数据集,我们已经证明了该策略的有效性。此外,我们应用该策略开发了一个回归模型来预测过渡熵和焓。来自操作空间的前100个模型表明,性能的总体分布变得更窄,牺牲了表现最好的模型,但避免了表现最差的模型。使用这种方法,我们确定了表现最好的描述符,以进一步阐明塑性晶体转变的潜在机制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Soft Matter
Soft Matter 工程技术-材料科学:综合
CiteScore
6.00
自引率
5.90%
发文量
891
审稿时长
1.9 months
期刊介绍: Soft Matter is an international journal published by the Royal Society of Chemistry using Engineering-Materials Science: A Synthesis as its research focus. It publishes original research articles, review articles, and synthesis articles related to this field, reporting the latest discoveries in the relevant theoretical, practical, and applied disciplines in a timely manner, and aims to promote the rapid exchange of scientific information in this subject area. The journal is an open access journal. The journal is an open access journal and has not been placed on the alert list in the last three years.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信