QRChEM: A Deep Learning Framework for Materials Property Prediction and Design Using QR Codes

IF 4.3 Q2 ENGINEERING, CHEMICAL
Haripriyan Uthayakumar, Rahul Krishna K, Raj Jain, Rajnish Kumar and Tarak K. Patra*, 
{"title":"QRChEM: A Deep Learning Framework for Materials Property Prediction and Design Using QR Codes","authors":"Haripriyan Uthayakumar,&nbsp;Rahul Krishna K,&nbsp;Raj Jain,&nbsp;Rajnish Kumar and Tarak K. Patra*,&nbsp;","doi":"10.1021/acsengineeringau.3c00055","DOIUrl":null,"url":null,"abstract":"<p >Machine learning (ML) surrogate models are used for the rapid prediction of materials properties and are promising tools for accelerating new materials design and development. The performance and accuracy of these surrogate models appear to be intricately connected to the molecular representation that is employed. Developing efficient numerical representations of molecules is vital for the success of surrogate models in predicting materials' properties. Here, we propose a new machine-readable molecular representation, namely a molecular quick response (QR) code, for the deep learning of materials structure–property correlations. We built a convolutional deep neural network (CNN) model based on molecular QR codes, which is abbreviated as QRChEM. QRChEM was trained and validated using ∼21 000 data for four representative properties of small molecules, namely specific heat, enthalpy, zero-point vibrational energy, and HOMO–LUMO band gap. We show that QRChEM outperforms the commonly used Morgan fingerprint-based and one-hot encoding (OHE)-based deep learning frameworks. We further performed UMAP (uniform manifold approximation and projection) on the molecular QR codes to demonstrate the differentiability of the molecular topologies, which is vital for high-fidelity surrogate model development.</p>","PeriodicalId":29804,"journal":{"name":"ACS Engineering Au","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acsengineeringau.3c00055","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Engineering Au","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsengineeringau.3c00055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Machine learning (ML) surrogate models are used for the rapid prediction of materials properties and are promising tools for accelerating new materials design and development. The performance and accuracy of these surrogate models appear to be intricately connected to the molecular representation that is employed. Developing efficient numerical representations of molecules is vital for the success of surrogate models in predicting materials' properties. Here, we propose a new machine-readable molecular representation, namely a molecular quick response (QR) code, for the deep learning of materials structure–property correlations. We built a convolutional deep neural network (CNN) model based on molecular QR codes, which is abbreviated as QRChEM. QRChEM was trained and validated using ∼21 000 data for four representative properties of small molecules, namely specific heat, enthalpy, zero-point vibrational energy, and HOMO–LUMO band gap. We show that QRChEM outperforms the commonly used Morgan fingerprint-based and one-hot encoding (OHE)-based deep learning frameworks. We further performed UMAP (uniform manifold approximation and projection) on the molecular QR codes to demonstrate the differentiability of the molecular topologies, which is vital for high-fidelity surrogate model development.

Abstract Image

Abstract Image

QRChEM:基于QR码的材料性能预测和设计的深度学习框架
机器学习(ML)代理模型用于快速预测材料性能,是加速新材料设计和开发的有前途的工具。这些替代模型的性能和准确性似乎与所采用的分子表示复杂地联系在一起。开发有效的分子数值表示对于替代模型在预测材料性质方面的成功至关重要。在这里,我们提出了一种新的机器可读的分子表示,即分子快速响应(QR)码,用于材料结构-性质相关性的深度学习。我们基于分子QR码(简称QRChEM)构建了卷积深度神经网络(CNN)模型。QRChEM使用~ 21 000个小分子的四个代表性特性数据进行训练和验证,即比热、焓、零点振动能和HOMO-LUMO带隙。我们表明,QRChEM优于常用的基于Morgan指纹和基于单热编码(one-hot encoding, OHE)的深度学习框架。我们进一步对分子QR码进行了UMAP(均匀流形逼近和投影),以证明分子拓扑的可微分性,这对于高保真代理模型的开发至关重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Engineering Au
ACS Engineering Au 化学工程技术-
自引率
0.00%
发文量
0
期刊介绍: )ACS Engineering Au is an open access journal that reports significant advances in chemical engineering applied chemistry and energy covering fundamentals processes and products. The journal's broad scope includes experimental theoretical mathematical computational chemical and physical research from academic and industrial settings. Short letters comprehensive articles reviews and perspectives are welcome on topics that include:Fundamental research in such areas as thermodynamics transport phenomena (flow mixing mass & heat transfer) chemical reaction kinetics and engineering catalysis separations interfacial phenomena and materialsProcess design development and intensification (e.g. process technologies for chemicals and materials synthesis and design methods process intensification multiphase reactors scale-up systems analysis process control data correlation schemes modeling machine learning Artificial Intelligence)Product research and development involving chemical and engineering aspects (e.g. catalysts plastics elastomers fibers adhesives coatings paper membranes lubricants ceramics aerosols fluidic devices intensified process equipment)Energy and fuels (e.g. pre-treatment processing and utilization of renewable energy resources; processing and utilization of fuels; properties and structure or molecular composition of both raw fuels and refined products; fuel cells hydrogen batteries; photochemical fuel and energy production; decarbonization; electrification; microwave; cavitation)Measurement techniques computational models and data on thermo-physical thermodynamic and transport properties of materials and phase equilibrium behaviorNew methods models and tools (e.g. real-time data analytics multi-scale models physics informed machine learning models machine learning enhanced physics-based models soft sensors high-performance computing)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信