Sherry Yang, Simon Batzner, Ruiqi Gao, Muratahan Aykol, Alexander L. Gaunt, Brendan McMorrow, Danilo J. Rezende, Dale Schuurmans, Igor Mordatch, Ekin D. Cubuk
{"title":"Generative Hierarchical Materials Search","authors":"Sherry Yang, Simon Batzner, Ruiqi Gao, Muratahan Aykol, Alexander L. Gaunt, Brendan McMorrow, Danilo J. Rezende, Dale Schuurmans, Igor Mordatch, Ekin D. Cubuk","doi":"arxiv-2409.06762","DOIUrl":null,"url":null,"abstract":"Generative models trained at scale can now produce text, video, and more\nrecently, scientific data such as crystal structures. In applications of\ngenerative approaches to materials science, and in particular to crystal\nstructures, the guidance from the domain expert in the form of high-level\ninstructions can be essential for an automated system to output candidate\ncrystals that are viable for downstream research. In this work, we formulate\nend-to-end language-to-structure generation as a multi-objective optimization\nproblem, and propose Generative Hierarchical Materials Search (GenMS) for\ncontrollable generation of crystal structures. GenMS consists of (1) a language\nmodel that takes high-level natural language as input and generates\nintermediate textual information about a crystal (e.g., chemical formulae), and\n(2) a diffusion model that takes intermediate information as input and\ngenerates low-level continuous value crystal structures. GenMS additionally\nuses a graph neural network to predict properties (e.g., formation energy) from\nthe generated crystal structures. During inference, GenMS leverages all three\ncomponents to conduct a forward tree search over the space of possible\nstructures. Experiments show that GenMS outperforms other alternatives of\ndirectly using language models to generate structures both in satisfying user\nrequest and in generating low-energy structures. We confirm that GenMS is able\nto generate common crystal structures such as double perovskites, or spinels,\nsolely from natural language input, and hence can form the foundation for more\ncomplex structure generation in near future.","PeriodicalId":501234,"journal":{"name":"arXiv - PHYS - Materials Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Materials Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06762","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Generative models trained at scale can now produce text, video, and more
recently, scientific data such as crystal structures. In applications of
generative approaches to materials science, and in particular to crystal
structures, the guidance from the domain expert in the form of high-level
instructions can be essential for an automated system to output candidate
crystals that are viable for downstream research. In this work, we formulate
end-to-end language-to-structure generation as a multi-objective optimization
problem, and propose Generative Hierarchical Materials Search (GenMS) for
controllable generation of crystal structures. GenMS consists of (1) a language
model that takes high-level natural language as input and generates
intermediate textual information about a crystal (e.g., chemical formulae), and
(2) a diffusion model that takes intermediate information as input and
generates low-level continuous value crystal structures. GenMS additionally
uses a graph neural network to predict properties (e.g., formation energy) from
the generated crystal structures. During inference, GenMS leverages all three
components to conduct a forward tree search over the space of possible
structures. Experiments show that GenMS outperforms other alternatives of
directly using language models to generate structures both in satisfying user
request and in generating low-energy structures. We confirm that GenMS is able
to generate common crystal structures such as double perovskites, or spinels,
solely from natural language input, and hence can form the foundation for more
complex structure generation in near future.