Finding simplicity: unsupervised discovery of features, patterns, and order parameters via shift-invariant variational autoencoders

IF 6.3 2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology Pub Date : 2023-10-26 DOI:10.1088/2632-2153/ad073b

Maxim A. Ziatdinov, Chun Yin Wong, Sergei V. Kalinin

{"title":"Finding simplicity: unsupervised discovery of features, patterns, and order parameters via shift-invariant variational autoencoders","authors":"Maxim A. Ziatdinov, Chun Yin Wong, Sergei V. Kalinin","doi":"10.1088/2632-2153/ad073b","DOIUrl":null,"url":null,"abstract":"Abstract Recent advances in scanning tunneling and transmission electron microscopies (STM and STEM) have allowed routine generation of large volumes of imaging data containing information on the structure and functionality of materials. The experimental data sets contain signatures of long-range phenomena such as physical order parameter fields, polarization, and strain gradients in STEM, or standing electronic waves and carrier-mediated exchange interactions in STM, all superimposed onto scanning system distortions and gradual changes of contrast due to drift and/or mis-tilt effects. Correspondingly, while the human eye can readily identify certain patterns in the images such as lattice periodicities, repeating structural elements, or microstructures, their automatic extraction and classification are highly non-trivial and universal pathways to accomplish such analyses are absent. We pose that the most distinctive elements of the patterns observed in STM and (S)TEM images are similarity and (almost-) periodicity, behaviors stemming directly from the parsimony of elementary atomic structures, superimposed on the gradual changes reflective of order parameter distributions. However, the discovery of these elements via global Fourier methods is non-trivial due to variability and lack of ideal discrete translation symmetry. To address this problem, we explore the shift-invariant variational autoencoders (shift-VAE) that allow disentangling characteristic repeating features in the images, their variations, and shifts that inevitably occur when randomly sampling the image space. Shift-VAEs balance the uncertainty in the position of the object of interest with the uncertainty in shape reconstruction. This approach is illustrated for model 1D data, and further extended to synthetic and experimental STM and STEM 2D data. We further introduce an approach for training shift-VAEs that allows finding the latent variables that comport to known physical behavior. In this specific case, the condition is that the latent variable maps should be smooth on the length scale of the atomic lattice (as expected for physical order parameters), but other conditions can be imposed. The opportunities and limitations of the shift VAE analysis for pattern discovery are elucidated.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"59 4","pages":"0"},"PeriodicalIF":6.3000,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine Learning Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2632-2153/ad073b","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract Recent advances in scanning tunneling and transmission electron microscopies (STM and STEM) have allowed routine generation of large volumes of imaging data containing information on the structure and functionality of materials. The experimental data sets contain signatures of long-range phenomena such as physical order parameter fields, polarization, and strain gradients in STEM, or standing electronic waves and carrier-mediated exchange interactions in STM, all superimposed onto scanning system distortions and gradual changes of contrast due to drift and/or mis-tilt effects. Correspondingly, while the human eye can readily identify certain patterns in the images such as lattice periodicities, repeating structural elements, or microstructures, their automatic extraction and classification are highly non-trivial and universal pathways to accomplish such analyses are absent. We pose that the most distinctive elements of the patterns observed in STM and (S)TEM images are similarity and (almost-) periodicity, behaviors stemming directly from the parsimony of elementary atomic structures, superimposed on the gradual changes reflective of order parameter distributions. However, the discovery of these elements via global Fourier methods is non-trivial due to variability and lack of ideal discrete translation symmetry. To address this problem, we explore the shift-invariant variational autoencoders (shift-VAE) that allow disentangling characteristic repeating features in the images, their variations, and shifts that inevitably occur when randomly sampling the image space. Shift-VAEs balance the uncertainty in the position of the object of interest with the uncertainty in shape reconstruction. This approach is illustrated for model 1D data, and further extended to synthetic and experimental STM and STEM 2D data. We further introduce an approach for training shift-VAEs that allows finding the latent variables that comport to known physical behavior. In this specific case, the condition is that the latent variable maps should be smooth on the length scale of the atomic lattice (as expected for physical order parameters), but other conditions can be imposed. The opportunities and limitations of the shift VAE analysis for pattern discovery are elucidated.

查看原文本刊更多论文

查找简单性:通过移位不变变分自编码器无监督地发现特征、模式和顺序参数

扫描隧道和透射电子显微镜(STM和STEM)的最新进展使得常规生成大量包含材料结构和功能信息的成像数据成为可能。实验数据集包含远程现象的特征，如STEM中的物理有序参数场、极化和应变梯度，或STM中的驻波和载流子中介的交换相互作用，所有这些都叠加到扫描系统畸变和由于漂移和/或误倾斜效应导致的对比度逐渐变化上。相应地，虽然人眼可以很容易地识别图像中的某些模式，如晶格周期性、重复结构元素或微观结构，但它们的自动提取和分类是非常重要的，并且缺乏完成此类分析的通用途径。我们提出，在STM和(S)TEM图像中观察到的模式中最独特的元素是相似性和(几乎)周期性，这些行为直接源于基本原子结构的简约性，叠加在反映有序参数分布的逐渐变化上。然而，由于可变性和缺乏理想的离散平移对称性，通过全局傅里叶方法发现这些元素是不平凡的。为了解决这个问题，我们探索了移位不变变分自编码器(shift-VAE)，它允许分离图像中的特征重复特征，它们的变化，以及在随机采样图像空间时不可避免地发生的移位。shift - vae平衡了目标位置的不确定性和形状重建的不确定性。该方法适用于模型1D数据，并进一步扩展到合成和实验STM和STEM 2D数据。我们进一步介绍了一种训练移位值的方法，该方法允许找到符合已知物理行为的潜在变量。在这个特定的情况下，条件是潜在变量映射在原子晶格的长度尺度上应该是平滑的(正如物理顺序参数所期望的那样)，但是可以施加其他条件。阐述了移动VAE分析在模式发现中的机会和局限性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Machine Learning Science and Technology Computer Science-Artificial Intelligence

CiteScore

9.10

自引率

4.40%

发文量

审稿时长

5 weeks

期刊介绍： Machine Learning Science and Technology is a multidisciplinary open access journal that bridges the application of machine learning across the sciences with advances in machine learning methods and theory as motivated by physical insights. Specifically, articles must fall into one of the following categories: advance the state of machine learning-driven applications in the sciences or make conceptual, methodological or theoretical advances in machine learning with applications to, inspiration from, or motivated by scientific problems.