STEMDiff: A Wavelet-Enhanced Diffusion Model for Physics-Informed STEM Image Generation.

IF 14.1 1区材料科学 Q1 CHEMISTRY, MULTIDISCIPLINARY

Advanced Science Pub Date : 2025-09-27 DOI:10.1002/advs.202508266

Yihui Bao, Xinyi Lu, Yanyan Xia, Zhencheng Ye, Houyang Chen

{"title":"STEMDiff: A Wavelet-Enhanced Diffusion Model for Physics-Informed STEM Image Generation.","authors":"Yihui Bao, Xinyi Lu, Yanyan Xia, Zhencheng Ye, Houyang Chen","doi":"10.1002/advs.202508266","DOIUrl":null,"url":null,"abstract":"Machine learning has emerged as a powerful tool for analyzing scanning transmission electron microscopy (STEM) images, yet its widespread application remains constrained by the scarcity of annotated training data. While deep generative models offer a promising solution, they typically struggle to reproduce the complex high-frequency components that define experimental STEM images. Here, STEMDiff, a conditional diffusion model that transforms simple binary labels derived from crystal structures into realistic STEM images through a physical information embedding strategy, is proposed. By developing a novel Discrete Wavelet Transform-based skip-connection architecture, the high-frequency bias inherent in diffusion models are addressed, enabling the preservation of experimental noise characteristics. This approach generates images that are quantitatively nearly indistinguishable from experimental data (17 fold improvement over previous methods) while retaining ground truth structural information. Fully convolutional networks trained exclusively on these synthetic images achieve high-precision atomic column detection in experimental STEM images of WSe2 and graphene, despite the presence of substantial background noise and contamination. This approach effectively eliminates the need for laborious manual annotation, providing a scalable solution to the data bottleneck in STEM image analysis. The principles underlying STEMDiff can extend to other scientific imaging modalities, accelerating advancements in materials design for water treatment.","PeriodicalId":117,"journal":{"name":"Advanced Science","volume":" ","pages":"e08266"},"PeriodicalIF":14.1000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Science","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1002/advs.202508266","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Machine learning has emerged as a powerful tool for analyzing scanning transmission electron microscopy (STEM) images, yet its widespread application remains constrained by the scarcity of annotated training data. While deep generative models offer a promising solution, they typically struggle to reproduce the complex high-frequency components that define experimental STEM images. Here, STEMDiff, a conditional diffusion model that transforms simple binary labels derived from crystal structures into realistic STEM images through a physical information embedding strategy, is proposed. By developing a novel Discrete Wavelet Transform-based skip-connection architecture, the high-frequency bias inherent in diffusion models are addressed, enabling the preservation of experimental noise characteristics. This approach generates images that are quantitatively nearly indistinguishable from experimental data (17 fold improvement over previous methods) while retaining ground truth structural information. Fully convolutional networks trained exclusively on these synthetic images achieve high-precision atomic column detection in experimental STEM images of WSe₂ and graphene, despite the presence of substantial background noise and contamination. This approach effectively eliminates the need for laborious manual annotation, providing a scalable solution to the data bottleneck in STEM image analysis. The principles underlying STEMDiff can extend to other scientific imaging modalities, accelerating advancements in materials design for water treatment.

查看原文本刊更多论文

STEMDiff：用于物理信息STEM图像生成的小波增强扩散模型。

机器学习已经成为分析扫描透射电子显微镜（STEM）图像的强大工具，但其广泛应用仍然受到缺乏注释训练数据的限制。虽然深度生成模型提供了一个有前途的解决方案，但它们通常难以重现定义实验STEM图像的复杂高频组件。本文提出了一种条件扩散模型STEMDiff，该模型通过物理信息嵌入策略将来自晶体结构的简单二值标签转换为真实的STEM图像。通过开发一种新的基于离散小波变换的跳过连接架构，解决了扩散模型中固有的高频偏置，从而保持了实验噪声特性。该方法生成的图像在数量上几乎与实验数据无法区分（比以前的方法提高了17倍），同时保留了地面真值结构信息。在这些合成图像上训练的全卷积网络在WSe2和石墨烯的实验STEM图像中实现了高精度的原子柱检测，尽管存在大量的背景噪声和污染。这种方法有效地消除了费力的人工注释的需要，为STEM图像分析中的数据瓶颈提供了可扩展的解决方案。STEMDiff的基本原理可以扩展到其他科学成像模式，加速水处理材料设计的进步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Advanced Science CHEMISTRY, MULTIDISCIPLINARYNANOSCIENCE &-NANOSCIENCE & NANOTECHNOLOGY

CiteScore

18.90

自引率

2.60%

发文量

1602

审稿时长

1.9 months

期刊介绍： Advanced Science is a prestigious open access journal that focuses on interdisciplinary research in materials science, physics, chemistry, medical and life sciences, and engineering. The journal aims to promote cutting-edge research by employing a rigorous and impartial review process. It is committed to presenting research articles with the highest quality production standards, ensuring maximum accessibility of top scientific findings. With its vibrant and innovative publication platform, Advanced Science seeks to revolutionize the dissemination and organization of scientific knowledge.