STEMDiff: A Wavelet-Enhanced Diffusion Model for Physics-Informed STEM Image Generation.

IF 14.1 1区 材料科学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Yihui Bao, Xinyi Lu, Yanyan Xia, Zhencheng Ye, Houyang Chen
{"title":"STEMDiff: A Wavelet-Enhanced Diffusion Model for Physics-Informed STEM Image Generation.","authors":"Yihui Bao, Xinyi Lu, Yanyan Xia, Zhencheng Ye, Houyang Chen","doi":"10.1002/advs.202508266","DOIUrl":null,"url":null,"abstract":"<p><p>Machine learning has emerged as a powerful tool for analyzing scanning transmission electron microscopy (STEM) images, yet its widespread application remains constrained by the scarcity of annotated training data. While deep generative models offer a promising solution, they typically struggle to reproduce the complex high-frequency components that define experimental STEM images. Here, STEMDiff, a conditional diffusion model that transforms simple binary labels derived from crystal structures into realistic STEM images through a physical information embedding strategy, is proposed. By developing a novel Discrete Wavelet Transform-based skip-connection architecture, the high-frequency bias inherent in diffusion models are addressed, enabling the preservation of experimental noise characteristics. This approach generates images that are quantitatively nearly indistinguishable from experimental data (17 fold improvement over previous methods) while retaining ground truth structural information. Fully convolutional networks trained exclusively on these synthetic images achieve high-precision atomic column detection in experimental STEM images of WSe<sub>2</sub> and graphene, despite the presence of substantial background noise and contamination. This approach effectively eliminates the need for laborious manual annotation, providing a scalable solution to the data bottleneck in STEM image analysis. The principles underlying STEMDiff can extend to other scientific imaging modalities, accelerating advancements in materials design for water treatment.</p>","PeriodicalId":117,"journal":{"name":"Advanced Science","volume":" ","pages":"e08266"},"PeriodicalIF":14.1000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Science","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1002/advs.202508266","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Machine learning has emerged as a powerful tool for analyzing scanning transmission electron microscopy (STEM) images, yet its widespread application remains constrained by the scarcity of annotated training data. While deep generative models offer a promising solution, they typically struggle to reproduce the complex high-frequency components that define experimental STEM images. Here, STEMDiff, a conditional diffusion model that transforms simple binary labels derived from crystal structures into realistic STEM images through a physical information embedding strategy, is proposed. By developing a novel Discrete Wavelet Transform-based skip-connection architecture, the high-frequency bias inherent in diffusion models are addressed, enabling the preservation of experimental noise characteristics. This approach generates images that are quantitatively nearly indistinguishable from experimental data (17 fold improvement over previous methods) while retaining ground truth structural information. Fully convolutional networks trained exclusively on these synthetic images achieve high-precision atomic column detection in experimental STEM images of WSe2 and graphene, despite the presence of substantial background noise and contamination. This approach effectively eliminates the need for laborious manual annotation, providing a scalable solution to the data bottleneck in STEM image analysis. The principles underlying STEMDiff can extend to other scientific imaging modalities, accelerating advancements in materials design for water treatment.

STEMDiff:用于物理信息STEM图像生成的小波增强扩散模型。
机器学习已经成为分析扫描透射电子显微镜(STEM)图像的强大工具,但其广泛应用仍然受到缺乏注释训练数据的限制。虽然深度生成模型提供了一个有前途的解决方案,但它们通常难以重现定义实验STEM图像的复杂高频组件。本文提出了一种条件扩散模型STEMDiff,该模型通过物理信息嵌入策略将来自晶体结构的简单二值标签转换为真实的STEM图像。通过开发一种新的基于离散小波变换的跳过连接架构,解决了扩散模型中固有的高频偏置,从而保持了实验噪声特性。该方法生成的图像在数量上几乎与实验数据无法区分(比以前的方法提高了17倍),同时保留了地面真值结构信息。在这些合成图像上训练的全卷积网络在WSe2和石墨烯的实验STEM图像中实现了高精度的原子柱检测,尽管存在大量的背景噪声和污染。这种方法有效地消除了费力的人工注释的需要,为STEM图像分析中的数据瓶颈提供了可扩展的解决方案。STEMDiff的基本原理可以扩展到其他科学成像模式,加速水处理材料设计的进步。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advanced Science
Advanced Science CHEMISTRY, MULTIDISCIPLINARYNANOSCIENCE &-NANOSCIENCE & NANOTECHNOLOGY
CiteScore
18.90
自引率
2.60%
发文量
1602
审稿时长
1.9 months
期刊介绍: Advanced Science is a prestigious open access journal that focuses on interdisciplinary research in materials science, physics, chemistry, medical and life sciences, and engineering. The journal aims to promote cutting-edge research by employing a rigorous and impartial review process. It is committed to presenting research articles with the highest quality production standards, ensuring maximum accessibility of top scientific findings. With its vibrant and innovative publication platform, Advanced Science seeks to revolutionize the dissemination and organization of scientific knowledge.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信