Bragg Spot Finder (BSF): a new machine-learning-aided approach to deal with spot finding for rapidly filtering diffraction pattern images.

IF 6.1 3区 材料科学 Q1 Biochemistry, Genetics and Molecular Biology
Journal of Applied Crystallography Pub Date : 2024-04-26 eCollection Date: 2024-06-01 DOI:10.1107/S1600576724002450
Jianxiang Dong, Zhaozheng Yin, Dale Kreitler, Herbert J Bernstein, Jean Jakoncic
{"title":"Bragg Spot Finder (BSF): a new machine-learning-aided approach to deal with spot finding for rapidly filtering diffraction pattern images.","authors":"Jianxiang Dong, Zhaozheng Yin, Dale Kreitler, Herbert J Bernstein, Jean Jakoncic","doi":"10.1107/S1600576724002450","DOIUrl":null,"url":null,"abstract":"<p><p>Macromolecular crystallography contributes significantly to understanding diseases and, more importantly, how to treat them by providing atomic resolution 3D structures of proteins. This is achieved by collecting X-ray diffraction images of protein crystals from important biological pathways. Spotfinders are used to detect the presence of crystals with usable data, and the spots from such crystals are the primary data used to solve the relevant structures. Having fast and accurate spot finding is essential, but recent advances in synchrotron beamlines used to generate X-ray diffraction images have brought us to the limits of what the best existing spotfinders can do. This bottleneck must be removed so spotfinder software can keep pace with the X-ray beamline hardware improvements and be able to see the weak or diffuse spots required to solve the most challenging problems encountered when working with diffraction images. In this paper, we first present Bragg Spot Detection (BSD), a large benchmark Bragg spot image dataset that contains 304 images with more than 66 000 spots. We then discuss the open source extensible U-Net-based spotfinder Bragg Spot Finder (BSF), with image pre-processing, a U-Net segmentation backbone, and post-processing that includes artifact removal and watershed segmentation. Finally, we perform experiments on the BSD benchmark and obtain results that are (in terms of accuracy) comparable to or better than those obtained with two popular spotfinder software packages (<i>Dozor</i> and <i>DIALS</i>), demonstrating that this is an appropriate framework to support future extensions and improvements.</p>","PeriodicalId":14950,"journal":{"name":"Journal of Applied Crystallography","volume":"57 Pt 3","pages":"670-680"},"PeriodicalIF":6.1000,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11151665/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Crystallography","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1107/S1600576724002450","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/6/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0

Abstract

Macromolecular crystallography contributes significantly to understanding diseases and, more importantly, how to treat them by providing atomic resolution 3D structures of proteins. This is achieved by collecting X-ray diffraction images of protein crystals from important biological pathways. Spotfinders are used to detect the presence of crystals with usable data, and the spots from such crystals are the primary data used to solve the relevant structures. Having fast and accurate spot finding is essential, but recent advances in synchrotron beamlines used to generate X-ray diffraction images have brought us to the limits of what the best existing spotfinders can do. This bottleneck must be removed so spotfinder software can keep pace with the X-ray beamline hardware improvements and be able to see the weak or diffuse spots required to solve the most challenging problems encountered when working with diffraction images. In this paper, we first present Bragg Spot Detection (BSD), a large benchmark Bragg spot image dataset that contains 304 images with more than 66 000 spots. We then discuss the open source extensible U-Net-based spotfinder Bragg Spot Finder (BSF), with image pre-processing, a U-Net segmentation backbone, and post-processing that includes artifact removal and watershed segmentation. Finally, we perform experiments on the BSD benchmark and obtain results that are (in terms of accuracy) comparable to or better than those obtained with two popular spotfinder software packages (Dozor and DIALS), demonstrating that this is an appropriate framework to support future extensions and improvements.

Bragg Spot Finder (BSF):一种新的机器学习辅助方法,用于快速过滤衍射图像中的斑点。
大分子晶体学通过提供蛋白质的原子分辨率三维结构,为了解疾病,更重要的是如何治疗疾病做出了巨大贡献。这是通过收集重要生物途径中蛋白质晶体的 X 射线衍射图像来实现的。光斑探测器用于检测是否存在可用数据的晶体,这些晶体的光斑是用于解决相关结构的主要数据。快速而准确的光斑探测至关重要,但最近用于生成 X 射线衍射图像的同步加速器光束线的进步,使我们达到了现有最好的光斑探测仪所能达到的极限。我们必须消除这一瓶颈,这样光斑探测器软件才能跟上 X 射线光束线硬件改进的步伐,并能看到弱光斑或弥散光斑,从而解决在处理衍射图像时遇到的最具挑战性的问题。在本文中,我们首先介绍了布拉格光斑检测(BSD),这是一个大型基准布拉格光斑图像数据集,包含 304 幅图像和 66 000 多个光斑。然后,我们讨论了基于 U-Net 的开放源码可扩展光斑探测器 Bragg Spot Finder (BSF),该探测器具有图像预处理、U-Net 分割主干网和后处理功能,后处理包括去除伪影和分水岭分割。最后,我们在 BSD 基准上进行了实验,得到的结果(就准确性而言)与两款流行的斑点查找软件包(Dozor 和 DIALS)相当,甚至更好,这表明这是一个支持未来扩展和改进的合适框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.00
自引率
3.30%
发文量
178
审稿时长
4.7 months
期刊介绍: Many research topics in condensed matter research, materials science and the life sciences make use of crystallographic methods to study crystalline and non-crystalline matter with neutrons, X-rays and electrons. Articles published in the Journal of Applied Crystallography focus on these methods and their use in identifying structural and diffusion-controlled phase transformations, structure-property relationships, structural changes of defects, interfaces and surfaces, etc. Developments of instrumentation and crystallographic apparatus, theory and interpretation, numerical analysis and other related subjects are also covered. The journal is the primary place where crystallographic computer program information is published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信