Deep Learning‐Based Classification of Histone–DNA Interactions Using Drying Droplet Patterns

IF 11.1 Q1 MATERIALS SCIENCE, MULTIDISCIPLINARY
Safoura Vaez, Bahar Dadfar, Meike Koenig, Matthias Franzreb, Joerg Lahann
{"title":"Deep Learning‐Based Classification of Histone–DNA Interactions Using Drying Droplet Patterns","authors":"Safoura Vaez, Bahar Dadfar, Meike Koenig, Matthias Franzreb, Joerg Lahann","doi":"10.1002/smsc.202400252","DOIUrl":null,"url":null,"abstract":"Developing scalable and accurate predictive analytical methods for the classification of protein‐DNA binding is critical for advancing our understanding of molecular biology, disease mechanisms, and a wide spectrum of biotechnological and medical applications. It is discovered that histone–DNA interactions can be stratified based on stain patterns created by the deposition of various nucleoprotein solutions onto a substrate. In this study, a deep‐learning neural network is applied to categorize polarized light microscopy images of drying droplet deposits originating from different histone–DNA mixtures. These DNA stain patterns featured high reproducibility across different species and thus enabled comprehensive DNA categorization (100% accuracy) and accurate prediction of their respective binding affinities to histones. Eukaryotic DNA, which has a higher binding affinity to mammalian histones than prokaryotic DNA, is associated with a higher overall prediction accuracy. For a given species, the average prediction accuracy increased with DNA size. To demonstrate generalizability, a pre‐trained CNN is challenged with unknown images that originated from DNA samples of species not included in the training set. The CNN classified these unknown histone‐DNA samples as either strong or medium binders with 84.4% and 96.25% accuracy, respectively.","PeriodicalId":29791,"journal":{"name":"Small Science","volume":null,"pages":null},"PeriodicalIF":11.1000,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Small Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/smsc.202400252","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Developing scalable and accurate predictive analytical methods for the classification of protein‐DNA binding is critical for advancing our understanding of molecular biology, disease mechanisms, and a wide spectrum of biotechnological and medical applications. It is discovered that histone–DNA interactions can be stratified based on stain patterns created by the deposition of various nucleoprotein solutions onto a substrate. In this study, a deep‐learning neural network is applied to categorize polarized light microscopy images of drying droplet deposits originating from different histone–DNA mixtures. These DNA stain patterns featured high reproducibility across different species and thus enabled comprehensive DNA categorization (100% accuracy) and accurate prediction of their respective binding affinities to histones. Eukaryotic DNA, which has a higher binding affinity to mammalian histones than prokaryotic DNA, is associated with a higher overall prediction accuracy. For a given species, the average prediction accuracy increased with DNA size. To demonstrate generalizability, a pre‐trained CNN is challenged with unknown images that originated from DNA samples of species not included in the training set. The CNN classified these unknown histone‐DNA samples as either strong or medium binders with 84.4% and 96.25% accuracy, respectively.

Abstract Image

利用干燥液滴模式对组蛋白-DNA相互作用进行基于深度学习的分类
为蛋白质-DNA 结合的分类开发可扩展且准确的预测分析方法,对于促进我们对分子生物学、疾病机理以及广泛的生物技术和医学应用的理解至关重要。研究发现,组蛋白与 DNA 的相互作用可根据各种核蛋白溶液沉积在基底上形成的染色模式进行分层。在这项研究中,深度学习神经网络被用于对源自不同组蛋白-DNA 混合物的干燥液滴沉积的偏振光显微镜图像进行分类。这些DNA染色模式在不同物种之间具有很高的可重复性,因此能够进行全面的DNA分类(准确率为100%),并准确预测它们各自与组蛋白的结合亲和力。与原核 DNA 相比,真核 DNA 与哺乳动物组蛋白的结合亲和力更高,因此总体预测准确率也更高。对于特定物种,平均预测准确率随 DNA 大小的增加而提高。为了证明其通用性,预先训练好的 CNN 要面对来自未列入训练集的物种 DNA 样本的未知图像的挑战。CNN 将这些未知的组蛋白 DNA 样本分类为强粘合剂或中等粘合剂,准确率分别为 84.4% 和 96.25%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
14.00
自引率
2.40%
发文量
0
期刊介绍: Small Science is a premium multidisciplinary open access journal dedicated to publishing impactful research from all areas of nanoscience and nanotechnology. It features interdisciplinary original research and focused review articles on relevant topics. The journal covers design, characterization, mechanism, technology, and application of micro-/nanoscale structures and systems in various fields including physics, chemistry, materials science, engineering, environmental science, life science, biology, and medicine. It welcomes innovative interdisciplinary research and its readership includes professionals from academia and industry in fields such as chemistry, physics, materials science, biology, engineering, and environmental and analytical science. Small Science is indexed and abstracted in CAS, DOAJ, Clarivate Analytics, ProQuest Central, Publicly Available Content Database, Science Database, SCOPUS, and Web of Science.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信