Spectral neural approximations for models of transcriptional dynamics.

IF 3.2 3区 生物学 Q2 BIOPHYSICS
Biophysical journal Pub Date : 2024-09-03 Epub Date: 2024-05-06 DOI:10.1016/j.bpj.2024.04.034
Gennady Gorin, Maria Carilli, Tara Chari, Lior Pachter
{"title":"Spectral neural approximations for models of transcriptional dynamics.","authors":"Gennady Gorin, Maria Carilli, Tara Chari, Lior Pachter","doi":"10.1016/j.bpj.2024.04.034","DOIUrl":null,"url":null,"abstract":"<p><p>The advent of high-throughput transcriptomics provides an opportunity to advance mechanistic understanding of transcriptional processes and their connections to cellular function at an unprecedented, genome-wide scale. These transcriptional systems, which involve discrete stochastic events, are naturally modeled using chemical master equations (CMEs), which can be solved for probability distributions to fit biophysical rates that govern system dynamics. While CME models have been used as standards in fluorescence transcriptomics for decades to analyze single-species RNA distributions, there are often no closed-form solutions to CMEs that model multiple species, such as nascent and mature RNA transcript counts. This has prevented the application of standard likelihood-based statistical methods for analyzing high-throughput, multi-species transcriptomic datasets using biophysical models. Inspired by recent work in machine learning to learn solutions to complex dynamical systems, we leverage neural networks and statistical understanding of system distributions to produce accurate approximations to a steady-state bivariate distribution for a model of the RNA life cycle that includes nascent and mature molecules. The steady-state distribution to this simple model has no closed-form solution and requires intensive numerical solving techniques: our approach reduces likelihood evaluation time by several orders of magnitude. We demonstrate two approaches, whereby solutions are approximated by 1) learning the weights of kernel distributions with constrained parameters or 2) learning both weights and scaling factors for parameters of kernel distributions. We show that our strategies, denoted by kernel weight regression and parameter-scaled kernel weight regression, respectively, enable broad exploration of parameter space and can be used in existing likelihood frameworks to infer transcriptional burst sizes, RNA splicing rates, and mRNA degradation rates from experimental transcriptomic data.</p>","PeriodicalId":8922,"journal":{"name":"Biophysical journal","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11393700/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biophysical journal","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.bpj.2024.04.034","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/5/6 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"BIOPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

The advent of high-throughput transcriptomics provides an opportunity to advance mechanistic understanding of transcriptional processes and their connections to cellular function at an unprecedented, genome-wide scale. These transcriptional systems, which involve discrete stochastic events, are naturally modeled using chemical master equations (CMEs), which can be solved for probability distributions to fit biophysical rates that govern system dynamics. While CME models have been used as standards in fluorescence transcriptomics for decades to analyze single-species RNA distributions, there are often no closed-form solutions to CMEs that model multiple species, such as nascent and mature RNA transcript counts. This has prevented the application of standard likelihood-based statistical methods for analyzing high-throughput, multi-species transcriptomic datasets using biophysical models. Inspired by recent work in machine learning to learn solutions to complex dynamical systems, we leverage neural networks and statistical understanding of system distributions to produce accurate approximations to a steady-state bivariate distribution for a model of the RNA life cycle that includes nascent and mature molecules. The steady-state distribution to this simple model has no closed-form solution and requires intensive numerical solving techniques: our approach reduces likelihood evaluation time by several orders of magnitude. We demonstrate two approaches, whereby solutions are approximated by 1) learning the weights of kernel distributions with constrained parameters or 2) learning both weights and scaling factors for parameters of kernel distributions. We show that our strategies, denoted by kernel weight regression and parameter-scaled kernel weight regression, respectively, enable broad exploration of parameter space and can be used in existing likelihood frameworks to infer transcriptional burst sizes, RNA splicing rates, and mRNA degradation rates from experimental transcriptomic data.

转录动力学模型的谱神经近似。
高通量转录组学的出现为在前所未有的全基因组范围内推进对转录过程及其与细胞功能的联系的机理理解提供了机会。这些转录系统涉及离散的随机事件,自然可以使用化学主方程(CME)来建模,通过求解概率分布来适应支配系统动态的生物物理速率。几十年来,CME 模型一直被用作荧光转录组学分析单物种 RNA 分布的标准,但对于模拟多物种(如新生和成熟 RNA 转录本数量)的 CME,往往没有闭式解。这阻碍了使用生物物理模型分析高通量、多物种转录组数据集的基于似然法的标准统计方法的应用。受近期机器学习复杂动态系统解决方案的启发,我们利用神经网络和对系统分布的统计理解,为包括新生和成熟分子的 RNA 生命周期模型生成了稳态双变量分布的精确近似值。这种简单模型的稳态分布没有闭式解,需要密集的数值求解技术:我们的方法将可能性评估时间缩短了几个数量级。我们展示了两种方法,即通过(1)学习具有受限参数的核分布权重,或(2)学习核分布参数的权重和缩放因子来近似求解。我们证明,我们的策略(分别称为核权重回归(KWR)和参数缩放核权重回归(psKWR))能够广泛探索参数空间,并可用于现有的似然法框架,以从实验转录组数据中推断转录爆发大小、RNA剪接率和 mRNA 降解率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biophysical journal
Biophysical journal 生物-生物物理
CiteScore
6.10
自引率
5.90%
发文量
3090
审稿时长
2 months
期刊介绍: BJ publishes original articles, letters, and perspectives on important problems in modern biophysics. The papers should be written so as to be of interest to a broad community of biophysicists. BJ welcomes experimental studies that employ quantitative physical approaches for the study of biological systems, including or spanning scales from molecule to whole organism. Experimental studies of a purely descriptive or phenomenological nature, with no theoretical or mechanistic underpinning, are not appropriate for publication in BJ. Theoretical studies should offer new insights into the understanding ofexperimental results or suggest new experimentally testable hypotheses. Articles reporting significant methodological or technological advances, which have potential to open new areas of biophysical investigation, are also suitable for publication in BJ. Papers describing improvements in accuracy or speed of existing methods or extra detail within methods described previously are not suitable for BJ.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信