Automated collective variable discovery for MFSD2A transporter from molecular dynamics simulations.

IF 3.2 3区 生物学 Q2 BIOPHYSICS
Biophysical journal Pub Date : 2024-09-03 Epub Date: 2024-06-25 DOI:10.1016/j.bpj.2024.06.024
Myongin Oh, Margarida Rosa, Hengyi Xie, George Khelashvili
{"title":"Automated collective variable discovery for MFSD2A transporter from molecular dynamics simulations.","authors":"Myongin Oh, Margarida Rosa, Hengyi Xie, George Khelashvili","doi":"10.1016/j.bpj.2024.06.024","DOIUrl":null,"url":null,"abstract":"<p><p>Biomolecules often exhibit complex free energy landscapes in which long-lived metastable states are separated by large energy barriers. Overcoming these barriers to robustly sample transitions between the metastable states with classical molecular dynamics (MD) simulations presents a challenge. To circumvent this issue, collective variable (CV)-based enhanced sampling MD approaches are often employed. Traditional CV selection relies on intuition and prior knowledge of the system. This approach introduces bias, which can lead to incomplete mechanistic insights. Thus, automated CV detection is desired to gain a deeper understanding of the system/process. Analysis of MD data with various machine-learning algorithms, such as principal component analysis (PCA), support vector machine, and linear discriminant analysis (LDA) based approaches have been implemented for automated CV detection. However, their performance has not been systematically evaluated on structurally and mechanistically complex biological systems. Here, we applied these methods to MD simulations of the MFSD2A (Major Facilitator Superfamily Domain 2A) lysolipid transporter in multiple functionally relevant metastable states with the goal of identifying optimal CVs that would structurally discriminate these states. Specific emphasis was on the automated detection and interpretive power of LDA-based CVs. We found that LDA methods, which included a novel gradient descent-based multiclass harmonic variant, termed GDHLDA, we developed here, outperform PCA in class separation, exhibiting remarkable consistency in extracting CVs critical for distinguishing metastable states. Furthermore, the identified CVs included features previously associated with conformational transitions in MFSD2A. Specifically, conformational shifts in transmembrane helix 7 and in residue Y294 on this helix emerged as critical features discriminating the metastable states in MFSD2A. This highlights the effectiveness of LDA-based approaches in automatically extracting from MD trajectories CVs of functional relevance that can be used to drive biased MD simulations to efficiently sample conformational transitions in the molecular system.</p>","PeriodicalId":8922,"journal":{"name":"Biophysical journal","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11393714/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biophysical journal","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.bpj.2024.06.024","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/6/25 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"BIOPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

Biomolecules often exhibit complex free energy landscapes in which long-lived metastable states are separated by large energy barriers. Overcoming these barriers to robustly sample transitions between the metastable states with classical molecular dynamics (MD) simulations presents a challenge. To circumvent this issue, collective variable (CV)-based enhanced sampling MD approaches are often employed. Traditional CV selection relies on intuition and prior knowledge of the system. This approach introduces bias, which can lead to incomplete mechanistic insights. Thus, automated CV detection is desired to gain a deeper understanding of the system/process. Analysis of MD data with various machine-learning algorithms, such as principal component analysis (PCA), support vector machine, and linear discriminant analysis (LDA) based approaches have been implemented for automated CV detection. However, their performance has not been systematically evaluated on structurally and mechanistically complex biological systems. Here, we applied these methods to MD simulations of the MFSD2A (Major Facilitator Superfamily Domain 2A) lysolipid transporter in multiple functionally relevant metastable states with the goal of identifying optimal CVs that would structurally discriminate these states. Specific emphasis was on the automated detection and interpretive power of LDA-based CVs. We found that LDA methods, which included a novel gradient descent-based multiclass harmonic variant, termed GDHLDA, we developed here, outperform PCA in class separation, exhibiting remarkable consistency in extracting CVs critical for distinguishing metastable states. Furthermore, the identified CVs included features previously associated with conformational transitions in MFSD2A. Specifically, conformational shifts in transmembrane helix 7 and in residue Y294 on this helix emerged as critical features discriminating the metastable states in MFSD2A. This highlights the effectiveness of LDA-based approaches in automatically extracting from MD trajectories CVs of functional relevance that can be used to drive biased MD simulations to efficiently sample conformational transitions in the molecular system.

从分子动力学模拟中自动发现 MFSD2A 转运体的集体变量。
生物大分子经常表现出复杂的自由能图谱,其中长寿命的凋亡态被巨大的能量壁垒分隔开来。利用经典分子动力学(MD)模拟来克服这些障碍以稳健地采样阶跃态之间的转变是一项挑战。为了规避这一问题,通常采用基于集体变量(CV)的增强采样 MD 方法。传统的 CV 选择依赖于系统的直觉和先验知识。这种方法会产生偏差,导致对机理的认识不全面。因此,需要进行自动 CV 检测,以便更深入地了解系统/过程。利用各种机器学习算法分析 MD 数据,如主成分分析 (PCA)、支持向量机 (SVM) 和基于线性判别分析 (LDA) 的方法,已被用于自动 CV 检测。然而,这些方法的性能尚未在结构和机理复杂的生物系统中进行过系统评估。在这里,我们将这些方法应用于 MFSD2A(Major Facilitator Superfamily Domain 2A,主要促进剂超家族结构域 2A)赖氨酸脂质转运体在多种功能相关的蜕变状态下的 MD 模拟,目的是找出能从结构上区分这些状态的最佳 CV。重点是基于 LDA 的 CV 的自动检测和解释能力。我们发现,LDA 方法(包括我们在此开发的基于梯度下降的新型多类谐波变体,称为 GDHLDA)在类别分离方面优于 PCA,在提取对区分可代谢状态至关重要的 CV 方面表现出显著的一致性。此外,识别出的 CV 包括以前与 MFSD2A 中构象转变相关的特征。具体来说,跨膜螺旋 7 和该螺旋上残基 Y294 的构象转变成为了区分 MFSD2A 可代谢状态的关键特征。这突显了基于 LDA 的方法在自动从 MD 轨迹中提取功能相关的 CV 方面的有效性,这些 CV 可用于驱动偏向 MD 模拟,从而有效地采样分子系统中的构象转变。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biophysical journal
Biophysical journal 生物-生物物理
CiteScore
6.10
自引率
5.90%
发文量
3090
审稿时长
2 months
期刊介绍: BJ publishes original articles, letters, and perspectives on important problems in modern biophysics. The papers should be written so as to be of interest to a broad community of biophysicists. BJ welcomes experimental studies that employ quantitative physical approaches for the study of biological systems, including or spanning scales from molecule to whole organism. Experimental studies of a purely descriptive or phenomenological nature, with no theoretical or mechanistic underpinning, are not appropriate for publication in BJ. Theoretical studies should offer new insights into the understanding ofexperimental results or suggest new experimentally testable hypotheses. Articles reporting significant methodological or technological advances, which have potential to open new areas of biophysical investigation, are also suitable for publication in BJ. Papers describing improvements in accuracy or speed of existing methods or extra detail within methods described previously are not suitable for BJ.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信