Persistent Sheaf Laplacian Analysis of Protein Flexibility.

ArXiv Pub Date : 2025-03-30

Nicole Hayes, Xiaoqi Wei, Hongsong Feng, Ekaterina Merkurjev, Guo-Wei Wei

引用次数: 0

Abstract

Protein flexibility, measured by the B-factor or Debye-Waller factor, is essential for protein functions such as structural support, enzyme activity, cellular communication, and molecular transport. Theoretical analysis and prediction of protein flexibility are crucial for protein design, engineering, and drug discovery. In this work, we introduce the persistent sheaf Laplacian (PSL), an effective tool in topological data analysis, to model and analyze protein flexibility. By representing the local topology and geometry of protein atoms through the multiscale harmonic and non-harmonic spectra of PSLs, the proposed model effectively captures protein flexibility and provides accurate, robust predictions of protein B-factors. Our PSL model demonstrates an increase in accuracy of 32% compared to the classical Gaussian network model (GNM) in predicting B-factors for a dataset of 364 proteins. Additionally, we construct a blind machine learning prediction method utilizing global and local protein features. Extensive computations and comparisons validate the effectiveness of the proposed PSL model for B-factor predictions.

本刊更多论文

蛋白质柔韧性的持续束拉普拉斯分析。

由b因子或德拜-沃勒因子测量的蛋白质柔韧性，对于蛋白质的结构支持、酶活性、细胞通讯和分子运输等功能至关重要。蛋白质柔韧性的理论分析和预测对蛋白质设计、工程和药物发现至关重要。在这项工作中，我们引入了持久束拉普拉斯算子（PSL），这是一种有效的拓扑数据分析工具，用于建模和分析蛋白质的灵活性。通过PSLs的多尺度谐波和非谐波光谱来表示蛋白质原子的局部拓扑结构和几何形状，该模型有效地捕获了蛋白质的灵活性，并提供了准确、稳健的蛋白质b因子预测。与经典高斯网络模型（GNM）相比，我们的PSL模型在预测364种蛋白质数据集的b因子方面的准确性提高了32%。此外，我们构建了一种利用全局和局部蛋白质特征的盲机器学习预测方法。大量的计算和比较验证了所提出的PSL模型对b因子预测的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ArXiv

自引率

0.00%

发文量