通过淀粉样蛋白和非淀粉样蛋白六肽导航均匀图路径

IF 4.8 3区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY
László Keresztes, Evelin Szögi, Bálint Varga, Viktor Farkas, András Perczel, Vince Grolmusz
{"title":"通过淀粉样蛋白和非淀粉样蛋白六肽导航均匀图路径","authors":"László Keresztes,&nbsp;Evelin Szögi,&nbsp;Bálint Varga,&nbsp;Viktor Farkas,&nbsp;András Perczel,&nbsp;Vince Grolmusz","doi":"10.1002/jcc.70238","DOIUrl":null,"url":null,"abstract":"<p>Hexapeptides are increasingly applied as model systems for studying the amyloidogenic properties of oligo- and polypeptides. It is possible to construct 64 million different hexapeptides from the twenty proteinogenic amino acid residues. Today's experimental amyloid databases contain only a fraction of these annotated hexapeptides. For labeling all the possible hexapeptides as “amyloidogenic” or “non-amyloidogenic” there exist several computational predictors with good accuracy. It may be of interest to define and study a simple graph structure on the 64 million hexapeptides as nodes, when two hexapeptides are connected by an edge if they differ by only a single residue. For example, in this graph, HIKKLM is connected to AIKKLM, or HIKKNM, or HIKKLC, but it is not connected with an edge to VVKKLM or HIKNPM. In the present contribution, we consider our previously published artificial intelligence-based tool, the Budapest Amyloid Predictor (BAP for short), and demonstrate a spectacular property of this predictor in the graph defined above. We show that for any two hexapeptides predicted to be “amyloidogenic” by the BAP predictor, there exists an easily constructible path of length at most six that passes through neighboring hexapeptides all predicted to be “amyloidogenic” by BAP. For example, the predicted amyloidogenic ILVWIW and FWLCYL hexapeptides can be connected through the length-6 path ILVWIW-IWVWIW-IWVCIW-IWVCIL-FWVCIL-FWLCIL-FWLCYL in such a way that the neighbors differ in exactly one residue, and all hexapeptides on the path are predicted to be amyloidogenic by BAP. The symmetric statement also holds true for non-amyloidogenic predicted hexapeptides: For any such pair, there exists a path of length at most six, traversing only predicted non-amyloidogenic hexapeptides. It is noted that the mentioned property of the Budapest Amyloid Predictor https://pitgroup.org/bap is not proprietary; it is also true for any linear Support Vector Machine (SVM)-based predictors; therefore, for any future improvements of BAP using the linear SVM prediction technique.</p>","PeriodicalId":188,"journal":{"name":"Journal of Computational Chemistry","volume":"46 26","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jcc.70238","citationCount":"0","resultStr":"{\"title\":\"Navigating Homogeneous Graph Paths Through Amyloidogenic and Non-Amyloidogenic Hexapeptides\",\"authors\":\"László Keresztes,&nbsp;Evelin Szögi,&nbsp;Bálint Varga,&nbsp;Viktor Farkas,&nbsp;András Perczel,&nbsp;Vince Grolmusz\",\"doi\":\"10.1002/jcc.70238\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Hexapeptides are increasingly applied as model systems for studying the amyloidogenic properties of oligo- and polypeptides. It is possible to construct 64 million different hexapeptides from the twenty proteinogenic amino acid residues. Today's experimental amyloid databases contain only a fraction of these annotated hexapeptides. For labeling all the possible hexapeptides as “amyloidogenic” or “non-amyloidogenic” there exist several computational predictors with good accuracy. It may be of interest to define and study a simple graph structure on the 64 million hexapeptides as nodes, when two hexapeptides are connected by an edge if they differ by only a single residue. For example, in this graph, HIKKLM is connected to AIKKLM, or HIKKNM, or HIKKLC, but it is not connected with an edge to VVKKLM or HIKNPM. In the present contribution, we consider our previously published artificial intelligence-based tool, the Budapest Amyloid Predictor (BAP for short), and demonstrate a spectacular property of this predictor in the graph defined above. We show that for any two hexapeptides predicted to be “amyloidogenic” by the BAP predictor, there exists an easily constructible path of length at most six that passes through neighboring hexapeptides all predicted to be “amyloidogenic” by BAP. For example, the predicted amyloidogenic ILVWIW and FWLCYL hexapeptides can be connected through the length-6 path ILVWIW-IWVWIW-IWVCIW-IWVCIL-FWVCIL-FWLCIL-FWLCYL in such a way that the neighbors differ in exactly one residue, and all hexapeptides on the path are predicted to be amyloidogenic by BAP. The symmetric statement also holds true for non-amyloidogenic predicted hexapeptides: For any such pair, there exists a path of length at most six, traversing only predicted non-amyloidogenic hexapeptides. It is noted that the mentioned property of the Budapest Amyloid Predictor https://pitgroup.org/bap is not proprietary; it is also true for any linear Support Vector Machine (SVM)-based predictors; therefore, for any future improvements of BAP using the linear SVM prediction technique.</p>\",\"PeriodicalId\":188,\"journal\":{\"name\":\"Journal of Computational Chemistry\",\"volume\":\"46 26\",\"pages\":\"\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jcc.70238\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computational Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/jcc.70238\",\"RegionNum\":3,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Chemistry","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jcc.70238","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

六肽越来越多地被用作研究寡肽和多肽淀粉样蛋白形成特性的模型系统。从20个蛋白质原氨基酸残基中可以构建6400万个不同的六肽。今天的实验性淀粉样蛋白数据库只包含这些注释六肽的一小部分。为了将所有可能的六肽标记为“淀粉样蛋白”或“非淀粉样蛋白”,存在几种具有良好准确性的计算预测方法。当两个六肽仅相差一个残基时,在6400万个六肽作为节点上定义和研究一个简单的图结构可能会引起人们的兴趣。例如,在这个图中,HIKKLM连接到AIKKLM,或HIKKNM,或HIKKLC,但它没有连接到VVKKLM或HIKNPM的边。在目前的贡献中,我们考虑了我们之前发表的基于人工智能的工具,布达佩斯淀粉样蛋白预测器(简称BAP),并在上面定义的图表中展示了该预测器的惊人特性。我们发现,对于被BAP预测为“淀粉样蛋白”的任意两个六肽,存在一条长度最多为6的易于构建的路径,该路径穿过被BAP预测为“淀粉样蛋白”的相邻六肽。例如,预测的致淀粉性ILVWIW和FWLCYL六肽可以通过长度为6的路径ILVWIW- iwvwiw - iwvciw - iwvcil - fwvcil - fwlcil -FWLCYL连接,使得相邻的残基恰好相差一个残基,并且该路径上的所有六肽都被BAP预测为致淀粉性。对于非淀粉样蛋白生成的预测六肽,对称陈述也成立:对于任何这样的对,存在长度最多为6的路径,只遍历预测的非淀粉样蛋白生成的六肽。值得注意的是,布达佩斯淀粉样蛋白预测器https://pitgroup.org/bap的上述特性不是专有的;对于任何基于线性支持向量机(SVM)的预测器也是如此;因此,对于未来使用线性支持向量机预测技术对BAP进行改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Navigating Homogeneous Graph Paths Through Amyloidogenic and Non-Amyloidogenic Hexapeptides

Navigating Homogeneous Graph Paths Through Amyloidogenic and Non-Amyloidogenic Hexapeptides

Navigating Homogeneous Graph Paths Through Amyloidogenic and Non-Amyloidogenic Hexapeptides

Hexapeptides are increasingly applied as model systems for studying the amyloidogenic properties of oligo- and polypeptides. It is possible to construct 64 million different hexapeptides from the twenty proteinogenic amino acid residues. Today's experimental amyloid databases contain only a fraction of these annotated hexapeptides. For labeling all the possible hexapeptides as “amyloidogenic” or “non-amyloidogenic” there exist several computational predictors with good accuracy. It may be of interest to define and study a simple graph structure on the 64 million hexapeptides as nodes, when two hexapeptides are connected by an edge if they differ by only a single residue. For example, in this graph, HIKKLM is connected to AIKKLM, or HIKKNM, or HIKKLC, but it is not connected with an edge to VVKKLM or HIKNPM. In the present contribution, we consider our previously published artificial intelligence-based tool, the Budapest Amyloid Predictor (BAP for short), and demonstrate a spectacular property of this predictor in the graph defined above. We show that for any two hexapeptides predicted to be “amyloidogenic” by the BAP predictor, there exists an easily constructible path of length at most six that passes through neighboring hexapeptides all predicted to be “amyloidogenic” by BAP. For example, the predicted amyloidogenic ILVWIW and FWLCYL hexapeptides can be connected through the length-6 path ILVWIW-IWVWIW-IWVCIW-IWVCIL-FWVCIL-FWLCIL-FWLCYL in such a way that the neighbors differ in exactly one residue, and all hexapeptides on the path are predicted to be amyloidogenic by BAP. The symmetric statement also holds true for non-amyloidogenic predicted hexapeptides: For any such pair, there exists a path of length at most six, traversing only predicted non-amyloidogenic hexapeptides. It is noted that the mentioned property of the Budapest Amyloid Predictor https://pitgroup.org/bap is not proprietary; it is also true for any linear Support Vector Machine (SVM)-based predictors; therefore, for any future improvements of BAP using the linear SVM prediction technique.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
6.60
自引率
3.30%
发文量
247
审稿时长
1.7 months
期刊介绍: This distinguished journal publishes articles concerned with all aspects of computational chemistry: analytical, biological, inorganic, organic, physical, and materials. The Journal of Computational Chemistry presents original research, contemporary developments in theory and methodology, and state-of-the-art applications. Computational areas that are featured in the journal include ab initio and semiempirical quantum mechanics, density functional theory, molecular mechanics, molecular dynamics, statistical mechanics, cheminformatics, biomolecular structure prediction, molecular design, and bioinformatics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信