Reconstruction and Visualization of Protein Structures by exploiting Bidirectional Neural Networks and Discrete Classes

Alessia Auriemma Citarella, Lorenzo Porcelli, Luigi Di Biasi, M. Risi, G. Tortora
{"title":"Reconstruction and Visualization of Protein Structures by exploiting Bidirectional Neural Networks and Discrete Classes","authors":"Alessia Auriemma Citarella, Lorenzo Porcelli, Luigi Di Biasi, M. Risi, G. Tortora","doi":"10.1109/IV53921.2021.00053","DOIUrl":null,"url":null,"abstract":"In recent years, Deep Learning techniques have achieved some success in bioinformatics tasks, including protein conformation prediction. In this work, we propose a Bidirectional Long Short-Term Memory (BLSTM) network system, called Human Proteins Angles Prediction (HPAP), in order to improve the prediction of dihedral angles of proteins. We have introduced a discrete subdivision in classes of 5° for protein torsion angles and four new features related to accessible surface area and volume. In total there are 73 classes (72 classes include the angles between -180° and 180°, a further class is used to code the free angles at the beginning of the sequence) with a maximum expected error of ±2.5°. We have tested three model variants in several parameter combinations. With our model, we have obtained a decrease of the mean absolute error of about 2° for the $\\psi$ angle. Although our dataset is reduced in size, the accuracy of $\\varphi$ and $\\psi$ angles is comparable to the existing methods. Predicting angles accurately is useful for accurately reconstructing the three-dimensional structure of a protein. In this context, the prediction is limited to the $\\varphi$ and $\\psi$ angles and we will visualize what happens locally when a prediction is correct. In case the prediction is far from true angles, even a small error can deconstruct the backbone.","PeriodicalId":380260,"journal":{"name":"2021 25th International Conference Information Visualisation (IV)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 25th International Conference Information Visualisation (IV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IV53921.2021.00053","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

In recent years, Deep Learning techniques have achieved some success in bioinformatics tasks, including protein conformation prediction. In this work, we propose a Bidirectional Long Short-Term Memory (BLSTM) network system, called Human Proteins Angles Prediction (HPAP), in order to improve the prediction of dihedral angles of proteins. We have introduced a discrete subdivision in classes of 5° for protein torsion angles and four new features related to accessible surface area and volume. In total there are 73 classes (72 classes include the angles between -180° and 180°, a further class is used to code the free angles at the beginning of the sequence) with a maximum expected error of ±2.5°. We have tested three model variants in several parameter combinations. With our model, we have obtained a decrease of the mean absolute error of about 2° for the $\psi$ angle. Although our dataset is reduced in size, the accuracy of $\varphi$ and $\psi$ angles is comparable to the existing methods. Predicting angles accurately is useful for accurately reconstructing the three-dimensional structure of a protein. In this context, the prediction is limited to the $\varphi$ and $\psi$ angles and we will visualize what happens locally when a prediction is correct. In case the prediction is far from true angles, even a small error can deconstruct the backbone.
利用双向神经网络和离散类的蛋白质结构重建和可视化
近年来,深度学习技术在生物信息学任务中取得了一些成功,包括蛋白质构象预测。在这项工作中,我们提出了一个双向长短期记忆(BLSTM)网络系统,称为人类蛋白质角度预测(HPAP),以提高蛋白质二面角的预测。我们介绍了5°的蛋白质扭转角类的离散细分,以及与可访问表面积和体积相关的四个新特征。总共有73个类(72个类包括-180°和180°之间的角度,另一个类用于编码序列开始时的自由角度),最大期望误差为±2.5°。我们在几个参数组合中测试了三个模型变体。利用我们的模型,我们得到了$\psi$角的平均绝对误差降低了约2°。虽然我们的数据集缩小了,但$\varphi$和$\psi$角度的精度与现有方法相当。准确地预测角度对于精确地重建蛋白质的三维结构是有用的。在这种情况下,预测仅限于$\varphi$和$\psi$角度,当预测正确时,我们将可视化局部发生的情况。如果预测的角度与真实角度相差甚远,即使是很小的误差也会破坏主干。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信