Before and after AlphaFold2: An overview of protein structure prediction.

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Frontiers in Bioinformatics. Pub Date: 2023-02-28; eCollection Date: 2023-01-01; DOI: 10.3389/fbinf.2023.1120370
Letícia M F Bertoline, Angélica N Lima, Jose E Krieger, Samantha K Teixeira

Abstract

A protein's three-dimensional structure is directly correlated with its function, and determining that structure is critical to understanding biological processes and to addressing problems in human health and the life sciences in general. Although new protein structures are continually obtained experimentally, there is still a large gap between the number of protein sequences deposited in UniProt and the number with a resolved tertiary structure. In this context, studies have emerged that predict protein structures using template-based methods or free modeling. In recent years, different methods have been combined to overcome their individual limitations, culminating in the emergence of AlphaFold2, which demonstrated that predicting protein structure with high accuracy at unprecedented scale is possible. Despite its current impact on the field, AlphaFold2 has limitations. Recently, new methods based on protein language models have promised to revolutionize protein structural biology by allowing protein structure and function to be discovered solely from the evolutionary patterns present in protein sequences. Even though these methods do not reach AlphaFold2's accuracy, they already address some of its limitations and have been able to predict, with high accuracy, more than 200 million proteins from metagenomic databases. In this mini-review, we provide an overview of the breakthroughs in protein structure prediction before and after the emergence of AlphaFold2.
