{"title":"用数据驱动的方法计算蛋白质设计:最近的发展和观点","authors":"Haiyan Liu, Quan Chen","doi":"10.1002/wcms.1646","DOIUrl":null,"url":null,"abstract":"<p>A fundamental and challenging task of computational protein studies is to design proteins of desired structures and functions on demand. Data-driven approaches to protein design have been gaining tremendous momentum, with recent developments concentrated on protein sequence representation and generation by using deep learning language models, structure-based sequence design or inverse protein folding, and the de novo generation of new protein backbones. Currently, design methods have been assessed mainly by several useful computational metrics. However, these metrics are still highly insufficient for predicting the performance of design methods in wet experiments. Nevertheless, some methods have been verified experimentally, which showed that proteins of novel sequences and structures can be designed with data-driven models learned from natural proteins. Despite the progress, an important current limitation is the lack of accurate data-driven approaches to model or design protein dynamics.</p><p>This article is categorized under:\n </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"13 3","pages":""},"PeriodicalIF":16.8000,"publicationDate":"2022-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Computational protein design with data-driven approaches: Recent developments and perspectives\",\"authors\":\"Haiyan Liu, Quan Chen\",\"doi\":\"10.1002/wcms.1646\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>A fundamental and challenging task of computational protein studies is to design proteins of desired structures and functions on demand. Data-driven approaches to protein design have been gaining tremendous momentum, with recent developments concentrated on protein sequence representation and generation by using deep learning language models, structure-based sequence design or inverse protein folding, and the de novo generation of new protein backbones. Currently, design methods have been assessed mainly by several useful computational metrics. However, these metrics are still highly insufficient for predicting the performance of design methods in wet experiments. Nevertheless, some methods have been verified experimentally, which showed that proteins of novel sequences and structures can be designed with data-driven models learned from natural proteins. Despite the progress, an important current limitation is the lack of accurate data-driven approaches to model or design protein dynamics.</p><p>This article is categorized under:\\n </p>\",\"PeriodicalId\":236,\"journal\":{\"name\":\"Wiley Interdisciplinary Reviews: Computational Molecular Science\",\"volume\":\"13 3\",\"pages\":\"\"},\"PeriodicalIF\":16.8000,\"publicationDate\":\"2022-11-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Wiley Interdisciplinary Reviews: Computational Molecular Science\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/wcms.1646\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wiley Interdisciplinary Reviews: Computational Molecular Science","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/wcms.1646","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Computational protein design with data-driven approaches: Recent developments and perspectives
A fundamental and challenging task of computational protein studies is to design proteins of desired structures and functions on demand. Data-driven approaches to protein design have been gaining tremendous momentum, with recent developments concentrated on protein sequence representation and generation by using deep learning language models, structure-based sequence design or inverse protein folding, and the de novo generation of new protein backbones. Currently, design methods have been assessed mainly by several useful computational metrics. However, these metrics are still highly insufficient for predicting the performance of design methods in wet experiments. Nevertheless, some methods have been verified experimentally, which showed that proteins of novel sequences and structures can be designed with data-driven models learned from natural proteins. Despite the progress, an important current limitation is the lack of accurate data-driven approaches to model or design protein dynamics.
期刊介绍:
Computational molecular sciences harness the power of rigorous chemical and physical theories, employing computer-based modeling, specialized hardware, software development, algorithm design, and database management to explore and illuminate every facet of molecular sciences. These interdisciplinary approaches form a bridge between chemistry, biology, and materials sciences, establishing connections with adjacent application-driven fields in both chemistry and biology. WIREs Computational Molecular Science stands as a platform to comprehensively review and spotlight research from these dynamic and interconnected fields.