{"title":"Leveraging Deep Generative Model For Computational Protein Design And Optimization","authors":"Boqiao Lai","doi":"arxiv-2408.17241","DOIUrl":null,"url":null,"abstract":"Proteins are the fundamental macromolecules that play diverse and crucial\nroles in all living matter and have tremendous implications in healthcare,\nmanufacturing, and biotechnology. Their functions are largely determined by the\nsequences of amino acids that compose them and their unique three-dimensional\nstructures when folded. The recent surge in highly accurate computational\nprotein structure prediction tools has equipped scientists with the means to\nderive preliminary structural insights without the onerous costs of\nexperimental structure determination. These breakthroughs hold profound promise\nfor building robust and efficient in silico protein design systems. While the prospect of designing de novo proteins with precise computational\naccuracy remains a grand challenge in biochemical engineering, conventional\nassembly-based and rational design methods often grapple with the expansive\ndesign space, resulting in suboptimal design success rates. Despite recently\nemerged deep learning-based models have shown promise in improving the\nefficiency of the computational protein design process, a significant gap\npersists between current design paradigms and their experimental realization.\nThis thesis will investigate the potential of deep generative models in\nrefining protein structure and sequence design methods, aiming to develop\nframeworks capable of crafting novel protein sequences with predetermined\nstructures or specific functionalities. By harnessing extensive protein\ndatabases and cutting-edge neural architectures, this research aims to enhance\nprecision and robustness in current protein design paradigms, potentially\npaving the way for advancements across various scientific fields.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"70 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.17241","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Proteins are the fundamental macromolecules that play diverse and crucial
roles in all living matter and have tremendous implications in healthcare,
manufacturing, and biotechnology. Their functions are largely determined by the
sequences of amino acids that compose them and their unique three-dimensional
structures when folded. The recent surge in highly accurate computational
protein structure prediction tools has equipped scientists with the means to
derive preliminary structural insights without the onerous costs of
experimental structure determination. These breakthroughs hold profound promise
for building robust and efficient in silico protein design systems. While the prospect of designing de novo proteins with precise computational
accuracy remains a grand challenge in biochemical engineering, conventional
assembly-based and rational design methods often grapple with the expansive
design space, resulting in suboptimal design success rates. Despite recently
emerged deep learning-based models have shown promise in improving the
efficiency of the computational protein design process, a significant gap
persists between current design paradigms and their experimental realization.
This thesis will investigate the potential of deep generative models in
refining protein structure and sequence design methods, aiming to develop
frameworks capable of crafting novel protein sequences with predetermined
structures or specific functionalities. By harnessing extensive protein
databases and cutting-edge neural architectures, this research aims to enhance
precision and robustness in current protein design paradigms, potentially
paving the way for advancements across various scientific fields.