{"title":"Tracing the function expansion for a primordial protein fold in the era of fold-based function prediction: β-trefoil.","authors":"Moushmi Goswami, Subhashini Srinivasan","doi":"10.1371/journal.pone.0320177","DOIUrl":null,"url":null,"abstract":"<p><p>The incredibly narrow protein fold bottleneck, which separates the billions of unique proteins on one side to deliver diverse biological functions on the other, arises from folds that tolerate mutations during evolution. One such fold, called the β-trefoil, is present in functionally diverse proteins including cytokines involved in the immune system such as interleukin-1. The unrecognizable sequence-level diversity, even among paralogs of interleukin-1 within the same chromosomal locus, suggests the resilience of this fold to mutational onslaught. Furthermore, β-trefoil domain containing-proteins are known to coexist with other domains to achieve functional diversity. In this study, we challenge the reach and limitations of function prediction using fold-fold comparison using β-trefoil fold as an example. We identified proteins containing β-trefoil fold belonging to thirty-two distinct functional classes based on diverse domain architecture and/or functional annotation by mining both the PDB and AlphaFold databases using fold-fold comparison. Among the proteins with novel domain architecture we find β-trefoil along with chitinase, lipase, β-glucosidase, protein kinase, peptidoglycan-binding + peptidase matrixin, glycosyl hydrolases family 3 + PA14 + fibronectin type- III, alpha galactosidase A, PhoD-like phosphatase, insecticidal crystal toxin, trypsin, alginate lyase and two novel structurally uncharacterized domains. We demonstrate that fold-fold comparison can extend function prediction beyond the reach of sequence-based approach and provides an opportunity to discover novel domain architecture associated with known folds. However, since extending fold similarity to functional similarity may be challenged by convergent fold evolution, we explore if β-trefoil may be a convergent evolution and share our hypothesis.</p>","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 7","pages":"e0320177"},"PeriodicalIF":2.9000,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12225799/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0320177","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The incredibly narrow protein fold bottleneck, which separates the billions of unique proteins on one side to deliver diverse biological functions on the other, arises from folds that tolerate mutations during evolution. One such fold, called the β-trefoil, is present in functionally diverse proteins including cytokines involved in the immune system such as interleukin-1. The unrecognizable sequence-level diversity, even among paralogs of interleukin-1 within the same chromosomal locus, suggests the resilience of this fold to mutational onslaught. Furthermore, β-trefoil domain containing-proteins are known to coexist with other domains to achieve functional diversity. In this study, we challenge the reach and limitations of function prediction using fold-fold comparison using β-trefoil fold as an example. We identified proteins containing β-trefoil fold belonging to thirty-two distinct functional classes based on diverse domain architecture and/or functional annotation by mining both the PDB and AlphaFold databases using fold-fold comparison. Among the proteins with novel domain architecture we find β-trefoil along with chitinase, lipase, β-glucosidase, protein kinase, peptidoglycan-binding + peptidase matrixin, glycosyl hydrolases family 3 + PA14 + fibronectin type- III, alpha galactosidase A, PhoD-like phosphatase, insecticidal crystal toxin, trypsin, alginate lyase and two novel structurally uncharacterized domains. We demonstrate that fold-fold comparison can extend function prediction beyond the reach of sequence-based approach and provides an opportunity to discover novel domain architecture associated with known folds. However, since extending fold similarity to functional similarity may be challenged by convergent fold evolution, we explore if β-trefoil may be a convergent evolution and share our hypothesis.
期刊介绍:
PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides:
* Open-access—freely accessible online, authors retain copyright
* Fast publication times
* Peer review by expert, practicing researchers
* Post-publication tools to indicate quality and impact
* Community-based dialogue on articles
* Worldwide media coverage