Şeyma DURAN , Ayça Fulya ÜSTÜNTANIR DEDE , Yeliz DÜNDAR ORHAN , Muhittin ARSLANYOLU
{"title":"Genome-wide identification and in-silico analysis of papain-family cysteine protease encoding genes in Tetrahymena thermophila","authors":"Şeyma DURAN , Ayça Fulya ÜSTÜNTANIR DEDE , Yeliz DÜNDAR ORHAN , Muhittin ARSLANYOLU","doi":"10.1016/j.ejop.2023.126033","DOIUrl":null,"url":null,"abstract":"<div><p><em>Tetrahymena thermophila</em> is a promising host for recombinant protein production, but its utilization in biotechnology is mostly limited due to the presence of intracellular and extracellular papain-family cysteine proteases (PFCPs). In this study, we employed bioinformatics approaches to investigate the <em>T. thermophila</em> PFCP genes and their encoded proteases (TtPFCPs), the most prominent protease family in the genome. Results from the multiple sequence alignment, protein modeling, and conserved motif analyses revealed that all TtPFCPs showed considerably high homology with mammalian cysteine cathepsins and contained conserved amino acid motifs. The total of 121 TtPFCP-encoding genes, 14 of which were classified as non-peptidase homologs, were found. Remaining 107 true TtPFCPs were divided into four distinct subgroups depending on their homology with mammalian lysosomal cathepsins: cathepsin L-like (TtCATLs), cathepsin B-like (TtCATBs), cathepsin C-like (TtCATCs), and cathepsin X-like (TtCATXs) PFCPs. The majority of true TtPFCPs (96 out of the total) were in TtCATL-like peptidase subgroup. Both phylogenetic and chromosomal localization analyses of TtPFCPs supported the hypothesis that TtPFCPs likely evolved through tandem gene duplication events and predominantly accumulated on micronuclear chromosome 5. Additionally, more than half of the identified TtPFCP genes are expressed in considerably low quantities compared to the rest of the TtPFCP genes, which are expressed at a higher level. However, their expression patterns fluctuate based on the stage of the life cycle. In conclusion, this study provides the first comprehensive <em>in-silico</em> analysis of TtPFCP genes and encoded proteases. The results would help designing an effective strategy for protease knockout mutant cell lines to discover biological function and to improve the recombinant protein production in <em>T. thermophila</em>.</p></div>","PeriodicalId":12042,"journal":{"name":"European journal of protistology","volume":"92 ","pages":"Article 126033"},"PeriodicalIF":1.9000,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European journal of protistology","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0932473923000780","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Tetrahymena thermophila is a promising host for recombinant protein production, but its utilization in biotechnology is mostly limited due to the presence of intracellular and extracellular papain-family cysteine proteases (PFCPs). In this study, we employed bioinformatics approaches to investigate the T. thermophila PFCP genes and their encoded proteases (TtPFCPs), the most prominent protease family in the genome. Results from the multiple sequence alignment, protein modeling, and conserved motif analyses revealed that all TtPFCPs showed considerably high homology with mammalian cysteine cathepsins and contained conserved amino acid motifs. The total of 121 TtPFCP-encoding genes, 14 of which were classified as non-peptidase homologs, were found. Remaining 107 true TtPFCPs were divided into four distinct subgroups depending on their homology with mammalian lysosomal cathepsins: cathepsin L-like (TtCATLs), cathepsin B-like (TtCATBs), cathepsin C-like (TtCATCs), and cathepsin X-like (TtCATXs) PFCPs. The majority of true TtPFCPs (96 out of the total) were in TtCATL-like peptidase subgroup. Both phylogenetic and chromosomal localization analyses of TtPFCPs supported the hypothesis that TtPFCPs likely evolved through tandem gene duplication events and predominantly accumulated on micronuclear chromosome 5. Additionally, more than half of the identified TtPFCP genes are expressed in considerably low quantities compared to the rest of the TtPFCP genes, which are expressed at a higher level. However, their expression patterns fluctuate based on the stage of the life cycle. In conclusion, this study provides the first comprehensive in-silico analysis of TtPFCP genes and encoded proteases. The results would help designing an effective strategy for protease knockout mutant cell lines to discover biological function and to improve the recombinant protein production in T. thermophila.
期刊介绍:
Articles deal with protists, unicellular organisms encountered free-living in various habitats or as parasites or used in basic research or applications. The European Journal of Protistology covers topics such as the structure and systematics of protists, their development, ecology, molecular biology and physiology. Beside publishing original articles the journal offers a forum for announcing scientific meetings. Reviews of recently published books are included as well. With its diversity of topics, the European Journal of Protistology is an essential source of information for every active protistologist and for biologists of various fields.