Comparative Functional Genomics Studies for Understanding the Hypothetical Proteins in Mycobacterium Tuberculosis Variant Microti 12

Q3 Computer Science
Tejaswini Vijay Shinde, Tejas Gajanan Shinde, V. V. Chougule, Anagha Rajendra Ghorpade, Geeta Vikas Utekar, Amol S Jadhav, Bandu Shamlal Pawar, S. Sanmukh
{"title":"Comparative Functional Genomics Studies for Understanding the Hypothetical Proteins in\n Mycobacterium Tuberculosis Variant Microti 12","authors":"Tejaswini Vijay Shinde, Tejas Gajanan Shinde, V. V. Chougule, Anagha Rajendra Ghorpade, Geeta Vikas Utekar, Amol S Jadhav, Bandu Shamlal Pawar, S. Sanmukh","doi":"10.2174/18750362-v16-e230711-2023-2","DOIUrl":null,"url":null,"abstract":"\n \n The Mycobacterium tuberculosis complex (MTBC) bacteria include the slowly growing, host-associated bacteria Mycobacterium tuberculosis, Mycobacterium Bovis, Mycobacterium microti, Mycobacterium africanum, Mycobacterium pinnipedii.\n \n \n \n Comparative Functional Genomics Studies for understanding the Hypothetical Proteins in Mycobacterium tuberculosis variant microti 12.\n \n \n \n A computational genomics study was performed to understand the 247 hypothetical protein genes. Functional annotation of virtual proteins was performed on different servers to maximize confidence level.\n \n \n \n Sequence Retrieval. The whole genome sequences for the Mycobacterium tuberculosis micro variant 12 were retrieved from the KEGG database (\n http://www.genome.jp/kegg/) and were used for screening 247 hypothetical proteins (Fig.\n \n 1\n ). Functional Annotation and Sub-cellular localization. The Mycobacterium tuberculosis micro variant 12 hypothetical proteins were screened and sorted out from the genome and were individually analyzed for the presence of conserved functional domains by using computational biology tools like CDD-BLAST (\n https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) ;Pfam (\n http://pfam.xfam.org/ncbiseq/398365647); The subcellular localization of hypothetical proteins was determined by CELLO2GO (\n http://cello.life.nctu.edu.tw). These web tools can search the defined conserved domains in the sequences available in the online servers or databases and assist in the classification of proteins in the appropriate families. Protein Structure Prediction. The\n in-silico structure predictions of the hypothetical protein sequences showing functional properties were carried out by using the PS2 Protein Structure Prediction Server (\n http://www.ps2.life.nctu.edu.tw/). The online server helps to generate the 3D structures of the hypothetical proteins. The server accepts the sequences in FASTA format as a query to generate resultant proteins 3D structures. The structure determination is completely based on the conserved template regions detected during functional annotations. Protein-protein interaction through String database: The interaction of each hypothetical protein analyzed for functional characteristics was subjected to a protein-protein interaction server for the prediction of a possible functional role in interaction amongst the available known proteins (\n https://string-db.org/). This information can help us to further validated the functional role of such hypothetical proteins and their possible role in the Mycobacterium Tuberculosis micro variant. Protein secondary structure prediction through JPred4: The secondary structure prediction of all the hypothetical proteins was determined through JPred4 (\n http://www.compbio.dundee.ac.uk/jpred4/index.html) and served to identify the available secondary structures in the unknown hypothetical protein sequences. These further help us to understand the available templates in the uncharacterized protein sequences for the prediction of novel functions associated with these proteins. The predictions were further characterized by the Phyre2 server for structural modeling and prediction of templates based on comparative analysis based on conserved domains. Protein modeling, prediction, and analysis through Phyre2. The hypothetical proteins which were identified to have functional properties were further characterized by the Phyre2 server (\n http://www.sbg.bio.ic.ac.uk/phyre2) for structural modeling and prediction of templates based on comparative analysis based on conserved domains.\n \n \n \n \n A computational genomics study was performed to understand the 247 hypothetical protein genes Functional annotation of virtual proteins, and was performed on different servers to maximize confidence level. The functional prediction was performed by CDD-Blast and Pfam. The gene sequences of proteins have probably been successfully functionally annotated, characterized, and their subcellular localization and 3-D structural predictions have been predicted computationally. Online automated bioinformatics tools such as CDD-Blast, Pfam, CELLO2GO and PS2-Server were used for the structural and functional characterization of screened hypothetical proteins. The structure, function, and subcellular localization of a hypothetical protein from Mycobacterium tuberculosis variant microti 12 have been obtained and presented (Fig.\n \n 2\n ). Also, the three-dimensional structure generated after using the template with the highest score was displayed as the template ID in the structure column of the respective hypothetical protein. However, as systems biology denies hypothetical protein functions, the structures of such proteins can be tested through biological processes and experiments, making them suitable for understanding their role in the life cycle, pathogenesis, and drug development. We can further explore these predictive possibilities in pharmaceuticals, and other clinically relevant studies. This study by HP helped find structure-function relationships in Mycobacterium tuberculosis variant microti 12 using a variety of bioinformatics tools. The string database made predictions about protein-protein interactions and the template helped us predict a hypothetical protein structure and even helped us find its 3D protein structure. Protein profiling can be performed on structures retrieved from these servers. This is useful for proteomics studies, including protein-protein interactions, protein expression of specific hypothetical proteins, and post-translational modifications of protein-coding genes. Further understanding of these hypothetical proteins can help us to know more about the Mycobacterium tuberculosis complex (MTBC) and may assist in Drugs and inhibitors against different pathogens within this complex.\n \n \n \n \n The all-inclusive bioinformatic study has helped to functionally elucidate 247 hypothetical proteins, which have resulted and made it easier to understand many functional proteins available in the Mycobacterium tuberculosis micro variant 12. The subcellular localization of the 247 sorted hypothetical proteins was also carried & which further helped us understand the localization of identified enzymes or proteins. We have successfully characterized the 247 unknown proteins of hypothetical protein sequences from Mycobacterium tuberculosis micro variant 12 to validate their structure and functions of the gene products. These predicted functions and three-dimensional structures may lead to establishing their role in the life cycle of the bacterium. This computationally generated data can also be further used for developing new protocols for new vaccines against Mycobacterium tuberculosis micro variant 12 that are essential for preventing infection, diseases, and transmission.\n This complete result of Hypothetical Protein is needed for further studies of the whole genomic of the Mycobacterium Tuberculosis micro variant 12 for their function interpretation which further help in the understanding of its functions as well as structure.\n Moreover, this interpretation would help us to study the evolution of Mycobacterium Tuberculosis micro variant 12 which further helps in the process of discovering the drugs to inhibit the causes of diseases.\n","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Open Bioinformatics Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/18750362-v16-e230711-2023-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 1

Abstract

The Mycobacterium tuberculosis complex (MTBC) bacteria include the slowly growing, host-associated bacteria Mycobacterium tuberculosis, Mycobacterium Bovis, Mycobacterium microti, Mycobacterium africanum, Mycobacterium pinnipedii. Comparative Functional Genomics Studies for understanding the Hypothetical Proteins in Mycobacterium tuberculosis variant microti 12. A computational genomics study was performed to understand the 247 hypothetical protein genes. Functional annotation of virtual proteins was performed on different servers to maximize confidence level. Sequence Retrieval. The whole genome sequences for the Mycobacterium tuberculosis micro variant 12 were retrieved from the KEGG database ( http://www.genome.jp/kegg/) and were used for screening 247 hypothetical proteins (Fig. 1 ). Functional Annotation and Sub-cellular localization. The Mycobacterium tuberculosis micro variant 12 hypothetical proteins were screened and sorted out from the genome and were individually analyzed for the presence of conserved functional domains by using computational biology tools like CDD-BLAST ( https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) ;Pfam ( http://pfam.xfam.org/ncbiseq/398365647); The subcellular localization of hypothetical proteins was determined by CELLO2GO ( http://cello.life.nctu.edu.tw). These web tools can search the defined conserved domains in the sequences available in the online servers or databases and assist in the classification of proteins in the appropriate families. Protein Structure Prediction. The in-silico structure predictions of the hypothetical protein sequences showing functional properties were carried out by using the PS2 Protein Structure Prediction Server ( http://www.ps2.life.nctu.edu.tw/). The online server helps to generate the 3D structures of the hypothetical proteins. The server accepts the sequences in FASTA format as a query to generate resultant proteins 3D structures. The structure determination is completely based on the conserved template regions detected during functional annotations. Protein-protein interaction through String database: The interaction of each hypothetical protein analyzed for functional characteristics was subjected to a protein-protein interaction server for the prediction of a possible functional role in interaction amongst the available known proteins ( https://string-db.org/). This information can help us to further validated the functional role of such hypothetical proteins and their possible role in the Mycobacterium Tuberculosis micro variant. Protein secondary structure prediction through JPred4: The secondary structure prediction of all the hypothetical proteins was determined through JPred4 ( http://www.compbio.dundee.ac.uk/jpred4/index.html) and served to identify the available secondary structures in the unknown hypothetical protein sequences. These further help us to understand the available templates in the uncharacterized protein sequences for the prediction of novel functions associated with these proteins. The predictions were further characterized by the Phyre2 server for structural modeling and prediction of templates based on comparative analysis based on conserved domains. Protein modeling, prediction, and analysis through Phyre2. The hypothetical proteins which were identified to have functional properties were further characterized by the Phyre2 server ( http://www.sbg.bio.ic.ac.uk/phyre2) for structural modeling and prediction of templates based on comparative analysis based on conserved domains. A computational genomics study was performed to understand the 247 hypothetical protein genes Functional annotation of virtual proteins, and was performed on different servers to maximize confidence level. The functional prediction was performed by CDD-Blast and Pfam. The gene sequences of proteins have probably been successfully functionally annotated, characterized, and their subcellular localization and 3-D structural predictions have been predicted computationally. Online automated bioinformatics tools such as CDD-Blast, Pfam, CELLO2GO and PS2-Server were used for the structural and functional characterization of screened hypothetical proteins. The structure, function, and subcellular localization of a hypothetical protein from Mycobacterium tuberculosis variant microti 12 have been obtained and presented (Fig. 2 ). Also, the three-dimensional structure generated after using the template with the highest score was displayed as the template ID in the structure column of the respective hypothetical protein. However, as systems biology denies hypothetical protein functions, the structures of such proteins can be tested through biological processes and experiments, making them suitable for understanding their role in the life cycle, pathogenesis, and drug development. We can further explore these predictive possibilities in pharmaceuticals, and other clinically relevant studies. This study by HP helped find structure-function relationships in Mycobacterium tuberculosis variant microti 12 using a variety of bioinformatics tools. The string database made predictions about protein-protein interactions and the template helped us predict a hypothetical protein structure and even helped us find its 3D protein structure. Protein profiling can be performed on structures retrieved from these servers. This is useful for proteomics studies, including protein-protein interactions, protein expression of specific hypothetical proteins, and post-translational modifications of protein-coding genes. Further understanding of these hypothetical proteins can help us to know more about the Mycobacterium tuberculosis complex (MTBC) and may assist in Drugs and inhibitors against different pathogens within this complex. The all-inclusive bioinformatic study has helped to functionally elucidate 247 hypothetical proteins, which have resulted and made it easier to understand many functional proteins available in the Mycobacterium tuberculosis micro variant 12. The subcellular localization of the 247 sorted hypothetical proteins was also carried & which further helped us understand the localization of identified enzymes or proteins. We have successfully characterized the 247 unknown proteins of hypothetical protein sequences from Mycobacterium tuberculosis micro variant 12 to validate their structure and functions of the gene products. These predicted functions and three-dimensional structures may lead to establishing their role in the life cycle of the bacterium. This computationally generated data can also be further used for developing new protocols for new vaccines against Mycobacterium tuberculosis micro variant 12 that are essential for preventing infection, diseases, and transmission. This complete result of Hypothetical Protein is needed for further studies of the whole genomic of the Mycobacterium Tuberculosis micro variant 12 for their function interpretation which further help in the understanding of its functions as well as structure. Moreover, this interpretation would help us to study the evolution of Mycobacterium Tuberculosis micro variant 12 which further helps in the process of discovering the drugs to inhibit the causes of diseases.
比较功能基因组学研究以了解结核分枝杆菌变异株Microti 12中的假设蛋白质
结核分枝杆菌复合体(MTBC)细菌包括缓慢生长的宿主相关细菌结核分枝杆菌、牛分枝杆菌、微小分枝杆菌、非洲分枝杆菌、鳍足分枝杆菌。比较功能基因组学研究以了解结核分枝杆菌变异株microti中的假设蛋白质12。进行了一项计算基因组学研究,以了解247个假设的蛋白质基因。在不同的服务器上进行虚拟蛋白质的功能注释,以最大限度地提高置信水平。序列检索。结核分枝杆菌微变体12的全基因组序列从KEGG数据库中检索(http://www.genome.jp/kegg/)并用于筛选247种假设蛋白质(图1)。功能注释和亚细胞定位。从基因组中筛选和分选结核分枝杆菌微变体12种假设蛋白质,并通过使用计算生物学工具如CDD-BLAST(https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi);Pfam(http://pfam.xfam.org/ncbiseq/398365647);假设蛋白质的亚细胞定位通过CELLO2GO(http://cello.life.nctu.edu.tw)。这些网络工具可以搜索在线服务器或数据库中可用序列中定义的保守结构域,并帮助将蛋白质分类到适当的家族中。蛋白质结构预测。通过使用PS2蛋白质结构预测服务器(http://www.ps2.life.nctu.edu.tw/)。在线服务器帮助生成假设蛋白质的3D结构。服务器接受FASTA格式的序列作为查询,以生成生成的蛋白质3D结构。结构的确定完全基于在功能注释期间检测到的保守模板区域。通过字符串数据库进行的蛋白质-蛋白质相互作用:对功能特征分析的每个假设蛋白质的相互作用进行蛋白质-蛋白质交互服务器,以预测可用已知蛋白质之间的相互作用中可能的功能作用(https://string-db.org/)。这些信息可以帮助我们进一步验证这些假设蛋白质的功能作用及其在结核分枝杆菌微变体中的可能作用。通过JPred4预测蛋白质二级结构:所有假设蛋白质的二级结构预测通过JPred4http://www.compbio.dundee.ac.uk/jpred4/index.html)并用于鉴定未知假设蛋白质序列中的可用二级结构。这些进一步帮助我们了解未表征蛋白质序列中可用于预测与这些蛋白质相关的新功能的模板。通过Phyre2服务器对预测进行进一步表征,用于基于保守结构域的比较分析的模板的结构建模和预测。通过Phyre2。通过Phyre2服务器(http://www.sbg.bio.ic.ac.uk/phyre2)用于基于保守结构域的比较分析的模板结构建模和预测。进行了一项计算基因组学研究,以了解247个假设的蛋白质基因虚拟蛋白质的功能注释,并在不同的服务器上进行,以最大限度地提高置信水平。通过CDD-Blast和Pfam进行功能预测。蛋白质的基因序列可能已经成功地进行了功能注释和表征,并且已经通过计算预测了它们的亚细胞定位和三维结构预测。在线自动化生物信息学工具,如CDD-Blast、Pfam、CELLO2GO和PS2-Server,用于筛选的假设蛋白质的结构和功能表征。已经获得并展示了来自结核分枝杆菌变体microti 12的假设蛋白质的结构、功能和亚细胞定位(图2)。此外,在使用具有最高得分的模板之后生成的三维结构被显示为各个假设蛋白质的结构栏中的模板ID。然而,由于系统生物学否认了假设的蛋白质功能,因此可以通过生物过程和实验来测试这些蛋白质的结构,使其适合理解其在生命周期、发病机制和药物开发中的作用。我们可以在药物和其他临床相关研究中进一步探索这些预测可能性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Open Bioinformatics Journal
Open Bioinformatics Journal Computer Science-Computer Science (miscellaneous)
CiteScore
2.40
自引率
0.00%
发文量
4
期刊介绍: The Open Bioinformatics Journal is an Open Access online journal, which publishes research articles, reviews/mini-reviews, letters, clinical trial studies and guest edited single topic issues in all areas of bioinformatics and computational biology. The coverage includes biomedicine, focusing on large data acquisition, analysis and curation, computational and statistical methods for the modeling and analysis of biological data, and descriptions of new algorithms and databases. The Open Bioinformatics Journal, a peer reviewed journal, is an important and reliable source of current information on the developments in the field. The emphasis will be on publishing quality articles rapidly and freely available worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信