{"title":"巴基斯坦、中国和印度流行的乙型肝炎病毒基因型D亚基因型D1的计算机分析","authors":"Muneeb Bahar, M. T. Pervez, Akhtar Ali, M. Babar","doi":"10.1177/1176934319861337","DOIUrl":null,"url":null,"abstract":"The focus of this study was the computational analysis of hepatitis B virus (HBV) genotype D subgenotype D1 in Pakistan, China, and India. In total, 54 complete genome sequences of HBV genotype D subgenotype D1 were downloaded from National Center for Biotechnology Information (NCBI). Of these, 6 complete genome sequences were from Pakistan, 14 were from China, and 34 were from India. Sequence alignment showed less than 4% divergence in these sequences. C and X genes showed divergence of less than 3%. Comparison over the S gene showed more than 97% similarity among the nucleotide sequences of genotype D subgenotype D1. The identity and similarity matrix of 54 nucleotide sequences of HBV genotype D subgenotype D1 from Pakistan, China, and India revealed more than 93% identity and 93% similarity. Phylogenetic analysis highlighted that complete genome isolates of HBV circulating in Pakistan had the closest evolutionary relationship with its neighboring countries China and India. China’s (HQ833466) and Pakistan’s (AB583680.1) isolates shared the same ancestor. Gene structure analysis showed that “P” gene exons were the longest, about three-fourth of the genome size, whereas gene “S” had the second longest coding regions with 2 exons and 1 intron. However, “C” and “X” genes had 1 smallest exon. X proteins had proven role in spreading of the HBV infection diseases. For HBx analysis, 1 X protein sequence of HBV genotype D subgenotype D1 belonging to each country was obtained. Homology models of the 3 X proteins generated using SWISS-MODEL revealed GMQE (Global Model Quality Estimation) = 0.1. Global and local quality estimate scores including Z-scores for Qualitative Model Energy Analysis (QMEAN) C-beta, all-atom, solvation, and torsion energy scores were similar indicating good quality, accuracy, and reliability of the predicted models. Three-dimensional (3D) visualization showed similar structures and Ramachandran plots showed a high percentage of protein residues into the favorable region for X protein models.","PeriodicalId":136690,"journal":{"name":"Evolutionary Bioinformatics Online","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"In Silico Analysis of Hepatitis B Virus Genotype D Subgenotype D1 Circulating in Pakistan, China, and India\",\"authors\":\"Muneeb Bahar, M. T. Pervez, Akhtar Ali, M. Babar\",\"doi\":\"10.1177/1176934319861337\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The focus of this study was the computational analysis of hepatitis B virus (HBV) genotype D subgenotype D1 in Pakistan, China, and India. In total, 54 complete genome sequences of HBV genotype D subgenotype D1 were downloaded from National Center for Biotechnology Information (NCBI). Of these, 6 complete genome sequences were from Pakistan, 14 were from China, and 34 were from India. Sequence alignment showed less than 4% divergence in these sequences. C and X genes showed divergence of less than 3%. Comparison over the S gene showed more than 97% similarity among the nucleotide sequences of genotype D subgenotype D1. The identity and similarity matrix of 54 nucleotide sequences of HBV genotype D subgenotype D1 from Pakistan, China, and India revealed more than 93% identity and 93% similarity. Phylogenetic analysis highlighted that complete genome isolates of HBV circulating in Pakistan had the closest evolutionary relationship with its neighboring countries China and India. China’s (HQ833466) and Pakistan’s (AB583680.1) isolates shared the same ancestor. Gene structure analysis showed that “P” gene exons were the longest, about three-fourth of the genome size, whereas gene “S” had the second longest coding regions with 2 exons and 1 intron. However, “C” and “X” genes had 1 smallest exon. X proteins had proven role in spreading of the HBV infection diseases. For HBx analysis, 1 X protein sequence of HBV genotype D subgenotype D1 belonging to each country was obtained. Homology models of the 3 X proteins generated using SWISS-MODEL revealed GMQE (Global Model Quality Estimation) = 0.1. Global and local quality estimate scores including Z-scores for Qualitative Model Energy Analysis (QMEAN) C-beta, all-atom, solvation, and torsion energy scores were similar indicating good quality, accuracy, and reliability of the predicted models. Three-dimensional (3D) visualization showed similar structures and Ramachandran plots showed a high percentage of protein residues into the favorable region for X protein models.\",\"PeriodicalId\":136690,\"journal\":{\"name\":\"Evolutionary Bioinformatics Online\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Evolutionary Bioinformatics Online\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/1176934319861337\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Evolutionary Bioinformatics Online","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/1176934319861337","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
本研究的重点是对巴基斯坦、中国和印度的乙型肝炎病毒(HBV)基因型D亚基因型D1的计算分析。从美国国家生物技术信息中心(NCBI)下载了54个HBV基因型D亚基因型D1全基因组序列。其中,6个完整基因组序列来自巴基斯坦,14个来自中国,34个来自印度。序列比对显示,这些序列的差异小于4%。C和X基因的差异小于3%。S基因的比较表明,基因型D亚基因型D1的核苷酸序列相似性大于97%。来自巴基斯坦、中国和印度的HBV基因型D亚基因型D1的54个核苷酸序列的同源性和相似性矩阵显示,同源性和相似性超过93%。系统发育分析强调,巴基斯坦流行的HBV全基因组分离株与其邻国中国和印度的进化关系最密切。中国的(HQ833466)和巴基斯坦的(AB583680.1)分离株具有相同的祖先。基因结构分析表明,P基因外显子最长,约占基因组大小的四分之三;S基因编码区最长,有2个外显子和1个内含子。而“C”和“X”基因有1个最小的外显子。X蛋白已被证实在HBV感染疾病的传播中起作用。HBx分析,获得每个国家的HBV基因型D亚基因型D1的1 X个蛋白序列。使用SWISS-MODEL生成的3个X蛋白同源性模型显示GMQE (Global Model Quality Estimation) = 0.1。整体和局部质量估计分数,包括定性模型能量分析(QMEAN) C-beta、全原子、溶剂化和扭转能分数的z分数相似,表明预测模型的质量、准确性和可靠性都很好。三维(3D)可视化显示了相似的结构,Ramachandran图显示了高比例的蛋白质残基进入X蛋白质模型的有利区域。
In Silico Analysis of Hepatitis B Virus Genotype D Subgenotype D1 Circulating in Pakistan, China, and India
The focus of this study was the computational analysis of hepatitis B virus (HBV) genotype D subgenotype D1 in Pakistan, China, and India. In total, 54 complete genome sequences of HBV genotype D subgenotype D1 were downloaded from National Center for Biotechnology Information (NCBI). Of these, 6 complete genome sequences were from Pakistan, 14 were from China, and 34 were from India. Sequence alignment showed less than 4% divergence in these sequences. C and X genes showed divergence of less than 3%. Comparison over the S gene showed more than 97% similarity among the nucleotide sequences of genotype D subgenotype D1. The identity and similarity matrix of 54 nucleotide sequences of HBV genotype D subgenotype D1 from Pakistan, China, and India revealed more than 93% identity and 93% similarity. Phylogenetic analysis highlighted that complete genome isolates of HBV circulating in Pakistan had the closest evolutionary relationship with its neighboring countries China and India. China’s (HQ833466) and Pakistan’s (AB583680.1) isolates shared the same ancestor. Gene structure analysis showed that “P” gene exons were the longest, about three-fourth of the genome size, whereas gene “S” had the second longest coding regions with 2 exons and 1 intron. However, “C” and “X” genes had 1 smallest exon. X proteins had proven role in spreading of the HBV infection diseases. For HBx analysis, 1 X protein sequence of HBV genotype D subgenotype D1 belonging to each country was obtained. Homology models of the 3 X proteins generated using SWISS-MODEL revealed GMQE (Global Model Quality Estimation) = 0.1. Global and local quality estimate scores including Z-scores for Qualitative Model Energy Analysis (QMEAN) C-beta, all-atom, solvation, and torsion energy scores were similar indicating good quality, accuracy, and reliability of the predicted models. Three-dimensional (3D) visualization showed similar structures and Ramachandran plots showed a high percentage of protein residues into the favorable region for X protein models.