Genomics & informatics最新文献

筛选
英文 中文
Bioregulatory event extraction using large language models: a case study of rice literature. 使用大型语言模型提取生物调控事件:水稻文献案例研究。
Genomics & informatics Pub Date : 2024-10-31 DOI: 10.1186/s44342-024-00022-3
Xinzhi Yao, Zhihan He, Jingbo Xia
{"title":"Bioregulatory event extraction using large language models: a case study of rice literature.","authors":"Xinzhi Yao, Zhihan He, Jingbo Xia","doi":"10.1186/s44342-024-00022-3","DOIUrl":"10.1186/s44342-024-00022-3","url":null,"abstract":"<p><p>The extraction of biological regulation events has been a key focus in the field of biomedical nature language processing (BioNLP). However, existing methods often encounter challenges such as cascading errors in text mining pipelines and limitations in topic coverage from the selected corpus. Fortunately, the emergence of large language models (LLMs) presents a potential solution due to their robust semantic understanding and extensive knowledge base. To explore this potential, our project at the Biomedical Linked Annotation Hackathon 8 (BLAH 8) investigates the feasibility of using LLMs to extract biological regulation events. Our findings, based on the analysis of rice literature, demonstrate the promising performance of LLMs in this task, while also highlighting several concerns that must be addressed in future LLM-based application in low-resource topic.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"20"},"PeriodicalIF":0.0,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11529424/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142560352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast and accurate short-read alignment with hybrid hash-tree data structure. 利用混合哈希树数据结构实现快速准确的短读取配准。
Genomics & informatics Pub Date : 2024-10-29 DOI: 10.1186/s44342-024-00012-5
Junichiro Makino, Toshikazu Ebisuzaki, Ryutaro Himeno, Yoshihide Hayashizaki
{"title":"Fast and accurate short-read alignment with hybrid hash-tree data structure.","authors":"Junichiro Makino, Toshikazu Ebisuzaki, Ryutaro Himeno, Yoshihide Hayashizaki","doi":"10.1186/s44342-024-00012-5","DOIUrl":"10.1186/s44342-024-00012-5","url":null,"abstract":"<p><p>Rapidly increasing the amount of short-read data generated by NGSs (new-generation sequencers) calls for the development of fast and accurate read alignment programs. The programs based on the hash table (BLAST) and Burrows-Wheeler transform (bwa-mem) are used, and the latter is known to give superior performance. We here present a new algorithm, a hybrid of hash table and suffix tree, which we designed to speed up the alignment of short reads against large reference sequences such as the human genome. The total turnaround time for processing one human genome sample (read depth of 30) is just 31 min with our system while that was more than 25 h with bwa-mem/gatk. The time for the aligner alone is 28 min for our system but around 2 h for bwa-mem. Our new algorithm is 4.4 times faster than bwa-mem while achieving similar accuracy. Variant calling and other downstream analyses after the alignment can be done with open-source tools such as SAMtools and Genome Analysis Toolkit (gatk) packages, as well as our own fast variant caller, which is well parallelized and much faster than gatk.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"19"},"PeriodicalIF":0.0,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11520436/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142549935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lightweight technology stacks for assistive linked annotations. 用于辅助链接注释的轻量级技术栈。
Genomics & informatics Pub Date : 2024-10-10 DOI: 10.1186/s44342-024-00021-4
Nishad Thalhath
{"title":"Lightweight technology stacks for assistive linked annotations.","authors":"Nishad Thalhath","doi":"10.1186/s44342-024-00021-4","DOIUrl":"10.1186/s44342-024-00021-4","url":null,"abstract":"<p><p>This report presents the findings of a project from the 8th Biomedical Linked Annotation Hackathon (BLAH) to explore lightweight technology stacks to enhance assistive linked annotations. Using modern JavaScript frameworks and edge functions, in-browser Named Entity Recognition (NER), serverless embedding and vector search within web interfaces, and efficient serverless full-text search were implemented. Through this experimental approach, a proof of concept to demonstrate the feasibility and performance of these technologies was demonstrated. The results show that lightweight stacks can significantly improve the efficiency and cost-effectiveness of annotation tools and provide a local-first, privacy-oriented, and secure alternative to traditional server-based solutions in various use cases. This work emphasizes the potential of developing annotation interfaces that are more responsive, scalable, and user-friendly, which would benefit bioinformatics researchers, practitioners, and software developers.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"17"},"PeriodicalIF":0.0,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11468380/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142402555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Molecular diagnostic approach to rare neurological diseases from a clinician viewpoint. 从临床医生的角度看罕见神经系统疾病的分子诊断方法。
Genomics & informatics Pub Date : 2024-10-10 DOI: 10.1186/s44342-024-00025-0
Jin Sook Lee
{"title":"Molecular diagnostic approach to rare neurological diseases from a clinician viewpoint.","authors":"Jin Sook Lee","doi":"10.1186/s44342-024-00025-0","DOIUrl":"10.1186/s44342-024-00025-0","url":null,"abstract":"<p><p>Advancements in sequencing technology have significantly enhanced diagnostic capabilities for rare neurological diseases. This progress in molecular diagnostics can greatly impact clinical management and facilitate the development of personalized treatments for patients with rare neurological diseases. Neurologists with expertise should raise clinical awareness, as phenotyping remains crucial for making a clinical diagnosis, even in the genomics era. They should prioritize different types of genomic tests, considering both the benefits and the limitations inherent to each test. Notably, long-read sequencing is being utilized in cases suspected to involve repeat expansion disorders or complex structural variants. Repeat expansion disorders are highly prevalent in neurological diseases, particularly within the ataxia group. Significant efforts, including periodic reanalysis, data sharing, or integration of genomics with multi-omics studies, should be directed toward cases that remain undiagnosed after standard next-generation sequencing.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"18"},"PeriodicalIF":0.0,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11468364/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142402556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Compression rates of microbial genomes are associated with genome size and base composition. 微生物基因组的压缩率与基因组大小和碱基组成有关。
Genomics & informatics Pub Date : 2024-10-10 DOI: 10.1186/s44342-024-00018-z
Jon Bohlin, John H-O Pettersson
{"title":"Compression rates of microbial genomes are associated with genome size and base composition.","authors":"Jon Bohlin, John H-O Pettersson","doi":"10.1186/s44342-024-00018-z","DOIUrl":"10.1186/s44342-024-00018-z","url":null,"abstract":"<p><strong>Background: </strong>To what degree a string of symbols can be compressed reveals important details about its complexity. For instance, strings that are not compressible are random and carry a low information potential while the opposite is true for highly compressible strings. We explore to what extent microbial genomes are amenable to compression as they vary considerably both with respect to size and base composition. For instance, microbial genome sizes vary from less than 100,000 base pairs in symbionts to more than 10 million in soil-dwellers. Genomic base composition, often summarized as genomic AT or GC content due to the similar frequencies of adenine and thymine on one hand and cytosine and guanine on the other, also vary substantially; the most extreme microbes can have genomes with AT content below 25% or above 85% AT. Base composition determines the frequency of DNA words, consisting of multiple nucleotides or oligonucleotides, and may therefore also influence compressibility. Using 4,713 RefSeq genomes, we examined the association between compressibility, using both a DNA based- (MBGC) and a general purpose (ZPAQ) compression algorithm, and genome size, AT content as well as genomic oligonucleotide usage variance (OUV) using generalized additive models.</p><p><strong>Results: </strong>We find that genome size (p < 0.001) and OUV (p < 0.001) are both strongly associated with genome redundancy for both type of file compressors. The DNA-based MBGC compressor managed to improve compression with approximately 3% on average with respect to ZPAQ. Moreover, MBGC detected a significant (p < 0.001) compression ratio difference between AT poor and AT rich genomes which was not detected with ZPAQ.</p><p><strong>Conclusion: </strong>As lack of compressibility is equivalent to randomness, our findings suggest that smaller and AT rich genomes may have accumulated more random mutations on average than larger and AT poor genomes which, in turn, were significantly more redundant. Moreover, we find that OUV is a strong proxy for genome compressibility in microbial genomes. The ZPAQ compressor was found to agree with the MBGC compressor, albeit with a poorer performance, except for the compressibility of AT-rich and AT-poor/GC-rich genomes.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"16"},"PeriodicalIF":0.0,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11468749/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142402554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A prediction of mutations in infectious viruses using artificial intelligence. 利用人工智能预测传染性病毒的突变。
Genomics & informatics Pub Date : 2024-10-08 DOI: 10.1186/s44342-024-00019-y
Won Jong Choi, Jongkeun Park, Do Young Seong, Dae Sun Chung, Dongwan Hong
{"title":"A prediction of mutations in infectious viruses using artificial intelligence.","authors":"Won Jong Choi, Jongkeun Park, Do Young Seong, Dae Sun Chung, Dongwan Hong","doi":"10.1186/s44342-024-00019-y","DOIUrl":"10.1186/s44342-024-00019-y","url":null,"abstract":"<p><p>Many subtypes of SARS-CoV-2 have emerged since its early stages, with mutations showing regional and racial differences. These mutations significantly affected the infectivity and severity of the virus. This study aimed to predict the mutations that occur during the evolution of SARS-CoV-2 and identify the key characteristics for making these predictions. We collected and organized data on the lineage, date, clade, and mutations of SARS-CoV-2 from publicly available databases and processed them to predict the mutations. In addition, we utilized various artificial intelligence models to predict newly emerging mutations and created various training sets based on clade information. Using only mutation information resulted in low performance of the learning models, whereas incorporating clade differentiation resulted in high performance in machine learning models, including XGBoost (accuracy: 0.999). However, mutations fixed in the receptor-binding motif (RBM) region of Omicron resulted in decreased predictive performance. Using these models, we predicted potential mutation positions for 24C, following the recently emerged 24A and 24B clades. We identified a mutation at position Q493 in the RBM region. Our study developed effective artificial intelligence models and characteristics for predicting new mutations in continuously evolving infectious viruses.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"15"},"PeriodicalIF":0.0,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11463117/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142396543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Review of the technology used for structural characterization of the GMO genome using NGS data. 回顾利用 NGS 数据对转基因生物基因组进行结构表征的技术。
Genomics & informatics Pub Date : 2024-10-02 DOI: 10.1186/s44342-024-00016-1
Kahee Moon, Prakash Basnet, Taeyoung Um, Ik-Young Choi
{"title":"Review of the technology used for structural characterization of the GMO genome using NGS data.","authors":"Kahee Moon, Prakash Basnet, Taeyoung Um, Ik-Young Choi","doi":"10.1186/s44342-024-00016-1","DOIUrl":"10.1186/s44342-024-00016-1","url":null,"abstract":"<p><p>The molecular characterization of genetically modified organisms (GMOs) is essential for ensuring safety and gaining regulatory approval for commercialization. According to CODEX standards, this characterization involves evaluating the presence of introduced genes, insertion sites, copy number, and nucleotide sequence structure. Advances in technology have led to the increased use of next-generation sequencing (NGS) over traditional methods such as Southern blotting. While both methods provide high reproducibility and accuracy, Southern blotting is labor-intensive and time-consuming due to the need for repetitive probe design and analyses for each target, resulting in low throughput. Conversely, NGS facilitates rapid and comprehensive analysis by mapping whole-genome sequencing (WGS) data to plasmid sequences, accurately identifying T-DNA insertion sites and flanking regions. This advantage allows for efficient detection of T-DNA presence, copy number, and unintended gene insertions without additional probe work. This paper reviews the current status of GMO genome characterization using NGS and proposes more efficient strategies for this purpose.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"14"},"PeriodicalIF":0.0,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11445869/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142368146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Antibiotic resistance challenge: evaluating anthraquinones as rifampicin monooxygenase inhibitors through integrated bioinformatics analysis. 抗生素耐药性挑战:通过综合生物信息学分析评估作为利福平单氧化酶抑制剂的蒽醌类化合物。
Genomics & informatics Pub Date : 2024-09-04 DOI: 10.1186/s44342-024-00015-2
Mohammad Reza Arabestani, Masoumeh Saadat, Amir Taherkhani
{"title":"Antibiotic resistance challenge: evaluating anthraquinones as rifampicin monooxygenase inhibitors through integrated bioinformatics analysis.","authors":"Mohammad Reza Arabestani, Masoumeh Saadat, Amir Taherkhani","doi":"10.1186/s44342-024-00015-2","DOIUrl":"10.1186/s44342-024-00015-2","url":null,"abstract":"<p><strong>Objective: </strong>Antibiotic resistance poses a pressing and crucial global public health challenge, leading to significant clinical and health-related consequences. Substantial evidence highlights the pivotal involvement of rifampicin monooxygenase (RIFMO) in the context of antibiotic resistance. Hence, inhibiting RIFMO could offer potential in the treatment of various infections. Anthraquinones, a group of organic compounds, have shown promise in addressing tuberculosis. This study employed integrated bioinformatics approaches to evaluate the potential inhibitory effects of a selection of anthraquinones on RIFMO. The findings were subsequently compared with those of rifampicin (RIF), serving as a positive control inhibitor.</p><p><strong>Methods: </strong>The AutoDock 4.0 tool assessed the binding free energy between 21 anthraquinones and the RIFMO catalytic cleft. The ligands were ranked based on the most favorable scores derived from ΔG<sub>binding</sub>. The docking analyses for the highest-ranked anthraquinone and RIF underwent a cross-validation process. This validation procedure utilized the SwissDock server and the Schrödinger Maestro docking software. Molecular dynamics simulations were conducted to scrutinize the stability of the backbone atoms in free RIFMO, RIFMO-RIF, and RIFMO complexed with the top-ranked anthraquinone throughout a 100-ns computer simulation. The Discovery Studio Visualizer tool visualized interactions between RIFMO residues and ligands. An evaluation of the pharmacokinetics and toxicity profiles of the tested compounds was also conducted.</p><p><strong>Results: </strong>Five anthraquinones were indicated with ΔG<sub>binding</sub> scores less than - 10 kcal/mol. Hypericin emerged as the most potent RIFMO inhibitor, boasting a ΔG<sub>binding</sub> score and inhibition constant value of - 12.11 kcal/mol and 798.99 pM, respectively. The agreement across AutoDock 4.0, SwissDock, and Schrödinger Maestro results highlighted hypericin's notable binding affinity to the RIFMO catalytic cleft. The RIFMO-hypericin complex achieved stability after a 70-ns computer simulation, exhibiting a root-mean-square deviation of 0.55 nm. Oral bioavailability analysis revealed that all anthraquinones except hypericin, sennidin A, and sennidin B may be suitable for oral administration. Furthermore, the carcinogenicity prediction analysis indicated a favorable safety profile for all examined anthraquinones.</p><p><strong>Conclusion: </strong>Inhibiting RIFMO, particularly with anthraquinones such as hypericin, holds promise as a potential therapeutic strategy for infectious diseases.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"13"},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11375879/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142134908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shared alleles and genetic structures in different Thai domestic cat breeds: the possible influence of common racial origins. 泰国不同家猫品种的共同等位基因和遗传结构:共同种族起源的可能影响。
Genomics & informatics Pub Date : 2024-07-31 DOI: 10.1186/s44342-024-00013-4
Wattanawan Jaito, Worapong Singchat, Chananya Patta, Chadaphon Thatukan, Nichakorn Kumnan, Piangjai Chalermwong, Trifan Budi, Thitipong Panthum, Wongsathit Wongloet, Pish Wattanadilokchatkun, Thanyapat Thong, Narongrit Muangmai, Kyudong Han, Prateep Duengkae, Rattanin Phatcharakullawarawat, Kornsorn Srikulnath
{"title":"Shared alleles and genetic structures in different Thai domestic cat breeds: the possible influence of common racial origins.","authors":"Wattanawan Jaito, Worapong Singchat, Chananya Patta, Chadaphon Thatukan, Nichakorn Kumnan, Piangjai Chalermwong, Trifan Budi, Thitipong Panthum, Wongsathit Wongloet, Pish Wattanadilokchatkun, Thanyapat Thong, Narongrit Muangmai, Kyudong Han, Prateep Duengkae, Rattanin Phatcharakullawarawat, Kornsorn Srikulnath","doi":"10.1186/s44342-024-00013-4","DOIUrl":"10.1186/s44342-024-00013-4","url":null,"abstract":"<p><p>Over hundreds of years, cats have been domesticated and selectively bred, resulting in numerous pedigreed breeds expedited by recent cat shows and breeding associations. Concerns have been raised about the limited breeding options and the genetic implications of inbreeding, indicating challenges in maintaining genetic diversity and accurate identification in purebred cats. In this study, genetic variability and structure were examined in 5 Thai domestic cat breeds using 15 microsatellite markers and mitochondrial DNA (mtDNA) D-loop sequencing. In total, 184 samples representing the Wichien Maat (WCM), Suphalak (SL), Khao-Manee (KM), Korat (KR), and Konja (KJ) breeds were analyzed. High genetic diversity (H<sub>o</sub> and H<sub>e</sub> > 0.5) was observed in all breeds, and mtDNA analysis revealed two primary haplogroups (A and B) that were shared among all domestic cat breeds in Thailand and globally. However, minor differences were observed between Thai domestic cat breeds based on clustering analyses, in which a distinct genetic structure was observed in the WCM breed. This suggests that allele fixation for distinctive morphological traits has occurred in Thai domestic cat breeds that emerged in isolated regions with shared racial origins. Analysis of relationships among individuals within the breed revealed high identification efficiency in Thai domestic cat breeds (P<sub>(ID)sibs</sub> < 10<sup>-4</sup>). Additionally, diverse and effective individual identification can be ensured by optimizing marker efficiency by using only nine loci. This comprehensive genetic characterization provides valuable insights into conservation strategies and breeding practices for Thai domestic cat breeds.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"12"},"PeriodicalIF":0.0,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11292921/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141862011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genetic diversity and natural selection analysis of VAR2CSA and vir genes: implication for vaccine development. VAR2CSA 和 vir 基因的遗传多样性和自然选择分析:对疫苗开发的影响。
Genomics & informatics Pub Date : 2024-07-15 DOI: 10.1186/s44342-024-00009-0
Joseph Hawadak, Aditi Arya, Shewta Chaudhry, Vineeta Singh
{"title":"Genetic diversity and natural selection analysis of VAR2CSA and vir genes: implication for vaccine development.","authors":"Joseph Hawadak, Aditi Arya, Shewta Chaudhry, Vineeta Singh","doi":"10.1186/s44342-024-00009-0","DOIUrl":"10.1186/s44342-024-00009-0","url":null,"abstract":"<p><p>Variable surface antigens (VSAs) encoded by var and vir genes in Plasmodium falciparum and Plasmodium vivax, respectively, are known to be involved in malaria pathogenesis and host immune escape through antigenic variations. Knowledge of the genetic diversity of these antigens is essential for malaria control and effective vaccine development. In this study, we analysed the genetic diversity and evolutionary patterns of two fragments (DBL2X and DBL3X) of VAR2CSA gene and four vir genes (vir 4, vir 12, vir 21 and vir 27) from different endemic regions, including Southeast Asia and sub-Saharan Africa. High levels of segregating sites (S) and haplotype diversity (Hd) were observed in both var and vir genes. Among vir genes, vir 12 (S = 131, Hd = 0.996) and vir 21 (S = 171, Hd = 892) were found to be more diverse as compared to vir 4 (S = 11, Hd = 0.748) and vir 27 (S = 23, Hd = 0.814). DBL2X (S = 99, Hd = 0.996) and DBL3X (S = 307, Hd = 0.999) fragments showed higher genetic diversity. Our analysis indicates that var and vir genes are highly diverse and follow the similar evolutionary pattern globally. Some codons showed signatures of positive or negative selection pressure, but vir and var genes are likely to be under balancing selection. This study highlights the high variability of var and vir genes and underlines the need of functional experimental studies to determine the most relevant allelic forms for effective progress towards vaccine formulation and testing.</p>","PeriodicalId":94288,"journal":{"name":"Genomics & informatics","volume":"22 1","pages":"11"},"PeriodicalIF":0.0,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11247734/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141622092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信