Victor A Tagliacollo, Milton Tan, Roberto E Reis, Ronielson Gaia, Virgilio Carrijo, Marcelo Ranuzi, Jack M Craig, James S Albert
{"title":"Time-calibrated phylogeny of neotropical freshwater fishes.","authors":"Victor A Tagliacollo, Milton Tan, Roberto E Reis, Ronielson Gaia, Virgilio Carrijo, Marcelo Ranuzi, Jack M Craig, James S Albert","doi":"10.3389/fbinf.2024.1433995","DOIUrl":"10.3389/fbinf.2024.1433995","url":null,"abstract":"<p><p>Neotropical Freshwater Fish (NFF) fauna exhibits the greatest phenotypic disparity and species richness among all continental aquatic vertebrate faunas, with more than 6,345 species distributed across the mostly tropical regions of Central and South America. The last two decades have seen a proliferation of molecular phylogenies, often at the species level, covering almost all 875 valid NFF genera. This study presents the most comprehensive genome-wide, time-calibrated phylogenetic hypothesis of NFF species to date, based on DNA sequences generated over decades through the collaborative efforts of the multinational ichthyological research community. Our purpose is to build and curate an extensive molecular dataset allowing researchers to evaluate macroevolutionary hypotheses in the NFF while facilitating continuous refinement and expansion. Using thousands of DNA sequences from dozens of studies, we compiled a supermatrix of 51 markers for 5,984 taxa, representing 3,167 NFF species. Based on this dataset, we built the most species-rich time-calibrated phylogeny of the NFF taxa to date, summarizing the collective efforts of the ichthyological research community since the midpoint of the last century. We provide a summary review of this remarkable evolutionary history and hope this dataset provides a framework for forthcoming studies of the NFF fauna, documenting compelling, emergent patterns in the world's most diverse continental vertebrate fauna.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1433995"},"PeriodicalIF":2.8,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11659281/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142877796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Benchmarking of Hi-C tools for scaffolding plant genomes obtained from PacBio HiFi and ONT reads.","authors":"Lia Obinu, Urmi Trivedi, Andrea Porceddu","doi":"10.3389/fbinf.2024.1462923","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1462923","url":null,"abstract":"<p><p>The implementation of Hi-C reads in the <i>de novo</i> genome assembly process allows the ordering of large regions of the genome in scaffolds and the generation of chromosome-level assemblies. Several bioinformatics tools have been developed for genome scaffolding with Hi-C, and each tool has advantages and disadvantages that need to be carefully evaluated before their adoption. We generated two <i>de novo</i> assemblies of <i>Arabidopsis thaliana</i> obtained from the same raw PacBio HiFi and Oxford Nanopore Technologies data. We scaffolded the assemblies implementing Hi-C reads with the scaffolders 3D-DNA, SALSA2, and YaHS, with the aim of identifying the tool providing the most accurate assembly. The scaffolded assemblies were evaluated according to contiguity, completeness, accuracy, and structural correctness. In our analysis, YaHS proved to be the best-performing bioinformatics tool for scaffolding <i>de novo</i> genome assemblies in <i>Arabidopsis thaliana</i>.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1462923"},"PeriodicalIF":2.8,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11604747/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142775052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Climatic factors, but not geographic distance, promote genetic structure and differentiation of Cleistogenes squarrosa (Trin.) Keng populations.","authors":"Ruyan Song, Xueli Zhang, Zhuo Zhang, Chan Zhou","doi":"10.3389/fbinf.2024.1454689","DOIUrl":"10.3389/fbinf.2024.1454689","url":null,"abstract":"<p><p>Climate can shape plant genetic diversity and genetic structure, and genetic diversity and genetic structure can reflect the adaptation of plants to climate change. We used rbcl and trnL-trnF sequences to analyze the genetic diversity and genetic structure of <i>C. squarrosa</i> under the influence of different environmental factors in Inner Mongolia grassland. The results showed that the genetic diversity of this species was low. (The trnL-trnF sequences have higher genetic diversity than rbcl sequences.) <i>C. squarrosa</i> had low genetic diversity compared to other prairie plants, but had a more pronounced genetic structure. The haplotype network diagram of the combined sequences could be divided into two categories, and the results of the NJ, MP, and ML trees also showed that the haplotypes were divided into two branches. The results of genetic structure analysis showed that that the populations located in the desert steppe fall into exactly one cluster, and the populations located in the typical steppe fall into exactly another cluster. The neutrality tests were all negative and the mismatch distribution also showed a single peak across the population, suggesting that <i>C. squarrosa</i> had undergone population expansion and was well adapted to the local environment. The results of the mantel test showed that climate had a greater influence on the genetic distance of <i>C. squarrosa</i>, with annual precipitation having a higher influence than mean annual temperature. This study provided basic genetic information on the genetic structure of <i>C. squarrosa</i> and contributes to the study of genetic adaptation mechanisms in grassland plants.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1454689"},"PeriodicalIF":2.8,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11599168/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142741602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Charlene Z L Ong, N Jannah M Nasir, Roy E Welsch, Lisa Tucker-Kellogg, Jagath C Rajapakse
{"title":"Quantification of muscle fiber malformations using edge detection to investigate chronic muscle pressure ulcers.","authors":"Charlene Z L Ong, N Jannah M Nasir, Roy E Welsch, Lisa Tucker-Kellogg, Jagath C Rajapakse","doi":"10.3389/fbinf.2024.1450146","DOIUrl":"10.3389/fbinf.2024.1450146","url":null,"abstract":"<p><strong>Background: </strong>Microscopy of regenerated tissue shows different morphologies between the healing of acute wounds and chronic wounds. This difference can be seen manually by biologists, but computational methods are needed to automate the characterization of morphology and regenerative quality in regenerated muscle tissue.</p><p><strong>Results: </strong>From the detected edge segments, we computed several imaging biomarkers of interest, such as median tortuosity, number of edge segments normalized by area, median edge segment distance and interquartile range of orientation angles of edge segments of the microscope images of successful and unsuccessful muscle regeneration. We observed that muscle fibers in saline-treated pressure ulcers had a larger interquartile range of orientation angles of the edge segments (p = 0.05) and shorter edge segment distances (p = 0.003) compared to those of acute cardiotoxin injuries.</p><p><strong>Conclusion: </strong>Our edge detection method was able to identify statistically significant differences in some of the imaging biomarkers between saline-treated pressure ulcers and cardiotoxin injuries, suggesting that chronic pressure ulcers have increased muscle fiber malformations compared to cardiotoxin injuries.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1450146"},"PeriodicalIF":2.8,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11532102/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142577383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Buzhong Zhang, Meili Zheng, Yuzhou Zhang, Lijun Quan
{"title":"DCMA: faster protein backbone dihedral angle prediction using a dilated convolutional attention-based neural network.","authors":"Buzhong Zhang, Meili Zheng, Yuzhou Zhang, Lijun Quan","doi":"10.3389/fbinf.2024.1477909","DOIUrl":"10.3389/fbinf.2024.1477909","url":null,"abstract":"<p><p>The dihedral angle of the protein backbone can describe the main structure of the protein, which is of great significance for determining the protein structure. Many computational methods have been proposed to predict this critically important protein structure, including deep learning. However, these heavyweight methods require more computational resources, and the training time becomes intolerable. In this article, we introduce a novel lightweight method, named dilated convolution and multi-head attention (DCMA), that predicts protein backbone torsion dihedral angles <math><mrow><mo>(</mo> <mrow><mi>ϕ</mi> <mo>,</mo> <mi>ψ</mi></mrow> <mo>)</mo></mrow> </math> . DCMA is stacked by five layers of two hybrid inception blocks and one multi-head attention block (I2A1) module. The hybrid inception blocks consisting of multi-scale convolutional neural networks and dilated convolutional neural networks are designed for capturing local and long-range sequence-based features. The multi-head attention block supplementally strengthens this operation. The proposed DCMA is validated on public critical assessment of protein structure prediction (CASP) benchmark datasets. Experimental results show that DCMA obtains better or comparable generalization performance. Compared to best-so-far methods, which are mostly ensemble models and constructed of recurrent neural networks, DCMA is an individual model that is more lightweight and has a shorter training time. The proposed model could be applied as an alternative method for predicting other protein structural features.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1477909"},"PeriodicalIF":2.8,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11527783/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142570495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Juan Segura-Vega, Allan González-Herrera, Ramón Molina-Bravo, Stefany Solano-González
{"title":"Computational identification and characterization of chitinase 1 and chitinase 2 from neotropical isolates of <i>Beauveria bassiana</i>.","authors":"Juan Segura-Vega, Allan González-Herrera, Ramón Molina-Bravo, Stefany Solano-González","doi":"10.3389/fbinf.2024.1434442","DOIUrl":"10.3389/fbinf.2024.1434442","url":null,"abstract":"<p><strong>Background: </strong>The fungus <i>Beauveria bassiana</i> is widely used for agronomical applications, mainly in biological control. <i>B. bassiana</i> uses chitinase enzymes to degrade chitin, a major chemical component found in insect exoskeletons and fungal cell walls. However, until recently, genomic information on neotropical isolates, as well as their metabolic and biotechnological potential, has been limited.</p><p><strong>Methods: </strong>Eight complete <i>B. bassiana</i> genomes of Neotropical origin and three references were studied to identify chitinase genes and its corresponding proteins, which were curated and characterized using manual curation and computational tools. We conducted a computational study to highlight functional differences and similarities for chitinase proteins in these Neotropical isolates.</p><p><strong>Results: </strong>Eleven chitinase 1 genes were identified, categorized as chitinase 1.1 and chitinase 1.2. Five chitinase 2 genes were identified but presented a higher sequence conservation across all sequences. Interestingly, physicochemical parameters were more similar between chitinase 1.1 and chitinase 2 than between chitinase 1.1 and 1.2.</p><p><strong>Conclusion: </strong>Chitinases 1 and 2 demonstrated variations, especially within chitinase 1, which presented a potential paralog. These differences were observed in their physical parameters. Additionally, CHIT2 completely lacks a signal peptide. This implies that CHIT1 might be associated with infection processes, while CHIT2 could be involved in morphogenesis and cellular growth. Therefore, our work highlights the importance of computational studies on local isolates, providing valuable resources for further experimental validation. Intrinsic changes within local species can significantly impact our understanding of complex pathogen-host interactions and offer practical applications, such as biological control.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1434442"},"PeriodicalIF":2.8,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11527780/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142570494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Identification of novel drug targets for <i>Helicobacter pylori</i>: structure-based virtual screening of potential inhibitors against DAH7PS protein involved in the shikimate pathway.","authors":"Narjes Noori Goodarzi, Mahshid Khazani Asforooshani, Behzad Shahbazi, Nayereh Rezaie Rahimi, Farzad Badmasti","doi":"10.3389/fbinf.2024.1482338","DOIUrl":"10.3389/fbinf.2024.1482338","url":null,"abstract":"<p><strong>Background: </strong><i>Helicobacter pylori</i>, a bacterium associated with severe gastrointestinal diseases and malignancies, poses a significant challenge because of its increasing antibiotic resistance rates. This study aimed to identify potential drug targets and inhibitors against <i>H. pylori</i> using a structure-based virtual screening (SBVS) approach.</p><p><strong>Methods: </strong>Core-proteome analysis of 132 <i>H. pylori</i> genomes was performed using the EDGAR database. Essential genes were identified and human and gut microbiota homolog proteins were excluded. The DAH7PS protein involved in the shikimate pathway was selected for the structure-based virtual screening (SBVS) approach. The tertiary structure of the protein was predicted through homology modeling (based on PDB ID: 5UXM). Molecular docking was performed to identify potential inhibitors of DAH7PS among StreptomeDB compounds using the AutoDock Vina tool. Molecular dynamics (MD) simulations assessed the stability of DAH7PS-ligand complexes. The complexes were further evaluated in terms of their binding affinity, Lipinski's Rule of Five, and ADMET properties.</p><p><strong>Results: </strong>A total of 54 novel drug targets with desirable properties were identified. DAH7PS was selected for further investigation, and virtual screening of StreptomeDB compounds yielded 36 high-affinity binding of the ligands. Two small molecules, 6,8-Dihydroxyisocoumarin-3-carboxylic acid and Epicatechin, also showed favorable RO5 and ADMET properties. MD simulations confirmed the stability and reliability of DAH7PS-ligand complexes, indicating their potential as inhibitors.</p><p><strong>Conclusion: </strong>This study identified 54 novel drug targets against <i>H. pylori</i>. The DAH7PS protein as a promising drug target was evaluated using a computer-aided drug design. 6,8-Dihydroxyisocoumarin-3-carboxylic acid and Epicatechin demonstrated desirable properties and stable interactions, highlighting their potential to inhibit DAH7PS as an essential protein. Undoubtedly, more experimental validations are needed to advance these findings into practical therapies for treating drug-resistant <i>H. pylori</i>.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1482338"},"PeriodicalIF":2.8,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11527725/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142570497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Editorial: Women in bioinformatics.","authors":"Irma Martínez-Flores, Constanza Cárdenas Carvajal, Viviana Monje-Galvan","doi":"10.3389/fbinf.2024.1499514","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1499514","url":null,"abstract":"","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1499514"},"PeriodicalIF":2.8,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11494442/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ruslan Kalendar, Alexandr Shevtsov, Zhenis Otarbay, Aisulu Ismailova
{"title":"<i>In silico</i> PCR analysis: a comprehensive bioinformatics tool for enhancing nucleic acid amplification assays.","authors":"Ruslan Kalendar, Alexandr Shevtsov, Zhenis Otarbay, Aisulu Ismailova","doi":"10.3389/fbinf.2024.1464197","DOIUrl":"10.3389/fbinf.2024.1464197","url":null,"abstract":"<p><p>Nucleic acid amplification assays represent a pivotal category of methodologies for targeted sequence detection within contemporary biological research, boasting diverse utility in diagnostics, identification, and DNA sequencing. The foundational principles of these assays have been extrapolated to various simple and intricate nucleic acid amplification technologies. Concurrently, a burgeoning trend toward computational or virtual methodologies is exemplified by <i>in silico</i> PCR analysis. <i>In silico</i> PCR analysis is a valuable and productive adjunctive approach for ensuring primer or probe specificity across a broad spectrum of PCR applications encompassing gene discovery through homology analysis, molecular diagnostics, DNA profiling, and repeat sequence identification. The prediction of primer and probe sensitivity and specificity necessitates thorough database searches, accounting for an optimal balance of mismatch tolerance, sequence similarity, and thermal stability. This software facilitates <i>in silico</i> PCR analyses of both linear and circular DNA templates, including bisulfited treatment DNA, enabling multiple primer or probe searches within databases of varying scales alongside advanced search functionalities. This tool is suitable for processing batch files and is essential for automation when working with large amounts of data.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1464197"},"PeriodicalIF":2.8,"publicationDate":"2024-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11491563/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142485996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Discovery of plasma biomarkers related to blood-brain barrier dysregulation in Alzheimer's disease.","authors":"Yuet Ruh Dan, Keng-Hwee Chiam","doi":"10.3389/fbinf.2024.1463001","DOIUrl":"10.3389/fbinf.2024.1463001","url":null,"abstract":"<p><strong>Introduction: </strong>Blood-based biomarkers are quantitative, non-invasive diagnostic tools. This study aimed to identify candidate biomarkers for Alzheimer's disease (AD) using publicly available omics datasets, using the hypothesis that with blood-brain barrier dysfunction in AD, brain-synthesized proteins can leak into plasma for detection.</p><p><strong>Methods: </strong>Differential abundance results of plasma and brain proteomic datasets were integrated to obtain a list of potential biomarkers. Biological validity was investigated with intercellular communication and gene regulatory analyses on brain single-cell transcriptomics data.</p><p><strong>Results: </strong>Five proteins (APOD, B2M, CFH, CLU, and C3) fit biomarker criteria. 4 corresponding transcripts (APOD, B2M, CLU, and C3) were overexpressed in AD astrocytes, mediated by AD-related dysregulations in transcription factors regulating neuroinflammation. Additionally, CLU specifically induced downstream expression of neuronal death genes.</p><p><strong>Discussion: </strong>In conclusion, a 5-protein panel is shown to effectively identify AD patients, with evidence of disease specificity and biological validity. Future research should investigate the mechanism of protein leakage through the blood-brain barrier.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1463001"},"PeriodicalIF":2.8,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11487119/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142482320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}