Bioinformatics advances最新文献

筛选
英文 中文
AAclust: k-optimized clustering for selecting redundancy-reduced sets of amino acid scales. AAclust:用于选择减少冗余的氨基酸尺度集的 k 优化聚类。
IF 2.4
Bioinformatics advances Pub Date : 2024-10-30 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae165
Stephan Breimann, Dmitrij Frishman
{"title":"AAclust: <i>k</i>-optimized clustering for selecting redundancy-reduced sets of amino acid scales.","authors":"Stephan Breimann, Dmitrij Frishman","doi":"10.1093/bioadv/vbae165","DOIUrl":"10.1093/bioadv/vbae165","url":null,"abstract":"<p><strong>Summary: </strong>Amino acid scales are crucial for sequence-based protein prediction tasks, yet no gold standard scale set or simple scale selection methods exist. We developed AAclust, a wrapper for clustering models that require a pre-defined number of clusters <i>k</i>, such as <i>k</i>-means. AAclust obtains redundancy-reduced scale sets by clustering and selecting one representative scale per cluster, where <i>k</i> can either be optimized by AAclust or defined by the user. The utility of AAclust scale selections was assessed by applying machine learning models to 24 protein benchmark datasets. We found that top-performing scale sets were different for each benchmark dataset and significantly outperformed scale sets used in previous studies. Noteworthy is the strong dependence of the model performance on the scale set size. AAclust enables a systematic optimization of scale-based feature engineering in machine learning applications.</p><p><strong>Availability and implementation: </strong>The AAclust algorithm is part of AAanalysis, a Python-based framework for interpretable sequence-based protein prediction, which is documented and accessible at https://aaanalysis.readthedocs.io/en/latest and https://github.com/breimanntools/aaanalysis.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae165"},"PeriodicalIF":2.4,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11562964/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exon nomenclature and classification of transcripts database (ENACTdb): a resource for analyzing alternative splicing mediated proteome diversity. 外显子命名和转录本分类数据库(ENACTdb):分析替代剪接介导的蛋白质组多样性的资源。
IF 2.4
Bioinformatics advances Pub Date : 2024-10-29 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae157
Paras Verma, Deeksha Thakur, Shashi B Pandit
{"title":"Exon nomenclature and classification of transcripts database (ENACTdb): a resource for analyzing alternative splicing mediated proteome diversity.","authors":"Paras Verma, Deeksha Thakur, Shashi B Pandit","doi":"10.1093/bioadv/vbae157","DOIUrl":"10.1093/bioadv/vbae157","url":null,"abstract":"<p><strong>Motivation: </strong>Gene transcripts are distinguished by the composition of their exons, and this different exon composition may contribute to advancing proteome complexity. Despite the availability of alternative splicing information documented in various databases, a ready association of exonic variations to the protein sequence remains a mammoth task.</p><p><strong>Results: </strong>To associate exonic variation(s) with the protein systematically, we designed the Exon Nomenclature and Classification of Transcripts (ENACT) framework for uniquely annotating exons that tracks their loci in gene architecture context with encapsulating variations in splice site(s) and amino acid coding status. After ENACT annotation, predicted protein features (secondary structure/disorder/Pfam domains) are mapped to exon attributes. Thus, ENACTdb provides trackable exonic variation(s) association to isoform(s) and protein features, enabling the assessment of functional variation due to changes in exon composition. Such analyses can be readily performed through multiple views supported by the server. The exon-centric visualizations of ENACT annotated isoforms could provide insights on the functional repertoire of genes due to alternative splicing and its related processes and can serve as an important resource for the research community.</p><p><strong>Availability and implementation: </strong>The database is publicly available at https://www.iscbglab.in/enactdb/. It contains protein-coding genes and isoforms for <i>Caenorhabditis elegans</i>, <i>Drosophila melanogaster</i>, <i>Danio rerio</i>, <i>Mus musculus</i>, and <i>Homo sapiens</i>.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae157"},"PeriodicalIF":2.4,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11576355/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MicroNet-MIMRF: a microbial network inference approach based on mutual information and Markov random fields. MicroNet-MIMRF:基于互信息和马尔可夫随机场的微生物网络推断方法。
IF 2.4
Bioinformatics advances Pub Date : 2024-10-28 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae167
Chenqionglu Feng, Huiqun Jia, Hui Wang, Jiaojiao Wang, Mengxuan Lin, Xiaoyan Hu, Chenjing Yu, Hongbin Song, Ligui Wang
{"title":"MicroNet-MIMRF: a microbial network inference approach based on mutual information and Markov random fields.","authors":"Chenqionglu Feng, Huiqun Jia, Hui Wang, Jiaojiao Wang, Mengxuan Lin, Xiaoyan Hu, Chenjing Yu, Hongbin Song, Ligui Wang","doi":"10.1093/bioadv/vbae167","DOIUrl":"https://doi.org/10.1093/bioadv/vbae167","url":null,"abstract":"<p><strong>Motivation: </strong>The human microbiome, comprises complex associations and communication networks among microbial communities, which are crucial for maintaining health. The construction of microbial networks is vital for elucidating these associations. However, existing microbial networks inference methods cannot solve the issues of zero-inflation and non-linear associations. Therefore, necessitating novel methods to improve the accuracy of microbial networks inference.</p><p><strong>Results: </strong>In this study, we introduce the Microbial Network based on Mutual Information and Markov Random Fields (MicroNet-MIMRF) as a novel approach for inferring microbial networks. Abundance data of microbes are modeled through the zero-inflated Poisson distribution, and the discrete matrix is estimated for further calculation. Markov random fields based on mutual information are used to construct accurate microbial networks. MicroNet-MIMRF excels at estimating pairwise associations between microbes, effectively addressing zero-inflation and non-linear associations in microbial abundance data. It outperforms commonly used techniques in simulation experiments, achieving area under the curve values exceeding 0.75 for all parameters. A case study on inflammatory bowel disease data further demonstrates the method's ability to identify insightful associations. Conclusively, MicroNet-MIMRF is a powerful tool for microbial network inference that handles the biases caused by zero-inflation and overestimation of associations.</p><p><strong>Availability and implementation: </strong>The MicroNet-MIMRF is provided at https://github.com/Fionabiostats/MicroNet-MIMRF.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae167"},"PeriodicalIF":2.4,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11549015/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MetaboScope: a statistical toolbox for analyzing 1H nuclear magnetic resonance spectra from human clinical studies. MetaboScope:用于分析人体临床研究 1H 核磁共振谱的统计工具箱。
IF 2.4
Bioinformatics advances Pub Date : 2024-10-28 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae142
Ruey Leng Loo, Javier Osorio Mosquera, Michael Zasso, Jacqueline Mathews, Desmond G Johnston, Jeremy K Nicholson, Luc Patiny, Elaine Holmes, Julien Wist
{"title":"MetaboScope: a statistical toolbox for analyzing <sup>1</sup>H nuclear magnetic resonance spectra from human clinical studies.","authors":"Ruey Leng Loo, Javier Osorio Mosquera, Michael Zasso, Jacqueline Mathews, Desmond G Johnston, Jeremy K Nicholson, Luc Patiny, Elaine Holmes, Julien Wist","doi":"10.1093/bioadv/vbae142","DOIUrl":"10.1093/bioadv/vbae142","url":null,"abstract":"<p><strong>Motivation: </strong>Metabolic phenotyping, using high-resolution spectroscopic molecular fingerprints of biological samples, has demonstrated diagnostic, prognostic, and mechanistic value in clinical studies. However, clinical translation is hindered by the lack of viable workflows and challenges in converting spectral data into usable information.</p><p><strong>Results: </strong>MetaboScope is an analytical and statistical workflow for learning, designing and analyzing clinically relevant <sup>1</sup>H nuclear magnetic resonance data. It features modular preprocessing pipelines, multivariate modeling tools including Principal Components Analysis (PCA), Orthogonal-Projection to Latent Structure Discriminant Analysis (OPLS-DA), and biomarker discovery tools (multiblock PCA and statistical spectroscopy). A simulation tool is also provided, allowing users to create synthetic spectra for hypothesis testing and power calculations.</p><p><strong>Availability and implementation: </strong>MetaboScope is built as a pipeline where each module accepts the output generated by the previous one. This provides flexibility and simplicity of use, while being straightforward to maintain. The system and its libraries were developed in JavaScript and run as a web app; therefore, all the operations are performed on the local computer, circumventing the need to upload data. The MetaboScope tool is available at https://www.cheminfo.org/flavor/metabolomics/index.html. The code is open-source and can be deployed locally if necessary. Module notes, video tutorials, and clinical spectral datasets are provided for modeling.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae142"},"PeriodicalIF":2.4,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11576352/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Population-aware permutation-based significance thresholds for genome-wide association studies. 全基因组关联研究中基于人群感知的置换显著性阈值。
IF 2.4
Bioinformatics advances Pub Date : 2024-10-28 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae168
Maura John, Arthur Korte, Marco Todesco, Dominik G Grimm
{"title":"Population-aware permutation-based significance thresholds for genome-wide association studies.","authors":"Maura John, Arthur Korte, Marco Todesco, Dominik G Grimm","doi":"10.1093/bioadv/vbae168","DOIUrl":"10.1093/bioadv/vbae168","url":null,"abstract":"<p><strong>Motivation: </strong>Permutation-based significance thresholds have been shown to be a robust alternative to classical Bonferroni significance thresholds in genome-wide association studies (GWAS) for skewed phenotype distributions. The recently published method permGWAS introduced a batch-wise approach to efficiently compute permutation-based GWAS. However, running multiple univariate tests in parallel leads to many repetitive computations and increased computational resources. More importantly, traditional permutation methods that permute only the phenotype break the underlying population structure.</p><p><strong>Results: </strong>We propose permGWAS2, an improved method that does not break the population structure during permutations and uses an elegant block matrix decomposition to optimize computations, thereby reducing redundancies. We show on synthetic data that this improved approach yields a lower false discovery rate for skewed phenotype distributions compared to the previous version and the commonly used Bonferroni correction. In addition, we re-analyze a dataset covering phenotypic variation in 86 traits in a population of 615 wild sunflowers (<i>Helianthus annuus</i> L.). This led to the identification of dozens of novel associations with putatively adaptive traits, and removed several likely false-positive associations with limited biological support.</p><p><strong>Availability and implementation: </strong>permGWAS2 is open-source and publicly available on GitHub for download: https://github.com/grimmlab/permGWAS.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae168"},"PeriodicalIF":2.4,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639184/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
motifbreakR v2: expanded variant analysis including indels and integrated evidence from transcription factor binding databases. motifbreakR v2:扩展的变异分析,包括嵌合和来自转录因子结合数据库的综合证据。
IF 2.4
Bioinformatics advances Pub Date : 2024-10-23 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae162
Simon G Coetzee, Dennis J Hazelett
{"title":"<i>motifbreakR</i> v2: expanded variant analysis including indels and integrated evidence from transcription factor binding databases.","authors":"Simon G Coetzee, Dennis J Hazelett","doi":"10.1093/bioadv/vbae162","DOIUrl":"https://doi.org/10.1093/bioadv/vbae162","url":null,"abstract":"<p><strong>Motivation: </strong><i>motifbreakR</i> scans genetic variants against position weight matrices of transcription factors (TFs) to determine the potential for the disruption of binding at the site of the variant. It leverages the Bioconductor suite of software packages and annotations to query a diverse array of genomes and motif databases. Initially developed to interrogate the effect of single-nucleotide variants on TF binding sites, in <i>motifbreakR</i> v2, we have updated the functionality.</p><p><strong>Results: </strong>New features include the ability to query other types of complex genetic variants, such as short insertions and deletions. This capability allows modeling a more extensive array of variants that may have significant effects on TF binding. Additionally, predictions based on sequence preference alone can indicate many more potential binding events than observed. Adding information from DNA-binding sequencing datasets lends confidence to motif disruption prediction by demonstrating TF binding in cell lines and tissue types. Therefore, <i>motifbreakR can directly query</i> the ReMap2022 database for evidence that a TF matching the disrupted motif binds over the disrupting variant. Finally, in <i>motifbreakR</i>, in addition to the existing interface, we implemented an R/Shiny graphical user interface to simplify and enhance access to researchers with different skill sets.</p><p><strong>Availability and implementation: </strong><i>motifbreakR</i> is implemented in R. Source code, documentation, and tutorials are available on Bioconductor at https://bioconductor.org/packages/release/bioc/html/motifbreakR.html and GitHub at https://github.com/Simon-Coetzee/motifBreakR.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae162"},"PeriodicalIF":2.4,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11520234/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142549260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TransAnnot-a fast transcriptome annotation pipeline. TransAnnot--快速转录组注释管道。
IF 2.4
Bioinformatics advances Pub Date : 2024-10-22 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae152
Mariia Zelenskaia, Yazhini Arangasamy, Milot Mirdita, Johannes Söding, Venket Raghavan
{"title":"TransAnnot-a fast transcriptome annotation pipeline.","authors":"Mariia Zelenskaia, Yazhini Arangasamy, Milot Mirdita, Johannes Söding, Venket Raghavan","doi":"10.1093/bioadv/vbae152","DOIUrl":"10.1093/bioadv/vbae152","url":null,"abstract":"<p><strong>Summary: </strong>The annotation of deeply sequenced, <i>de novo</i> assembled transcriptomes continues to be a challenge as some of the state-of-the-art tools are slow, difficult to install, and hard to use. We have tackled these issues with TransAnnot, a fast, automated transcriptome annotation pipeline that is easy to install and use. Leveraging the fast sequence searches provided by the MMseqs2 suite, TransAnnot offers one-step annotation of homologs from Swiss-Prot, gene ontology terms and orthogroups from eggNOG, and functional domains from Pfam. Users also have the option to annotate against custom databases. TransAnnot accepts sequencing reads (short and long), nucleotide sequences, or amino acid sequences as input for annotation. When benchmarked with test data sets of amino acid sequences, TransAnnot was 333, 284, and 18 times faster than comparable tools such as EnTAP, Trinotate, and eggNOG-mapper respectively.</p><p><strong>Availability and implementation: </strong>TransAnnot is free to use, open sourced under GPLv3, and is implemented in C++ and Bash. Source code, documentation, and pre-compiled binaries are available at https://github.com/soedinglab/transannot. TransAnnot is also available via bioconda (https://anaconda.org/bioconda/transannot).</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae152"},"PeriodicalIF":2.4,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11530227/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142570211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ProCogGraph: a graph-based mapping of cognate ligand domain interactions. ProCogGraph:基于图形的同源配体结构域相互作用图谱。
IF 2.4
Bioinformatics advances Pub Date : 2024-10-22 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae161
Matthew Crown, Matthew Bashton
{"title":"ProCogGraph: a graph-based mapping of cognate ligand domain interactions.","authors":"Matthew Crown, Matthew Bashton","doi":"10.1093/bioadv/vbae161","DOIUrl":"10.1093/bioadv/vbae161","url":null,"abstract":"<p><strong>Motivation: </strong>Mappings of domain-cognate ligand interactions can enhance our understanding of the core concepts of evolution and be used to aid docking and protein design. Since the last available cognate-ligand domain database was released, the PDB has grown significantly and new tools are available for measuring similarity and determining contacts.</p><p><strong>Results: </strong>We present ProCogGraph, a graph database of cognate-ligand domain mappings in PDB structures. Building upon the work of the predecessor database, PROCOGNATE, we use data-driven approaches to develop thresholds and interaction modes. We explore new aspects of domain-cognate ligand interactions, including the chemical similarity of bound cognate ligands and how domain combinations influence cognate ligand binding. Finally, we use the graph to add specificity to partial EC IDs, showing that ProCogGraph can complete partial annotations systematically through assigned cognate ligands.</p><p><strong>Availability and implementation: </strong>The ProCogGraph pipeline, database and flat files are available at https://github.com/bashton-lab/ProCogGraph and https://doi.org/10.5281/zenodo.13165851.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae161"},"PeriodicalIF":2.4,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561043/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two subtle problems with overrepresentation analysis. 过度代表性分析存在两个微妙的问题。
IF 2.4
Bioinformatics advances Pub Date : 2024-10-21 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae159
Mark Ziemann, Barry Schroeter, Anusuiya Bora
{"title":"Two subtle problems with overrepresentation analysis.","authors":"Mark Ziemann, Barry Schroeter, Anusuiya Bora","doi":"10.1093/bioadv/vbae159","DOIUrl":"10.1093/bioadv/vbae159","url":null,"abstract":"<p><strong>Motivation: </strong>Overrepresentation analysis (ORA) is used widely to assess the enrichment of functional categories in a gene list compared to a background list. ORA is therefore a critical method in the interpretation of 'omics data, relating gene lists to biological functions and themes. Although ORA is hugely popular, we and others have noticed two potentially undesired behaviours of some ORA tools. The first one we call the 'background problem', because it involves the software eliminating large numbers of genes from the background list if they are not annotated as belonging to any category. The second one we call the 'false discovery rate problem', because some tools underestimate the true number of parallel tests conducted.</p><p><strong>Results: </strong>Here, we demonstrate the impact of these issues on several real RNA-seq datasets and use simulated RNA-seq data to quantify the impact of these problems. We show that the severity of these problems depends on the gene set library, the number of genes in the list, and the degree of noise in the dataset. These problems can be mitigated by changing packages/websites for ORA or by changing to another approach such as functional class scoring.</p><p><strong>Availability and implementation: </strong>An R/Shiny tool has been provided at https://oratool.ziemann-lab.net/ and the supporting materials are available from Zenodo (https://zenodo.org/records/13823301).</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae159"},"PeriodicalIF":2.4,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11557902/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting 'pain genes': multi-modal data integration using probabilistic classifiers and interaction networks. 预测 "疼痛基因":利用概率分类器和交互网络进行多模态数据整合。
IF 2.4
Bioinformatics advances Pub Date : 2024-10-18 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae156
Na Zhao, David L Bennett, Georgios Baskozos, Allison M Barry
{"title":"Predicting 'pain genes': multi-modal data integration using probabilistic classifiers and interaction networks.","authors":"Na Zhao, David L Bennett, Georgios Baskozos, Allison M Barry","doi":"10.1093/bioadv/vbae156","DOIUrl":"10.1093/bioadv/vbae156","url":null,"abstract":"<p><strong>Motivation: </strong>Accurate identification of pain-related genes remains challenging due to the complex nature of pain pathophysiology and the subjective nature of pain reporting in humans. Here, we use machine learning to identify possible 'pain genes'. Labelling was based on a gold-standard list with validated involvement across pain conditions, and was trained on a selection of -omics, protein-protein interaction network features, and biological function readouts for each gene.</p><p><strong>Results: </strong>The top-performing model was selected to predict a 'pain score' per gene. The top-ranked genes were then validated against pain-related human SNPs. Functional analysis revealed JAK2/STAT3 signal, ErbB, and Rap1 signalling pathways as promising targets for further exploration, while network topological features contribute significantly to the identification of 'pain' genes. As such, a network based on top-ranked genes was constructed to reveal previously uncharacterized pain-related genes. Together, these novel insights into pain pathogenesis can indicate promising directions for future experimental research.</p><p><strong>Availability and implementation: </strong>These analyses can be further explored using the linked open-source database at https://livedataoxford.shinyapps.io/drg-directory/, which is accompanied by a freely accessible code template and user guide for wider adoption across disciplines.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae156"},"PeriodicalIF":2.4,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11549022/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信