BMC Bioinformatics最新文献

筛选
英文 中文
MISDP: multi-task fusion visit interval for sequential diagnosis prediction.
IF 2.9 3区 生物学
BMC Bioinformatics Pub Date : 2024-12-20 DOI: 10.1186/s12859-024-05998-x
Shengrong Zhu, Ruijia Yang, Zifeng Pan, Xuan Tian, Hong Ji
{"title":"MISDP: multi-task fusion visit interval for sequential diagnosis prediction.","authors":"Shengrong Zhu, Ruijia Yang, Zifeng Pan, Xuan Tian, Hong Ji","doi":"10.1186/s12859-024-05998-x","DOIUrl":"https://doi.org/10.1186/s12859-024-05998-x","url":null,"abstract":"<p><strong>Backgrounds: </strong>Diagnostic prediction is a central application that spans various medical specialties and scenarios, sequential diagnosis prediction is the process of predicting future diagnoses based on patients' historical visits. Prior research has underexplored the impact of irregular intervals between patient visits on predictive models, despite its significance.</p><p><strong>Method: </strong>We developed the Multi-task Fusion Visit Interval for Sequential Diagnosis Prediction (MISDP) framework to address this research gap. The MISDP framework integrated sequential diagnosis prediction with visit interval prediction within a multi-task learning paradigm. It uses positional encoding and interval encoding to handle irregular patient visit intervals. Furthermore, it incorporates historical attention residue to enhance the multi-head self-attention mechanism, focusing on extracting long-term dependencies from clinical historical visits.</p><p><strong>Results: </strong>The MISDP model exhibited superior performance across real-world healthcare dataset, irrespective of the training data scarcity or abundance. With only 20% training data, MISDP achieved a 4. 2% improvement over KAME; when training data ranged from 60 to 80%, MISDP surpassed SETOR, the top baseline, by 0. 8% in accuracy, underscoring its robustness and efficacy in sequential diagnosis prediction task.</p><p><strong>Conclusions: </strong>The MISDP model significantly improves the accuracy of Sequential Diagnosis Prediction. The result highlights the advantage of multi-task learning in synergistically enhancing the performance of individual sub-task. Notably, irregular visit interval factors and historical attention residue has been particularly instrumental in refining the precision of sequential diagnosis prediction, suggesting a promising avenue for advancing clinical decision-making through data-driven modeling approaches.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"387"},"PeriodicalIF":2.9,"publicationDate":"2024-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142871119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of miRNA-disease associations based on PCA and cascade forest.
IF 2.9 3区 生物学
BMC Bioinformatics Pub Date : 2024-12-19 DOI: 10.1186/s12859-024-05999-w
Chuanlei Zhang, Yubo Li, Yinglun Dong, Wei Chen, Changqing Yu
{"title":"Prediction of miRNA-disease associations based on PCA and cascade forest.","authors":"Chuanlei Zhang, Yubo Li, Yinglun Dong, Wei Chen, Changqing Yu","doi":"10.1186/s12859-024-05999-w","DOIUrl":"https://doi.org/10.1186/s12859-024-05999-w","url":null,"abstract":"<p><strong>Background: </strong>As a key non-coding RNA molecule, miRNA profoundly affects gene expression regulation and connects to the pathological processes of several kinds of human diseases. However, conventional experimental methods for validating miRNA-disease associations are laborious. Consequently, the development of efficient and reliable computational prediction models is crucial for the identification and validation of these associations.</p><p><strong>Results: </strong>In this research, we developed the PCACFMDA method to predict the potential associations between miRNAs and diseases. To construct a multidimensional feature matrix, we consider the fusion similarities of miRNA and disease and miRNA-disease pairs. We then use principal component analysis(PCA) to reduce data complexity and extract low-dimensional features. Subsequently, a tuned cascade forest is used to mine the features and output prediction scores deeply. The results of the 5-fold cross-validation using the HMDD v2.0 database indicate that the PCACFMDA algorithm achieved an AUC of 98.56%. Additionally, we perform case studies on breast, esophageal and lung neoplasms. The findings revealed that the top 50 miRNAs most strongly linked to each disease have been validated.</p><p><strong>Conclusions: </strong>Based on PCA and optimized cascade forests, we propose the PCACFMDA model for predicting undiscovered miRNA-disease associations. The experimental results demonstrate superior prediction performance and commendable stability. Consequently, the PCACFMDA is a potent instrument for in-depth exploration of miRNA-disease associations.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"386"},"PeriodicalIF":2.9,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142862959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spall: accurate and robust unveiling cellular landscapes from spatially resolved transcriptomics data using a decomposition network.
IF 2.9 3区 生物学
BMC Bioinformatics Pub Date : 2024-12-18 DOI: 10.1186/s12859-024-06003-1
Zhongning Jiang, Wei Huang, Raymond H W Lam, Wei Zhang
{"title":"Spall: accurate and robust unveiling cellular landscapes from spatially resolved transcriptomics data using a decomposition network.","authors":"Zhongning Jiang, Wei Huang, Raymond H W Lam, Wei Zhang","doi":"10.1186/s12859-024-06003-1","DOIUrl":"https://doi.org/10.1186/s12859-024-06003-1","url":null,"abstract":"<p><p>Recent developments in spatially resolved transcriptomics (SRT) enable the characterization of spatial structures for different tissues. Many decomposition methods have been proposed to depict the cellular distribution within tissues. However, existing computational methods struggle to balance spatial continuity in cell distribution with the preservation of cell-specific characteristics. To address this, we propose Spall, a novel decomposition network that integrates scRNA-seq data with SRT data to accurately infer cell type proportions. Spall introduced the GATv2 module, featuring a flexible dynamic attention mechanism to capture relationships between spots. This improves the identification of cellular distribution patterns in spatial analysis. Additionally, Spall incorporates skip connections to address the loss of cell-specific information, thereby enhancing the prediction capability for rare cell types. Experimental results show that Spall outperforms the state-of-the-art methods in reconstructing cell distribution patterns on multiple datasets. Notably, Spall reveals tumor heterogeneity in human pancreatic ductal adenocarcinoma samples and delineates complex tissue structures, such as the laminar organization of the mouse cerebral cortex and the mouse cerebellum. These findings highlight the ability of Spall to provide reliable low-dimensional embeddings for downstream analyses, offering new opportunities for deciphering tissue structures.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"379"},"PeriodicalIF":2.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DeepMiRBP: a hybrid model for predicting microRNA-protein interactions based on transfer learning and cosine similarity.
IF 2.9 3区 生物学
BMC Bioinformatics Pub Date : 2024-12-18 DOI: 10.1186/s12859-024-05985-2
Sasan Azizian, Juan Cui
{"title":"DeepMiRBP: a hybrid model for predicting microRNA-protein interactions based on transfer learning and cosine similarity.","authors":"Sasan Azizian, Juan Cui","doi":"10.1186/s12859-024-05985-2","DOIUrl":"https://doi.org/10.1186/s12859-024-05985-2","url":null,"abstract":"<p><strong>Background: </strong>Interactions between microRNAs and RNA-binding proteins are crucial for microRNA-mediated gene regulation and sorting. Despite their significance, the molecular mechanisms governing these interactions remain underexplored, apart from sequence motifs identified on microRNAs. To date, only a limited number of microRNA-binding proteins have been confirmed, typically through labor-intensive experimental procedures. Advanced bioinformatics tools are urgently needed to facilitate this research.</p><p><strong>Methods: </strong>We present DeepMiRBP, a novel hybrid deep learning model specifically designed to predict microRNA-binding proteins by modeling molecular interactions. This innovation approach is the first to target the direct interactions between small RNAs and proteins. DeepMiRBP consists of two main components. The first component employs bidirectional long short-term memory (Bi-LSTM) neural networks to capture sequential dependencies and context within RNA sequences, attention mechanisms to enhance the model's focus on the most relevant features and transfer learning to apply knowledge gained from a large dataset of RNA-protein binding sites to the specific task of predicting microRNA-protein interactions. Cosine similarity is applied to assess RNA similarities. The second component utilizes Convolutional Neural Networks (CNNs) to process the spatial data inherent in protein structures based on Position-Specific Scoring Matrices (PSSM) and contact maps to generate detailed and accurate representations of potential microRNA-binding sites and assess protein similarities.</p><p><strong>Results: </strong>DeepMiRBP achieved a prediction accuracy of 87.4% during training and 85.4% using testing, with an F score of 0.860. Additionally, we validated our method using three case studies, focusing on microRNAs such as miR-451, -19b, -23a, -21, -223, and -let-7d. DeepMiRBP successfully predicted known miRNA interactions with recently discovered RNA-binding proteins, including AGO, YBX1, and FXR2, identified in various exosomes.</p><p><strong>Conclusions: </strong>Our proposed DeepMiRBP strategy represents the first of its kind designed for microRNA-protein interaction prediction. Its promising performance underscores the model's potential to uncover novel interactions critical for small RNA sorting and packaging, as well as to infer new RNA transporter proteins. The methodologies and insights from DeepMiRBP offer a scalable template for future small RNA research, from mechanistic discovery to modeling disease-related cell-to-cell communication, emphasizing its adaptability and potential for developing novel small RNA-centric therapeutic interventions and personalized medicine.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"381"},"PeriodicalIF":2.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Loop-mediated isothermal amplification assays for the detection of antimicrobial resistance elements in Vibrio cholera.
IF 2.9 3区 生物学
BMC Bioinformatics Pub Date : 2024-12-18 DOI: 10.1186/s12859-024-06001-3
Daniel Antonio Negrón, Shipra Trivedi, Nicholas Tolli, David Ashford, Gabrielle Melton, Stephanie Guertin, Katharine Jennings, Bryan D Necciai, Shanmuga Sozhamannan, Bradley W Abramson
{"title":"Loop-mediated isothermal amplification assays for the detection of antimicrobial resistance elements in Vibrio cholera.","authors":"Daniel Antonio Negrón, Shipra Trivedi, Nicholas Tolli, David Ashford, Gabrielle Melton, Stephanie Guertin, Katharine Jennings, Bryan D Necciai, Shanmuga Sozhamannan, Bradley W Abramson","doi":"10.1186/s12859-024-06001-3","DOIUrl":"https://doi.org/10.1186/s12859-024-06001-3","url":null,"abstract":"<p><strong>Background: </strong>The bacterium Vibrio cholerae causes diarrheal illness and can acquire genetic material leading to multiple drug resistance (MDR). Rapid detection of resistance-conferring mobile genetic elements helps avoid the prescription of ineffective antibiotics for specific strains. Colorimetric loop-mediated isothermal amplification (LAMP) assays provide a rapid and cost-effective means for detection at point-of-care since they do not require specialized equipment, require limited expertise to perform, and can take less than 30 min to perform in resource limited regions. LAMP output is a color change that can be viewed by eye, but it can be difficult to design primer sets, determine target specificity, and interpret subjective color changes.</p><p><strong>Methods: </strong>We developed an algorithm for the in silico design and evaluation of LAMP assays within the open-source PCR Signature Erosion Tool (PSET) and a computer vision application for the quantitative analysis of colorimetric outputs. First, Primer3 calculates LAMP primer sequence candidates with settings based on GC-content optimization. Next, PSET aligns the primer sequences of each assay against large sequence databases to calculate sufficient sequence similarity, coverage, and primer arrangement to the intended taxa, ultimately generating a confusion matrix. Finally, we tested assay candidates in the laboratory against synthetic constructs.</p><p><strong>Results: </strong>As an example, we generated new LAMP assays targeting drug resistance in V. cholerae and evaluated existing ones from the literature based on in silico target specificity and in vitro testing. Improvements in the design and testing of LAMP assays, with heightened target specificity and a simple analysis platform, increase utility for in-field applications. Overall, 9 of the 16 tested LAMP assays had positive signal through visual and computer vision-based detection methods developed here. Here we show LAMP assays tested on synthetic AMR gene targets for aph(6), varG, floR, qnrVC5, and almG, which allow for resistance to aminoglycosides, penicillins, carbapenems, phenicols, fluoroquinolones, and polymyxins respectively.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"384"},"PeriodicalIF":2.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MinLinMo: a minimalist approach to variable selection and linear model prediction.
IF 2.9 3区 生物学
BMC Bioinformatics Pub Date : 2024-12-18 DOI: 10.1186/s12859-024-06000-4
Jon Bohlin, Siri E Håberg, Per Magnus, Håkon K Gjessing
{"title":"MinLinMo: a minimalist approach to variable selection and linear model prediction.","authors":"Jon Bohlin, Siri E Håberg, Per Magnus, Håkon K Gjessing","doi":"10.1186/s12859-024-06000-4","DOIUrl":"https://doi.org/10.1186/s12859-024-06000-4","url":null,"abstract":"<p><p>Generating prediction models from high dimensional data often result in large models with many predictors. Causal inference for such models can therefore be difficult or even impossible in practice. The stand-alone software package MinLinMo emphasizes small linear prediction models over highest possible predictability with a particular focus on including variables correlated with the outcome, minimal memory usage and speed. MinLinMo is demonstrated on large epigenetic datasets with prediction models for chronological age, gestational age, and birth weight comprising, respectively, 15, 14 and 10 predictors. The parsimonious MinLinMo models perform comparably to established prediction models requiring hundreds of predictors.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"380"},"PeriodicalIF":2.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Informeasure: an R/bioconductor package for quantifying nonlinear dependence between variables in biological networks from an information theory perspective.
IF 2.9 3区 生物学
BMC Bioinformatics Pub Date : 2024-12-18 DOI: 10.1186/s12859-024-05996-z
Chu Pan, Yanlin Chen
{"title":"Informeasure: an R/bioconductor package for quantifying nonlinear dependence between variables in biological networks from an information theory perspective.","authors":"Chu Pan, Yanlin Chen","doi":"10.1186/s12859-024-05996-z","DOIUrl":"https://doi.org/10.1186/s12859-024-05996-z","url":null,"abstract":"<p><strong>Background: </strong>Using information measures to infer biological regulatory networks can capture nonlinear relationships between variables. However, it is computationally challenging, and there is a lack of convenient tools.</p><p><strong>Results: </strong>We introduce Informeasure, an R package designed to quantify nonlinear dependencies in biological regulatory networks from an information theory perspective. This package compiles a comprehensive set of information measurements, including mutual information, conditional mutual information, interaction information, partial information decomposition, and part mutual information. Mutual information is used for bivariate network inference, while the other four estimators are dedicated to trivariate network analysis.</p><p><strong>Conclusions: </strong>Informeasure is a turnkey solution, allowing users to utilize these information measures immediately upon installation. Informeasure is available as an R/Bioconductor package at https://bioconductor.org/packages/Informeasure .</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"382"},"PeriodicalIF":2.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NERVE 2.0: boosting the new enhanced reverse vaccinology environment via artificial intelligence and a user-friendly web interface.
IF 2.9 3区 生物学
BMC Bioinformatics Pub Date : 2024-12-18 DOI: 10.1186/s12859-024-06004-0
Andrea Conte, Nicola Gulmini, Francesco Costa, Matteo Cartura, Felix Bröhl, Francesco Patanè, Francesco Filippini
{"title":"NERVE 2.0: boosting the new enhanced reverse vaccinology environment via artificial intelligence and a user-friendly web interface.","authors":"Andrea Conte, Nicola Gulmini, Francesco Costa, Matteo Cartura, Felix Bröhl, Francesco Patanè, Francesco Filippini","doi":"10.1186/s12859-024-06004-0","DOIUrl":"https://doi.org/10.1186/s12859-024-06004-0","url":null,"abstract":"<p><strong>Background: </strong>Vaccines development in this millennium started by the milestone work on Neisseria meningitidis B, reporting the invention of Reverse Vaccinology (RV), which allows to identify vaccine candidates (VCs) by screening bacterial pathogens genome or proteome through computational analyses. When NERVE (New Enhanced RV Environment), the first RV software integrating tools to perform the selection of VCs, was released, it prompted further development in the field. However, the problem-solving potential of most, if not all, RV programs is still largely unexploited by experimental vaccinologists that impaired by somehow difficult interfaces, requiring bioinformatic skills.</p><p><strong>Results: </strong>We report here on the development and release of NERVE 2.0 (available at: https://nerve-bio.org ) which keeps the original integrative and modular approach of NERVE, while showing higher predictive performance than its previous version and other web-RV programs (Vaxign and Vaxijen). We renewed some of its modules and added innovative ones, such as Loop-Razor, to recover fragments of promising vaccine candidates or Epitope Prediction for the epitope prediction binding affinities and population coverage. Along with two newly built AI (Artificial Intelligence)-based models: ESPAAN and Virulent. To improve user-friendliness, NERVE was shifted to a tutored, web-based interface, with a noSQL-database to consent the user to submit, obtain and retrieve analysis results at any moment.</p><p><strong>Conclusions: </strong>With its redesigned and updated environment, NERVE 2.0 allows customisable and refinable bacterial protein vaccine analyses to all different kinds of users.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"378"},"PeriodicalIF":2.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Piikun: an information theoretic toolkit for analysis and visualization of species delimitation metric space.
IF 2.9 3区 生物学
BMC Bioinformatics Pub Date : 2024-12-18 DOI: 10.1186/s12859-024-05997-y
Jeet Sukumaran, Marina Meila
{"title":"Piikun: an information theoretic toolkit for analysis and visualization of species delimitation metric space.","authors":"Jeet Sukumaran, Marina Meila","doi":"10.1186/s12859-024-05997-y","DOIUrl":"https://doi.org/10.1186/s12859-024-05997-y","url":null,"abstract":"<p><strong>Background: </strong>Existing software for comparison of species delimitation models do not provide a (true) metric or distance functions between species delimitation models, nor a way to compare these models in terms of relative clustering differences along a lattice of partitions.</p><p><strong>Results: </strong>Piikun is a Python package for analyzing and visualizing species delimitation models in an information theoretic framework that, in addition to classic measures of information such as the entropy and mutual information [1], provides for the calculation of the Variation of Information (VI) criterion [2], a true metric or distance function for species delimitation models that is aligned with the lattice of partitions.</p><p><strong>Conclusions: </strong>Piikun is available under the MIT license from its public repository ( https://github.com/jeetsukumaran/piikun ), and can be installed locally using the Python package manager 'pip'.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"385"},"PeriodicalIF":2.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DNEA: an R package for fast and versatile data-driven network analysis of metabolomics data.
IF 2.9 3区 生物学
BMC Bioinformatics Pub Date : 2024-12-18 DOI: 10.1186/s12859-024-05994-1
Christopher Patsalis, Gayatri Iyer, Marci Brandenburg, Alla Karnovsky, George Michailidis
{"title":"DNEA: an R package for fast and versatile data-driven network analysis of metabolomics data.","authors":"Christopher Patsalis, Gayatri Iyer, Marci Brandenburg, Alla Karnovsky, George Michailidis","doi":"10.1186/s12859-024-05994-1","DOIUrl":"https://doi.org/10.1186/s12859-024-05994-1","url":null,"abstract":"<p><strong>Background: </strong>Metabolomics is a high-throughput technology that measures small molecule metabolites in cells, tissues or biofluids. Analysis of metabolomics data is a multi-step process that involves data processing, quality control and normalization, followed by statistical and bioinformatics analysis. The latter step often involves pathway analysis to aid biological interpretation of the data. This approach is limited to endogenous metabolites that can be readily mapped to metabolic pathways. An alternative to pathway analysis that can be used for any classes of metabolites, including unknown compounds that are ubiquitous in untargeted metabolomics data, involves defining metabolite-metabolite interactions using experimental data. Our group has developed several network-based methods that use partial correlations of experimentally determined metabolite measurements. These were implemented in CorrelationCalculator and Filigree, two software tools for the analysis of metabolomics data we developed previously. The latter tool implements the Differential Network Enrichment Analysis (DNEA) algorithm. This analysis is useful for building differential networks from metabolomics data containing two experimental groups and identifying differentially enriched metabolic modules. While Filigree is a user-friendly tool, it has certain limitations when used for the analysis of large-scale metabolomics datasets.</p><p><strong>Results: </strong>We developed the DNEA R package for the data-driven network analysis of metabolomics data. We present the DNEA workflow and functionality, algorithm enhancements implemented with respect to the package's predecessor, Filigree, and discuss best practices for analyses. We tested the performance of the DNEA R package and illustrated its features using publicly available metabolomics data from the environmental determinants of diabetes in the young. To our knowledge, this package is the only publicly available tool designed for the construction of biological networks and subsequent enrichment testing for datasets containing exogenous, secondary, and unknown compounds. This greatly expands the scope of traditional enrichment analysis tools that can be used to analyze a relatively small set of well-annotated metabolites.</p><p><strong>Conclusions: </strong>The DNEA R package is a more flexible and powerful implementation of our previously published software tool, Filigree. The modular structure of the package, along with the parallel processing framework built into the most computationally extensive steps of the algorithm, make it a powerful tool for the analysis of large and complex metabolomics datasets.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"25 1","pages":"383"},"PeriodicalIF":2.9,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142852195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信