PLoS Computational Biology最新文献

筛选
英文 中文
Calibrating dimension reduction hyperparameters in the presence of noise 在存在噪声的情况下校准降维超参数
IF 4.3 2区 生物学
PLoS Computational Biology Pub Date : 2024-09-12 DOI: 10.1371/journal.pcbi.1012427
Justin Lin, Julia Fukuyama
{"title":"Calibrating dimension reduction hyperparameters in the presence of noise","authors":"Justin Lin, Julia Fukuyama","doi":"10.1371/journal.pcbi.1012427","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012427","url":null,"abstract":"The goal of dimension reduction tools is to construct a low-dimensional representation of high-dimensional data. These tools are employed for a variety of reasons such as noise reduction, visualization, and to lower computational costs. However, there is a fundamental issue that is discussed in other modeling problems that is often overlooked in dimension reduction—overfitting. In the context of other modeling problems, techniques such as feature-selection, cross-validation, and regularization are employed to combat overfitting, but rarely are such precautions taken when applying dimension reduction. Prior applications of the two most popular non-linear dimension reduction methods, t-SNE and UMAP, fail to acknowledge data as a combination of signal and noise when assessing performance. These methods are typically calibrated to capture the entirety of the data, not just the signal. In this paper, we demonstrate the importance of acknowledging noise when calibrating hyperparameters and present a framework that enables users to do so. We use this framework to explore the role hyperparameter calibration plays in overfitting the data when applying t-SNE and UMAP. More specifically, we show previously recommended values for perplexity and n_neighbors are too small and overfit the noise. We also provide a workflow others may use to calibrate hyperparameters in the presence of noise.","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shedding light on blue-green photosynthesis: A wavelength-dependent mathematical model of photosynthesis in Synechocystis sp. PCC 6803 揭示蓝绿光合作用:Synechocystis sp. PCC 6803光合作用的波长依赖性数学模型
IF 4.3 2区 生物学
PLoS Computational Biology Pub Date : 2024-09-12 DOI: 10.1371/journal.pcbi.1012445
Tobias Pfennig, Elena Kullmann, Tomáš Zavřel, Andreas Nakielski, Oliver Ebenhöh, Jan Červený, Gábor Bernát, Anna Barbara Matuszyńska
{"title":"Shedding light on blue-green photosynthesis: A wavelength-dependent mathematical model of photosynthesis in Synechocystis sp. PCC 6803","authors":"Tobias Pfennig, Elena Kullmann, Tomáš Zavřel, Andreas Nakielski, Oliver Ebenhöh, Jan Červený, Gábor Bernát, Anna Barbara Matuszyńska","doi":"10.1371/journal.pcbi.1012445","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012445","url":null,"abstract":"Cyanobacteria hold great potential to revolutionize conventional industries and farming practices with their light-driven chemical production. To fully exploit their photosynthetic capacity and enhance product yield, it is crucial to investigate their intricate interplay with the environment including the light intensity and spectrum. Mathematical models provide valuable insights for optimizing strategies in this pursuit. In this study, we present an ordinary differential equation-based model for the cyanobacterium <jats:italic>Synechocystis</jats:italic> sp. PCC 6803 to assess its performance under various light sources, including monochromatic light. Our model can reproduce a variety of physiologically measured quantities, e.g. experimentally reported partitioning of electrons through four main pathways, O<jats:sub>2</jats:sub> evolution, and the rate of carbon fixation for ambient and saturated CO<jats:sub>2</jats:sub>. By capturing the interactions between different components of a photosynthetic system, our model helps in understanding the underlying mechanisms driving system behavior. Our model qualitatively reproduces fluorescence emitted under various light regimes, replicating Pulse-amplitude modulation (PAM) fluorometry experiments with saturating pulses. Using our model, we test four hypothesized mechanisms of cyanobacterial state transitions for ensemble of parameter sets and found no physiological benefit of a model assuming phycobilisome detachment. Moreover, we evaluate metabolic control for biotechnological production under diverse light colors and irradiances. We suggest gene targets for overexpression under different illuminations to increase the yield. By offering a comprehensive computational model of cyanobacterial photosynthesis, our work enhances the basic understanding of light-dependent cyanobacterial behavior and sets the first wavelength-dependent framework to systematically test their producing capacity for biocatalysis.","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An exploration into CTEPH medications: Combining natural language processing, embedding learning, in vitro models, and real-world evidence for drug repurposing 对 CTEPH 药物的探索:将自然语言处理、嵌入式学习、体外模型和真实世界证据相结合,促进药物再利用
IF 4.3 2区 生物学
PLoS Computational Biology Pub Date : 2024-09-12 DOI: 10.1371/journal.pcbi.1012417
Daniel Steiert, Corey Wittig, Priyanka Banerjee, Robert Preissner, Robert Szulcek
{"title":"An exploration into CTEPH medications: Combining natural language processing, embedding learning, in vitro models, and real-world evidence for drug repurposing","authors":"Daniel Steiert, Corey Wittig, Priyanka Banerjee, Robert Preissner, Robert Szulcek","doi":"10.1371/journal.pcbi.1012417","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012417","url":null,"abstract":"Background In the modern era, the growth of scientific literature presents a daunting challenge for researchers to keep informed of advancements across multiple disciplines. Objective We apply natural language processing (NLP) and embedding learning concepts to design PubDigest, a tool that combs PubMed literature, aiming to pinpoint potential drugs that could be repurposed. Methods Using NLP, especially term associations through word embeddings, we explored unrecognized relationships between drugs and diseases. To illustrate the utility of PubDigest, we focused on chronic thromboembolic pulmonary hypertension (CTEPH), a rare disease with an overall limited number of scientific publications. Results Our literature analysis identified key clinical features linked to CTEPH by applying term frequency-inverse document frequency (TF-IDF) scoring, a technique measuring a term’s significance in a text corpus. This allowed us to map related diseases. One standout was venous thrombosis (VT), which showed strong semantic links with CTEPH. Looking deeper, we discovered potential repurposing candidates for CTEPH through large-scale neural network-based contextualization of literature and predictive modeling on both the CTEPH and the VT literature corpora to find novel, yet unrecognized associations between the two diseases. Alongside the anti-thrombotic agent caplacizumab, benzofuran derivatives were an intriguing find. In particular, the benzofuran derivative amiodarone displayed potential anti-thrombotic properties in the literature. Our <jats:italic>in vitro</jats:italic> tests confirmed amiodarone’s ability to reduce platelet aggregation significantly by 68% (p = 0.02). However, real-world clinical data indicated that CTEPH patients receiving amiodarone treatment faced a significant 15.9% higher mortality risk (p&lt;0.001). Conclusions While NLP offers an innovative approach to interpreting scientific literature, especially for drug repurposing, it is crucial to combine it with complementary methods like <jats:italic>in vitro</jats:italic> testing and real-world evidence. Our exploration with benzofuran derivatives and CTEPH underscores this point. Thus, blending NLP with hands-on experiments and real-world clinical data can pave the way for faster and safer drug repurposing approaches, especially for rare diseases like CTEPH.","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison and benchmark of deep learning methods for non-coding RNA classification 非编码 RNA 分类深度学习方法的比较与基准
IF 4.3 2区 生物学
PLoS Computational Biology Pub Date : 2024-09-12 DOI: 10.1371/journal.pcbi.1012446
Constance Creux, Farida Zehraoui, François Radvanyi, Fariza Tahi
{"title":"Comparison and benchmark of deep learning methods for non-coding RNA classification","authors":"Constance Creux, Farida Zehraoui, François Radvanyi, Fariza Tahi","doi":"10.1371/journal.pcbi.1012446","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012446","url":null,"abstract":"The involvement of non-coding RNAs in biological processes and diseases has made the exploration of their functions crucial. Most non-coding RNAs have yet to be studied, creating the need for methods that can rapidly classify large sets of non-coding RNAs into functional groups, or classes. In recent years, the success of deep learning in various domains led to its application to non-coding RNA classification. Multiple novel architectures have been developed, but these advancements are not covered by current literature reviews. We present an exhaustive comparison of the different methods proposed in the state-of-the-art and describe their associated datasets. Moreover, the literature lacks objective benchmarks. We perform experiments to fairly evaluate the performance of various tools for non-coding RNA classification on popular datasets. The robustness of methods to non-functional sequences and sequence boundary noise is explored. We also measure computation time and CO<jats:sub>2</jats:sub> emissions. With regard to these results, we assess the relevance of the different architectural choices and provide recommendations to consider in future methods.","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142225046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and implementation of an asynchronous online course-based undergraduate research experience (CURE) in computational genomics 设计和实施基于异步在线课程的计算基因组学本科生研究体验 (CURE)
IF 4.3 2区 生物学
PLoS Computational Biology Pub Date : 2024-09-12 DOI: 10.1371/journal.pcbi.1012384
Seema B. Plaisier, Danielle O. Alarid, Joelle A. Denning, Sara E. Brownell, Kenneth H. Buetow, Katelyn M. Cooper, Melissa A. Wilson
{"title":"Design and implementation of an asynchronous online course-based undergraduate research experience (CURE) in computational genomics","authors":"Seema B. Plaisier, Danielle O. Alarid, Joelle A. Denning, Sara E. Brownell, Kenneth H. Buetow, Katelyn M. Cooper, Melissa A. Wilson","doi":"10.1371/journal.pcbi.1012384","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012384","url":null,"abstract":"As genomics technologies advance, there is a growing demand for computational biologists trained for genomics analysis but instructors face significant hurdles in providing formal training in computer programming, statistics, and genomics to biology students. Fully online learners represent a significant and growing community that can contribute to meet this need, but they are frequently excluded from valuable research opportunities which mostly do not offer the flexibility they need. To address these opportunity gaps, we developed an asynchronous course-based undergraduate research experience (CURE) for computational genomics specifically for fully online biology students. We generated custom learning materials and leveraged remotely accessible computational tools to address 2 novel research questions over 2 iterations of the genomics CURE, one testing bioinformatics approaches and one mining cancer genomics data. Here, we present how the instructional team distributed analysis needed to address these questions between students over a 7.5-week CURE and provided concurrent training in biology and statistics, computer programming, and professional development. Scores from identical learning assessments administered before and after completion of each CURE showed significant learning gains across biology and coding course objectives. Open-response progress reports were submitted weekly and identified self-reported adaptive coping strategies for challenges encountered throughout the course. Progress reports identified problems that could be resolved through collaboration with instructors and peers via messaging platforms and virtual meetings. We implemented asynchronous communication using the Slack messaging platform and an asynchronous journal club where students discussed relevant publications using the Perusall social annotation platform. The online genomics CURE resulted in unanticipated positive outcomes, including students voluntarily discussing plans to continue research after the course. These outcomes underscore the effectiveness of this genomics CURE for scientific training, recruitment and student-mentor relationships, and student successes. Asynchronous genomics CUREs can contribute to a more skilled, diverse, and inclusive workforce for the advancement of biomedical science.","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SEMbap: Bow-free covariance search and data de-correlation SEMbap:无弓协方差搜索和数据去相关性
IF 4.3 2区 生物学
PLoS Computational Biology Pub Date : 2024-09-11 DOI: 10.1371/journal.pcbi.1012448
Mario Grassi, Barbara Tarantino
{"title":"SEMbap: Bow-free covariance search and data de-correlation","authors":"Mario Grassi, Barbara Tarantino","doi":"10.1371/journal.pcbi.1012448","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012448","url":null,"abstract":"Large-scale studies of gene expression are commonly influenced by biological and technical sources of expression variation, including batch effects, sample characteristics, and environmental impacts. Learning the causal relationships between observable variables may be challenging in the presence of unobserved confounders. Furthermore, many high-dimensional regression techniques may perform worse. In fact, controlling for unobserved confounding variables is essential, and many deconfounding methods have been suggested for application in a variety of situations. The main contribution of this article is the development of a two-stage deconfounding procedure based on Bow-free Acyclic Paths (BAP) search developed into the framework of Structural Equation Models (SEM), called <jats:monospace specific-use=\"no-wrap\">SEMbap()</jats:monospace>. In the first stage, an exhaustive search of missing edges with significant covariance is performed via Shipley d-separation tests; then, in the second stage, a Constrained Gaussian Graphical Model (CGGM) is fitted or a low dimensional representation of bow-free edges structure is obtained via Graph Laplacian Principal Component Analysis (gLPCA). We compare four popular deconfounding methods to BAP search approach with applications on simulated and observed expression data. In the former, different structures of the hidden covariance matrix have been replicated. Compared to existing methods, BAP search algorithm is able to correctly identify hidden confounding whilst controlling false positive rate and achieving good fitting and perturbation metrics.","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Creating cell-specific computational models of stem cell-derived cardiomyocytes using optical experiments 利用光学实验创建干细胞衍生心肌细胞的细胞特异性计算模型
IF 4.3 2区 生物学
PLoS Computational Biology Pub Date : 2024-09-11 DOI: 10.1371/journal.pcbi.1011806
Janice Yang, Neil J. Daily, Taylor K. Pullinger, Tetsuro Wakatsuki, Eric A. Sobie
{"title":"Creating cell-specific computational models of stem cell-derived cardiomyocytes using optical experiments","authors":"Janice Yang, Neil J. Daily, Taylor K. Pullinger, Tetsuro Wakatsuki, Eric A. Sobie","doi":"10.1371/journal.pcbi.1011806","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1011806","url":null,"abstract":"Human induced pluripotent stem cell-derived cardiomyocytes (iPSC-CMs) have gained traction as a powerful model in cardiac disease and therapeutics research, since iPSCs are self-renewing and can be derived from healthy and diseased patients without invasive surgery. However, current iPSC-CM differentiation methods produce cardiomyocytes with immature, fetal-like electrophysiological phenotypes, and the variety of maturation protocols in the literature results in phenotypic differences between labs. Heterogeneity of iPSC donor genetic backgrounds contributes to additional phenotypic variability. Several mathematical models of iPSC-CM electrophysiology have been developed to help to predict cell responses, but these models individually do not capture the phenotypic variability observed in iPSC-CMs. Here, we tackle these limitations by developing a computational pipeline to calibrate cell preparation-specific iPSC-CM electrophysiological parameters. We used the genetic algorithm (GA), a heuristic parameter calibration method, to tune ion channel parameters in a mathematical model of iPSC-CM physiology. To systematically optimize an experimental protocol that generates sufficient data for parameter calibration, we created <jats:italic>in silico</jats:italic> datasets by simulating various protocols applied to a population of models with known conductance variations, and then fitted parameters to those datasets. We found that calibrating to voltage and calcium transient data under 3 varied experimental conditions, including electrical pacing combined with ion channel blockade and changing buffer ion concentrations, improved model parameter estimates and model predictions of unseen channel block responses. This observation also held when the fitted data were normalized, suggesting that normalized fluorescence recordings, which are more accessible and higher throughput than patch clamp recordings, could sufficiently inform conductance parameters. Therefore, this computational pipeline can be applied to different iPSC-CM preparations to determine cell line-specific ion channel properties and understand the mechanisms behind variability in perturbation responses.","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142225043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Oscillations in an artificial neural network convert competing inputs into a temporal code 人工神经网络中的振荡将竞争输入转化为时间代码
IF 4.3 2区 生物学
PLoS Computational Biology Pub Date : 2024-09-11 DOI: 10.1371/journal.pcbi.1012429
Katharina Duecker, Marco Idiart, Marcel van Gerven, Ole Jensen
{"title":"Oscillations in an artificial neural network convert competing inputs into a temporal code","authors":"Katharina Duecker, Marco Idiart, Marcel van Gerven, Ole Jensen","doi":"10.1371/journal.pcbi.1012429","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012429","url":null,"abstract":"The field of computer vision has long drawn inspiration from neuroscientific studies of the human and non-human primate visual system. The development of convolutional neural networks (CNNs), for example, was informed by the properties of simple and complex cells in early visual cortex. However, the computational relevance of oscillatory dynamics experimentally observed in the visual system are typically not considered in artificial neural networks (ANNs). Computational models of neocortical dynamics, on the other hand, rarely take inspiration from computer vision. Here, we combine methods from computational neuroscience and machine learning to implement multiplexing in a simple ANN using oscillatory dynamics. We first trained the network to classify individually presented letters. Post-training, we added temporal dynamics to the hidden layer, introducing refraction in the hidden units as well as pulsed inhibition mimicking neuronal alpha oscillations. Without these dynamics, the trained network correctly classified individual letters but produced a mixed output when presented with two letters simultaneously, indicating a bottleneck problem. When introducing refraction and oscillatory inhibition, the output nodes corresponding to the two stimuli activate sequentially, ordered along the phase of the inhibitory oscillations. Our model implements the idea that inhibitory oscillations segregate competing inputs in time. The results of our simulations pave the way for applications in deeper network architectures and more complicated machine learning problems.","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The primacy model and the structure of olfactory space 首要模式和嗅觉空间结构
IF 4.3 2区 生物学
PLoS Computational Biology Pub Date : 2024-09-10 DOI: 10.1371/journal.pcbi.1012379
Hamza Giaffar, Sergey Shuvaev, Dmitry Rinberg, Alexei A. Koulakov
{"title":"The primacy model and the structure of olfactory space","authors":"Hamza Giaffar, Sergey Shuvaev, Dmitry Rinberg, Alexei A. Koulakov","doi":"10.1371/journal.pcbi.1012379","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012379","url":null,"abstract":"Understanding sensory processing relies on the establishment of a consistent relationship between the stimulus space, its neural representation, and perceptual quality. In olfaction, the difficulty in establishing these links lies partly in the complexity of the underlying odor input space and perceptual responses. Based on the recently proposed primacy model for concentration invariant odor identity representation and a few assumptions, we have developed a theoretical framework for mapping the odor input space to the response properties of olfactory receptors. We analyze a geometrical structure containing odor representations in a multidimensional space of receptor affinities and describe its low-dimensional implementation, the primacy hull. We propose the implications of the primacy hull for the structure of feedforward connectivity in early olfactory networks. We test the predictions of our theory by comparing the existing receptor-ligand affinity and connectivity data obtained in the fruit fly olfactory system. We find that the Kenyon cells of the insect mushroom body integrate inputs from the high-affinity (primacy) sets of olfactory receptors in agreement with the primacy theory.","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construct prognostic models of multiple myeloma with pathway information incorporated 构建包含路径信息的多发性骨髓瘤预后模型
IF 4.3 2区 生物学
PLoS Computational Biology Pub Date : 2024-09-10 DOI: 10.1371/journal.pcbi.1012444
Shuo Wang, ShanJin Wang, Wei Pan, YuYang Yi, Junyan Lu
{"title":"Construct prognostic models of multiple myeloma with pathway information incorporated","authors":"Shuo Wang, ShanJin Wang, Wei Pan, YuYang Yi, Junyan Lu","doi":"10.1371/journal.pcbi.1012444","DOIUrl":"https://doi.org/10.1371/journal.pcbi.1012444","url":null,"abstract":"Multiple myeloma (MM) is a hematological disease exhibiting aberrant clonal expansion of cancerous plasma cells in the bone marrow. The effects of treatments for MM vary between patients, highlighting the importance of developing prognostic models for informed therapeutic decision-making. Most previous models were constructed at the gene level, ignoring the fact that the dysfunction of the pathway is closely associated with disease development and progression. The present study considered two strategies that construct predictive models by taking pathway information into consideration: pathway score method and group lasso using pathway information. The former simply converted gene expression to sample-wise pathway scores for model fitting. We considered three methods for pathway score calculation (ssGSEA, GSVA, and z-scores) and 14 data sources providing pathway information. We implemented these methods in microarray data for MM (GSE136324) and obtained a candidate model with the best prediction performance in interval validation. The candidate model is further compared with the gene-based model and previously published models in two external data. We also investigated the effects of missing values on prediction. The results showed that group lasso incorporating Vax pathway information (Vax(grp)) was more competitive in prediction than the gene model in both internal and external validation. Immune information, including VAX pathways, seemed to be more predictive for MM. Vax(grp) also outperformed the previously published models. Moreover, the new model was more resistant to missing values, and the presence of missing values (&lt;5%) would not evidently deteriorate its prediction accuracy using our missing data imputation method. In a nutshell, pathway-based models (using group lasso) were competitive alternatives to gene-based models for MM. These models were documented in an R package (<jats:ext-link xmlns:xlink=\"http://www.w3.org/1999/xlink\" ext-link-type=\"uri\" xlink:href=\"https://github.com/ShuoStat/MMMs\" xlink:type=\"simple\">https://github.com/ShuoStat/MMMs</jats:ext-link>), where a missing data imputation method was also integrated to facilitate future validation.","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信