Frontiers in bioinformatics最新文献

筛选
英文 中文
Maximum-scoring path sets on pangenome graphs of constant treewidth. 恒定树宽的盘根图上的最大得分路径集。
IF 2.8
Frontiers in bioinformatics Pub Date : 2024-07-01 eCollection Date: 2024-01-01 DOI: 10.3389/fbinf.2024.1391086
Broňa Brejová, Travis Gagie, Eva Herencsárová, Tomáš Vinař
{"title":"Maximum-scoring path sets on pangenome graphs of constant treewidth.","authors":"Broňa Brejová, Travis Gagie, Eva Herencsárová, Tomáš Vinař","doi":"10.3389/fbinf.2024.1391086","DOIUrl":"10.3389/fbinf.2024.1391086","url":null,"abstract":"<p><p>We generalize a problem of finding maximum-scoring segment sets, previously studied by Csűrös (IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2004, 1, 139-150), from sequences to graphs. Namely, given a vertex-weighted graph <i>G</i> and a non-negative startup penalty <i>c</i>, we can find a set of vertex-disjoint paths in <i>G</i> with maximum total score when each path's score is its vertices' total weight minus <i>c</i>. We call this new problem <i>maximum-scoring path sets</i> (MSPS). We present an algorithm that has a linear-time complexity for graphs with a constant treewidth. Generalization from sequences to graphs allows the algorithm to be used on pangenome graphs representing several related genomes and can be seen as a common abstraction for several biological problems on pangenomes, including searching for CpG islands, ChIP-seq data analysis, analysis of region enrichment for functional elements, or simple chaining problems.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1391086"},"PeriodicalIF":2.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11246863/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141621903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The evolution of mammalian Rem2: unraveling the impact of purifying selection and coevolution on protein function, and implications for human disorders. 哺乳动物 Rem2 的进化:揭示纯化选择和共同进化对蛋白质功能的影响,以及对人类疾病的影响。
IF 2.8
Frontiers in bioinformatics Pub Date : 2024-06-24 eCollection Date: 2024-01-01 DOI: 10.3389/fbinf.2024.1381540
Alexander G Lucaci, William E Brew, Jason Lamanna, Avery Selberg, Vincenzo Carnevale, Anna R Moore, Sergei L Kosakovsky Pond
{"title":"The evolution of mammalian Rem2: unraveling the impact of purifying selection and coevolution on protein function, and implications for human disorders.","authors":"Alexander G Lucaci, William E Brew, Jason Lamanna, Avery Selberg, Vincenzo Carnevale, Anna R Moore, Sergei L Kosakovsky Pond","doi":"10.3389/fbinf.2024.1381540","DOIUrl":"10.3389/fbinf.2024.1381540","url":null,"abstract":"<p><p>Rad And Gem-Like GTP-Binding Protein 2 (Rem2), a member of the RGK family of Ras-like GTPases, is implicated in Huntington's disease and Long QT Syndrome and is highly expressed in the brain and endocrine cells. We examine the evolutionary history of Rem2 identified in various mammalian species, focusing on the role of purifying selection and coevolution in shaping its sequence and protein structural constraints. Our analysis of Rem2 sequences across 175 mammalian species found evidence for strong purifying selection in 70% of non-invariant codon sites which is characteristic of essential proteins that play critical roles in biological processes and is consistent with Rem2's role in the regulation of neuronal development and function. We inferred epistatic effects in 50 pairs of codon sites in Rem2, some of which are predicted to have deleterious effects on human health. Additionally, we reconstructed the ancestral evolutionary history of mammalian Rem2 using protein structure prediction of extinct and extant sequences which revealed the dynamics of how substitutions that change the gene sequence of Rem2 can impact protein structure in variable regions while maintaining core functional mechanisms. By understanding the selective pressures, protein- and gene - interactions that have shaped the sequence and structure of the Rem2 protein, we gain a stronger understanding of its biological and functional constraints.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1381540"},"PeriodicalIF":2.8,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11228553/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141560465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bioinformatics proficiency among African students. 非洲学生的生物信息学能力。
IF 2.8
Frontiers in bioinformatics Pub Date : 2024-06-20 eCollection Date: 2024-01-01 DOI: 10.3389/fbinf.2024.1328714
Ashraf Akintayo Akintola, Abdullahi Tunde Aborode, Muhammed Taofiq Hamza, Augustine Amakiri, Benjamin Moore, Suliat Abdulai, Oluyinka Ajibola Iyiola, Lateef Adegboyega Sulaimon, Effiong Effiong, Adedeji Ogunyemi, Boluwatife Dosunmu, Abdulkadir Yusif Maigoro, Opeyemi Lawal, Kayode Raheem, Ui Wook Hwang
{"title":"Bioinformatics proficiency among African students.","authors":"Ashraf Akintayo Akintola, Abdullahi Tunde Aborode, Muhammed Taofiq Hamza, Augustine Amakiri, Benjamin Moore, Suliat Abdulai, Oluyinka Ajibola Iyiola, Lateef Adegboyega Sulaimon, Effiong Effiong, Adedeji Ogunyemi, Boluwatife Dosunmu, Abdulkadir Yusif Maigoro, Opeyemi Lawal, Kayode Raheem, Ui Wook Hwang","doi":"10.3389/fbinf.2024.1328714","DOIUrl":"10.3389/fbinf.2024.1328714","url":null,"abstract":"<p><p>Bioinformatics, the interdisciplinary field that combines biology, computer science, and data analysis, plays a pivotal role in advancing our understanding of life sciences. In the African context, where the diversity of biological resources and healthcare challenges is substantial, fostering bioinformatics literacy and proficiency among students is important. This perspective provides an overview of the state of bioinformatics literacy among African students, highlighting the significance, challenges, and potential solutions in addressing this critical educational gap. It proposes various strategies to enhance bioinformatics literacy among African students. These include expanding educational resources, fostering collaboration between institutions, and engaging students in research projects. By addressing the current challenges and implementing comprehensive strategies, African students can harness the power of bioinformatics to contribute to innovative solutions in healthcare, agriculture, and biodiversity conservation, ultimately advancing the continent's scientific capabilities and improving the quality of life for her people. In conclusion, promoting bioinformatics literacy among African students is imperative for the continent's scientific development and advancing frontiers of biological research.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1328714"},"PeriodicalIF":2.8,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11222312/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141536364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive multi-omics analysis reveals unique signatures to predict Alzheimer's disease. 全面的多组学分析揭示了预测阿尔茨海默病的独特特征。
IF 2.8
Frontiers in bioinformatics Pub Date : 2024-06-19 eCollection Date: 2024-01-01 DOI: 10.3389/fbinf.2024.1390607
Michael Vacher, Rodrigo Canovas, Simon M Laws, James D Doecke
{"title":"A comprehensive multi-omics analysis reveals unique signatures to predict Alzheimer's disease.","authors":"Michael Vacher, Rodrigo Canovas, Simon M Laws, James D Doecke","doi":"10.3389/fbinf.2024.1390607","DOIUrl":"10.3389/fbinf.2024.1390607","url":null,"abstract":"<p><strong>Background: </strong>Complex disorders, such as Alzheimer's disease (AD), result from the combined influence of multiple biological and environmental factors. The integration of high-throughput data from multiple omics platforms can provide system overviews, improving our understanding of complex biological processes underlying human disease. In this study, integrated data from four omics platforms were used to characterise biological signatures of AD.</p><p><strong>Method: </strong>The study cohort consists of 455 participants (Control:148, Cases:307) from the Religious Orders Study and Memory and Aging Project (ROSMAP). Genotype (SNP), methylation (CpG), RNA and proteomics data were collected, quality-controlled and pre-processed (SNP = 130; CpG = 83; RNA = 91; Proteomics = 119). Using a diagnosis of Mild Cognitive Impairment (MCI)/AD combined as the target phenotype, we first used Partial Least Squares Regression as an unsupervised classification framework to assess the prediction capabilities for each omics dataset individually. We then used a variation of the sparse generalized canonical correlation analysis (sGCCA) to assess predictions of the combined datasets and identify multi-omics signatures characterising each group of participants.</p><p><strong>Results: </strong>Analysing datasets individually we found methylation data provided the best predictions with an accuracy of 0.63 (95%CI = [0.54-0.71]), followed by RNA, 0.61 (95%CI = [0.52-0.69]), SNP, 0.59 (95%CI = [0.51-0.68]) and proteomics, 0.58 (95%CI = [0.51-0.67]). After integration of the four datasets, predictions were dramatically improved with a resulting accuracy of 0.95 (95% CI = [0.89-0.98]).</p><p><strong>Conclusion: </strong>The integration of data from multiple platforms is a powerful approach to explore biological systems and better characterise the biological signatures of AD. The results suggest that integrative methods can identify biomarker panels with improved predictive performance compared to individual platforms alone. Further validation in independent cohorts is required to validate and refine the results presented in this study.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1390607"},"PeriodicalIF":2.8,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11219798/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141499827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human cytokine and coronavirus nucleocapsid protein interactivity using large-scale virtual screens. 利用大规模虚拟筛选研究人类细胞因子与冠状病毒核壳蛋白的相互作用。
Frontiers in bioinformatics Pub Date : 2024-05-24 eCollection Date: 2024-01-01 DOI: 10.3389/fbinf.2024.1397968
Phillip J Tomezsko, Colby T Ford, Avery E Meyer, Adam M Michaleas, Rafael Jaimes
{"title":"Human cytokine and coronavirus nucleocapsid protein interactivity using large-scale virtual screens.","authors":"Phillip J Tomezsko, Colby T Ford, Avery E Meyer, Adam M Michaleas, Rafael Jaimes","doi":"10.3389/fbinf.2024.1397968","DOIUrl":"10.3389/fbinf.2024.1397968","url":null,"abstract":"<p><p>Understanding the interactions between SARS-CoV-2 and the human immune system is paramount to the characterization of novel variants as the virus co-evolves with the human host. In this study, we employed state-of-the-art molecular docking tools to conduct large-scale virtual screens, predicting the binding affinities between 64 human cytokines against 17 nucleocapsid proteins from six betacoronaviruses. Our comprehensive <i>in silico</i> analyses reveal specific changes in cytokine-nucleocapsid protein interactions, shedding light on potential modulators of the host immune response during infection. These findings offer valuable insights into the molecular mechanisms underlying viral pathogenesis and may guide the future development of targeted interventions. This manuscript serves as insight into the comparison of deep learning based AlphaFold2-Multimer and the semi-physicochemical based HADDOCK for protein-protein docking. We show the two methods are complementary in their predictive capabilities. We also introduce a novel algorithm for rapidly assessing the binding interface of protein-protein docks using graph edit distance: graph-based interface residue assessment function (GIRAF). The high-performance computational framework presented here will not only aid in accelerating the discovery of effective interventions against emerging viral threats, but extend to other applications of high throughput protein-protein screens.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1397968"},"PeriodicalIF":0.0,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11157076/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141297494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BayesAge: A maximum likelihood algorithm to predict epigenetic age. BayesAge:预测表观遗传年龄的最大似然法算法。
Frontiers in bioinformatics Pub Date : 2024-04-04 eCollection Date: 2024-01-01 DOI: 10.3389/fbinf.2024.1329144
Lajoyce Mboning, Liudmilla Rubbi, Michael Thompson, Louis-S Bouchard, Matteo Pellegrini
{"title":"BayesAge: A maximum likelihood algorithm to predict epigenetic age.","authors":"Lajoyce Mboning, Liudmilla Rubbi, Michael Thompson, Louis-S Bouchard, Matteo Pellegrini","doi":"10.3389/fbinf.2024.1329144","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1329144","url":null,"abstract":"<p><p><b>Introduction:</b> DNA methylation, specifically the formation of 5-methylcytosine at the C5 position of cytosine, undergoes reproducible changes as organisms age, establishing it as a significant biomarker in aging studies. Epigenetic clocks, which integrate methylation patterns to predict age, often employ linear models based on penalized regression, yet they encounter challenges in handling missing data, count-based bisulfite sequence data, and interpretation. <b>Methods:</b> To address these limitations, we introduce BayesAge, an extension of the scAge methodology originally designed for single-cell DNA methylation analysis. BayesAge employs maximum likelihood estimation (MLE) for age inference, models count data using binomial distributions, and incorporates LOWESS smoothing to capture non-linear methylation-age dynamics. This approach is tailored for bulk bisulfite sequencing datasets. <b>Results:</b> BayesAge demonstrates superior performance compared to scAge. Notably, its age residuals exhibit no age association, offering a less biased representation of epigenetic age variation across populations. Furthermore, BayesAge facilitates the estimation of error bounds on age inference. When applied to down-sampled data, BayesAge achieves a higher coefficient of determination between predicted and actual ages compared to both scAge and penalized regression. <b>Discussion:</b> BayesAge presents a promising advancement in epigenetic age prediction, addressing key challenges encountered by existing models. By integrating robust statistical techniques and tailored methodologies for count-based data, BayesAge offers improved accuracy and interpretability in predicting age from bulk bisulfite sequencing datasets. Its ability to estimate error bounds enhances the reliability of age inference, thereby contributing to a more comprehensive understanding of epigenetic aging processes.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1329144"},"PeriodicalIF":0.0,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11024280/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140869053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ursaPGx: a new R package to annotate pharmacogenetic star alleles using phased whole-genome sequencing data. ursaPGx:利用分阶段全基因组测序数据注释药物遗传星等位基因的新 R 软件包。
IF 2.8
Frontiers in bioinformatics Pub Date : 2024-03-12 eCollection Date: 2024-01-01 DOI: 10.3389/fbinf.2024.1351620
Gennaro Calendo, Dara Kusic, Jozef Madzo, Neda Gharani, Laura Scheinfeldt
{"title":"ursaPGx: a new R package to annotate pharmacogenetic star alleles using phased whole-genome sequencing data.","authors":"Gennaro Calendo, Dara Kusic, Jozef Madzo, Neda Gharani, Laura Scheinfeldt","doi":"10.3389/fbinf.2024.1351620","DOIUrl":"10.3389/fbinf.2024.1351620","url":null,"abstract":"<p><p>Long-read sequencing technologies offer new opportunities to generate high-confidence phased whole-genome sequencing data for robust pharmacogenetic annotation. Here, we describe a new user-friendly R package, ursaPGx, designed to accept multi-sample phased whole-genome sequencing data VCF input files and output star allele annotations for pharmacogenes annotated in PharmVar.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1351620"},"PeriodicalIF":2.8,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10963438/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140295559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A network-based method for associating genes with autism spectrum disorder. 基于网络的自闭症谱系障碍基因关联方法。
Frontiers in bioinformatics Pub Date : 2024-03-08 eCollection Date: 2024-01-01 DOI: 10.3389/fbinf.2024.1295600
Neta Zadok, Gil Ast, Roded Sharan
{"title":"A network-based method for associating genes with autism spectrum disorder.","authors":"Neta Zadok, Gil Ast, Roded Sharan","doi":"10.3389/fbinf.2024.1295600","DOIUrl":"10.3389/fbinf.2024.1295600","url":null,"abstract":"<p><p>Autism spectrum disorder (ASD) is a highly heritable complex disease that affects 1% of the population, yet its underlying molecular mechanisms are largely unknown. Here we study the problem of predicting causal genes for ASD by combining genome-scale data with a network propagation approach. We construct a predictor that integrates multiple omic data sets that assess genomic, transcriptomic, proteomic, and phosphoproteomic associations with ASD. In cross validation our predictor yields mean area under the ROC curve of 0.87 and area under the precision-recall curve of 0.89. We further show that it outperforms previous gene-level predictors of autism association. Finally, we show that we can use the model to predict genes associated with Schizophrenia which is known to share genetic components with ASD.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1295600"},"PeriodicalIF":0.0,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10960359/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140208310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting subtle transcriptomic perturbations induced by lncRNAs knock-down in single-cell CRISPRi screening using a new sparse supervised autoencoder neural network. 在单细胞 CRISPRi 筛选中使用新型稀疏监督自动编码器神经网络检测 lncRNAs 敲除引起的微妙转录组扰动
Frontiers in bioinformatics Pub Date : 2024-03-04 eCollection Date: 2024-01-01 DOI: 10.3389/fbinf.2024.1340339
Marin Truchi, Caroline Lacoux, Cyprien Gille, Julien Fassy, Virginie Magnone, Rafael Lopes Goncalves, Cédric Girard-Riboulleau, Iris Manosalva-Pena, Marine Gautier-Isola, Kevin Lebrigand, Pascal Barbry, Salvatore Spicuglia, Georges Vassaux, Roger Rezzonico, Michel Barlaud, Bernard Mari
{"title":"Detecting subtle transcriptomic perturbations induced by lncRNAs knock-down in single-cell CRISPRi screening using a new sparse supervised autoencoder neural network.","authors":"Marin Truchi, Caroline Lacoux, Cyprien Gille, Julien Fassy, Virginie Magnone, Rafael Lopes Goncalves, Cédric Girard-Riboulleau, Iris Manosalva-Pena, Marine Gautier-Isola, Kevin Lebrigand, Pascal Barbry, Salvatore Spicuglia, Georges Vassaux, Roger Rezzonico, Michel Barlaud, Bernard Mari","doi":"10.3389/fbinf.2024.1340339","DOIUrl":"10.3389/fbinf.2024.1340339","url":null,"abstract":"<p><p>Single-cell CRISPR-based transcriptome screens are potent genetic tools for concomitantly assessing the expression profiles of cells targeted by a set of guides RNA (gRNA), and inferring target gene functions from the observed perturbations. However, due to various limitations, this approach lacks sensitivity in detecting weak perturbations and is essentially reliable when studying master regulators such as transcription factors. To overcome the challenge of detecting subtle gRNA induced transcriptomic perturbations and classifying the most responsive cells, we developed a new supervised autoencoder neural network method. Our Sparse supervised autoencoder (SSAE) neural network provides selection of both relevant features (genes) and actual perturbed cells. We applied this method on an in-house single-cell CRISPR-interference-based (CRISPRi) transcriptome screening (CROP-Seq) focusing on a subset of long non-coding RNAs (lncRNAs) regulated by hypoxia, a condition that promote tumor aggressiveness and drug resistance, in the context of lung adenocarcinoma (LUAD). The CROP-seq library of validated gRNA against a subset of lncRNAs and, as positive controls, HIF1A and HIF2A, the 2 main transcription factors of the hypoxic response, was transduced in A549 LUAD cells cultured in normoxia or exposed to hypoxic conditions during 3, 6 or 24 h. We first validated the SSAE approach on HIF1A and HIF2 by confirming the specific effect of their knock-down during the temporal switch of the hypoxic response. Next, the SSAE method was able to detect stable short hypoxia-dependent transcriptomic signatures induced by the knock-down of some lncRNAs candidates, outperforming previously published machine learning approaches. This proof of concept demonstrates the relevance of the SSAE approach for deciphering weak perturbations in single-cell transcriptomic data readout as part of CRISPR-based screening.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1340339"},"PeriodicalIF":0.0,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10945021/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140159701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting cell population-specific gene expression from genomic sequence. 从基因组序列预测细胞群特异性基因表达。
Frontiers in bioinformatics Pub Date : 2024-03-04 eCollection Date: 2024-01-01 DOI: 10.3389/fbinf.2024.1347276
Lieke Michielsen, Marcel J T Reinders, Ahmed Mahfouz
{"title":"Predicting cell population-specific gene expression from genomic sequence.","authors":"Lieke Michielsen, Marcel J T Reinders, Ahmed Mahfouz","doi":"10.3389/fbinf.2024.1347276","DOIUrl":"10.3389/fbinf.2024.1347276","url":null,"abstract":"<p><p>Most regulatory elements, especially enhancer sequences, are cell population-specific. One could even argue that a distinct set of regulatory elements is what defines a cell population. However, discovering which non-coding regions of the DNA are essential in which context, and as a result, which genes are expressed, is a difficult task. Some computational models tackle this problem by predicting gene expression directly from the genomic sequence. These models are currently limited to predicting bulk measurements and mainly make tissue-specific predictions. Here, we present a model that leverages single-cell RNA-sequencing data to predict gene expression. We show that cell population-specific models outperform tissue-specific models, especially when the expression profile of a cell population and the corresponding tissue are dissimilar. Further, we show that our model can prioritize GWAS variants and learn motifs of transcription factor binding sites. We envision that our model can be useful for delineating cell population-specific regulatory elements.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1347276"},"PeriodicalIF":0.0,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10944912/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140159702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信