Briefings in bioinformatics最新文献

筛选
英文 中文
MLSNet: a deep learning model for predicting transcription factor binding sites. MLSNet:用于预测转录因子结合位点的深度学习模型。
IF 6.8 2区 生物学
Briefings in bioinformatics Pub Date : 2024-09-23 DOI: 10.1093/bib/bbae489
Yuchuan Zhang, Zhikang Wang, Fang Ge, Xiaoyu Wang, Yiwen Zhang, Shanshan Li, Yuming Guo, Jiangning Song, Dong-Jun Yu
{"title":"MLSNet: a deep learning model for predicting transcription factor binding sites.","authors":"Yuchuan Zhang, Zhikang Wang, Fang Ge, Xiaoyu Wang, Yiwen Zhang, Shanshan Li, Yuming Guo, Jiangning Song, Dong-Jun Yu","doi":"10.1093/bib/bbae489","DOIUrl":"10.1093/bib/bbae489","url":null,"abstract":"<p><p>Accurate prediction of transcription factor binding sites (TFBSs) is essential for understanding gene regulation mechanisms and the etiology of diseases. Despite numerous advances in deep learning for predicting TFBSs, their performance can still be enhanced. In this study, we propose MLSNet, a novel deep learning architecture designed specifically to predict TFBSs. MLSNet innovatively integrates multisize convolutional fusion with long short-term memory (LSTM) networks to effectively capture DNA-sparse higher-order sequence features. Further, MLSNet incorporates super token attention and Bi-LSTM to systematically extract and integrate higher-order DNA shape features. Experimental results on 165 ChIP-seq (chromatin immunoprecipitation followed by sequencing) datasets indicate that MLSNet consistently outperforms several state-of-the-art algorithms in the prediction of TFBSs. Specifically, MLSNet reports average metrics: 0.8306 for ACC, 0.8992 for AUROC, and 0.9035 for AUPRC, surpassing the second-best methods by 1.82%, 1.68%, and 1.54%, respectively. This research delineates the effectiveness of combining multi-size convolutional layers with LSTM and DNA shape-based features in enhancing predictive accuracy. Moreover, this study comprehensively assesses the variability in model performance across different cell lines and transcription factors. The source code of MLSNet is available at https://github.com/minghaidea/MLSNet.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11442149/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142341947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping. 新颖的多组学解构变异自动编码器可获得有意义的疾病亚型。
IF 6.8 2区 生物学
Briefings in bioinformatics Pub Date : 2024-09-23 DOI: 10.1093/bib/bbae512
Zuqi Li, Sonja Katz, Edoardo Saccenti, David W Fardo, Peter Claes, Vitor A P Martins Dos Santos, Kristel Van Steen, Gennady V Roshchupkin
{"title":"Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping.","authors":"Zuqi Li, Sonja Katz, Edoardo Saccenti, David W Fardo, Peter Claes, Vitor A P Martins Dos Santos, Kristel Van Steen, Gennady V Roshchupkin","doi":"10.1093/bib/bbae512","DOIUrl":"https://doi.org/10.1093/bib/bbae512","url":null,"abstract":"<p><p>Unsupervised learning, particularly clustering, plays a pivotal role in disease subtyping and patient stratification, especially with the abundance of large-scale multi-omics data. Deep learning models, such as variational autoencoders (VAEs), can enhance clustering algorithms by leveraging inter-individual heterogeneity. However, the impact of confounders-external factors unrelated to the condition, e.g. batch effect or age-on clustering is often overlooked, introducing bias and spurious biological conclusions. In this work, we introduce four novel VAE-based deconfounding frameworks tailored for clustering multi-omics data. These frameworks effectively mitigate confounding effects while preserving genuine biological patterns. The deconfounding strategies employed include (i) removal of latent features correlated with confounders, (ii) a conditional VAE, (iii) adversarial training, and (iv) adding a regularization term to the loss function. Using real-life multi-omics data from The Cancer Genome Atlas, we simulated various confounding effects (linear, nonlinear, categorical, mixed) and assessed model performance across 50 repetitions based on reconstruction error, clustering stability, and deconfounding efficacy. Our results demonstrate that our novel models, particularly the conditional multi-omics VAE (cXVAE), successfully handle simulated confounding effects and recover biologically driven clustering structures. cXVAE accurately identifies patient labels and unveils meaningful pathological associations among cancer types, validating deconfounded representations. Furthermore, our study suggests that some of the proposed strategies, such as adversarial training, prove insufficient in confounder removal. In summary, our study contributes by proposing innovative frameworks for simultaneous multi-omics data integration, dimensionality reduction, and deconfounding in clustering. Benchmarking on open-access data offers guidance to end-users, facilitating meaningful patient stratification for optimized precision medicine.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11483139/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142458388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MSlocPRED: deep transfer learning-based identification of multi-label mRNA subcellular localization. MSlocPRED:基于深度迁移学习的多标签 mRNA 亚细胞定位识别。
IF 6.8 2区 生物学
Briefings in bioinformatics Pub Date : 2024-09-23 DOI: 10.1093/bib/bbae504
Yun Zuo, Bangyi Zhang, Wenying He, Yue Bi, Xiangrong Liu, Xiangxiang Zeng, Zhaohong Deng
{"title":"MSlocPRED: deep transfer learning-based identification of multi-label mRNA subcellular localization.","authors":"Yun Zuo, Bangyi Zhang, Wenying He, Yue Bi, Xiangrong Liu, Xiangxiang Zeng, Zhaohong Deng","doi":"10.1093/bib/bbae504","DOIUrl":"https://doi.org/10.1093/bib/bbae504","url":null,"abstract":"<p><p>Subcellular localization of messenger ribonucleic acid (mRNA) is a universal mechanism for precise and efficient control of the translation process. Although many computational methods have been constructed by researchers for predicting mRNA subcellular localization, very few of these computational methods have been designed to predict subcellular localization with multiple localization annotations, and their generalization performance could be improved. In this study, the prediction model MSlocPRED was constructed to identify multi-label mRNA subcellular localization. First, the preprocessed Dataset 1 and Dataset 2 are transformed into the form of images. The proposed MDNDO-SMDU resampling technique is then used to balance the number of samples in each category in the training dataset. Finally, deep transfer learning was used to construct the predictive model MSlocPRED to identify subcellular localization for 16 classes (Dataset 1) and 18 classes (Dataset 2). The results of comparative tests of different resampling techniques show that the resampling technique proposed in this study is more effective in preprocessing for subcellular localization. The prediction results of the datasets constructed by intercepting different NC end (Both the 5' and 3' untranslated regions that flank the protein-coding sequence and influence mRNA function without encoding proteins themselves.) lengths show that for Dataset 1 and Dataset 2, the prediction performance is best when the NC end is intercepted by 35 nucleotides, respectively. The results of both independent testing and five-fold cross-validation comparisons with established prediction tools show that MSlocPRED is significantly better than established tools for identifying multi-label mRNA subcellular localization. Additionally, to understand how the MSlocPRED model works during the prediction process, SHapley Additive exPlanations was used to explain it. The predictive model and associated datasets are available on the following github: https://github.com/ZBYnb1/MSlocPRED/tree/main.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11472759/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142458386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Progress and opportunities of foundation models in bioinformatics. 生物信息学基础模型的进展与机遇。
IF 6.8 2区 生物学
Briefings in bioinformatics Pub Date : 2024-09-23 DOI: 10.1093/bib/bbae548
Qing Li, Zhihang Hu, Yixuan Wang, Lei Li, Yimin Fan, Irwin King, Gengjie Jia, Sheng Wang, Le Song, Yu Li
{"title":"Progress and opportunities of foundation models in bioinformatics.","authors":"Qing Li, Zhihang Hu, Yixuan Wang, Lei Li, Yimin Fan, Irwin King, Gengjie Jia, Sheng Wang, Le Song, Yu Li","doi":"10.1093/bib/bbae548","DOIUrl":"10.1093/bib/bbae548","url":null,"abstract":"<p><p>Bioinformatics has undergone a paradigm shift in artificial intelligence (AI), particularly through foundation models (FMs), which address longstanding challenges in bioinformatics such as limited annotated data and data noise. These AI techniques have demonstrated remarkable efficacy across various downstream validation tasks, effectively representing diverse biological entities and heralding a new era in computational biology. The primary goal of this survey is to conduct a general investigation and summary of FMs in bioinformatics, tracing their evolutionary trajectory, current research landscape, and methodological frameworks. Our primary focus is on elucidating the application of FMs to specific biological problems, offering insights to guide the research community in choosing appropriate FMs for tasks like sequence analysis, structure prediction, and function annotation. Each section delves into the intricacies of the targeted challenges, contrasting the architectures and advancements of FMs with conventional methods and showcasing their utility across different biological domains. Further, this review scrutinizes the hurdles and constraints encountered by FMs in biology, including issues of data noise, model interpretability, and potential biases. This analysis provides a theoretical groundwork for understanding the circumstances under which certain FMs may exhibit suboptimal performance. Lastly, we outline prospective pathways and methodologies for the future development of FMs in biological research, facilitating ongoing innovation in the field. This comprehensive examination not only serves as an academic reference but also as a roadmap for forthcoming explorations and applications of FMs in biology.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11512649/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142495365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pioneering bioinformatics with agent-based modelling: an innovative protocol to accurately forecast skin or respiratory allergic reactions to chemical sensitizers. 生物信息学与基于代理的建模的先锋:准确预测对化学致敏剂的皮肤或呼吸道过敏反应的创新方案。
IF 6.8 2区 生物学
Briefings in bioinformatics Pub Date : 2024-09-23 DOI: 10.1093/bib/bbae506
Giulia Russo, Elena Crispino, Silvia Casati, Emanuela Corsini, Andrew Worth, Francesco Pappalardo
{"title":"Pioneering bioinformatics with agent-based modelling: an innovative protocol to accurately forecast skin or respiratory allergic reactions to chemical sensitizers.","authors":"Giulia Russo, Elena Crispino, Silvia Casati, Emanuela Corsini, Andrew Worth, Francesco Pappalardo","doi":"10.1093/bib/bbae506","DOIUrl":"https://doi.org/10.1093/bib/bbae506","url":null,"abstract":"<p><p>The assessment of the allergenic potential of chemicals, crucial for ensuring public health safety, faces challenges in accuracy and raises ethical concerns due to reliance on animal testing. This paper presents a novel bioinformatic protocol designed to address the critical challenge of predicting immune responses to chemical sensitizers without the use of animal testing. The core innovation lies in the integration of advanced bioinformatics tools, including the Universal Immune System Simulator (UISS), which models detailed immune system dynamics. By leveraging data from structural predictions and docking simulations, our approach provides a more accurate and ethical method for chemical safety evaluations, especially in distinguishing between skin and respiratory sensitizers. Our approach integrates a comprehensive eight-step process, beginning with the meticulous collection of chemical and protein data from databases like PubChem and the Protein Data Bank. Following data acquisition, structural predictions are performed using cutting-edge tools such as AlphaFold to model proteins whose structures have not been previously elucidated. This structural information is then utilized in subsequent docking simulations, leveraging both ligand-protein and protein-protein interactions to predict how chemical compounds may trigger immune responses. The core novelty of our method lies in the application of UISS-an advanced agent-based modelling system that simulates detailed immune system dynamics. By inputting the results from earlier stages, including docking scores and potential epitope identifications, UISS meticulously forecasts the type and severity of immune responses, distinguishing between Th1-mediated skin and Th2-mediated respiratory allergic reactions. This ability to predict distinct immune pathways is a crucial advance over current methods, which often cannot differentiate between the sensitization mechanisms. To validate the accuracy and robustness of our approach, we applied the protocol to well-known sensitizers: 2,4-dinitrochlorobenzene for skin allergies and trimellitic anhydride for respiratory allergies. The results clearly demonstrate the protocol's ability to differentiate between these distinct immune responses, underscoring its potential for replacing traditional animal-based testing methods. The results not only support the potential of our method to replace animal testing in chemical safety assessments but also highlight its role in enhancing the understanding of chemical-induced immune reactions. Through this innovative integration of computational biology and immunological modelling, our protocol offers a transformative approach to toxicological evaluations, increasing the reliability of safety assessments.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11471897/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142458389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multi-omics study of brain tissue transcription and DNA methylation revealing the genetic pathogenesis of ADHD. 揭示多动症遗传发病机制的脑组织转录和 DNA 甲基化多组学研究。
IF 6.8 2区 生物学
Briefings in bioinformatics Pub Date : 2024-09-23 DOI: 10.1093/bib/bbae502
Jingkai Wang, Qiu-Wen Zhu, Jia-Hao Mai, Shun Zhang, Yuqing Wang, Jiatong Liang, Ji-Yuan Zhou
{"title":"A multi-omics study of brain tissue transcription and DNA methylation revealing the genetic pathogenesis of ADHD.","authors":"Jingkai Wang, Qiu-Wen Zhu, Jia-Hao Mai, Shun Zhang, Yuqing Wang, Jiatong Liang, Ji-Yuan Zhou","doi":"10.1093/bib/bbae502","DOIUrl":"https://doi.org/10.1093/bib/bbae502","url":null,"abstract":"<p><p>Attention-deficit/hyperactivity disorder (ADHD) is a chronic psychiatric disease that often affects a patient's whole life. Research has found that genetics plays an important role in the development of ADHD. However, there is still a lack of knowledge about the tissue-specific causal effects of biological processes beyond gene expression, such as alternative splicing (AS) and DNA methylation (DNAm), on ADHD. In this paper, a multi-omics study was conducted to investigate the causal effects of the transcription and the DNAm on ADHD, by integrating ADHD genome-wide association data with quantitative trait loci data of gene expression, AS, and DNAm across 14 different brain tissues. The causal effects were estimated using four different two-sample Mendelian randomization methods. Finally, we also prioritized the expression of 866 genes showing significant causal effects, including COMMD5, ENSG00000271904, HYAL3, etc., within at least one brain tissue. We prioritized 966 unique genes that have statistically significant causal AS events, within at least one of the 14 different brain tissues. These genes include PPP1R16A, GGT7, TREM2, etc. Furthermore, through mediation analysis, 106 regulatory pathways were inferred where DNAm influences ADHD through gene expression or AS processes. Our research findings provide guidance for future experimental studies on the molecular mechanisms of ADHD development, and also put forward valuable knowledge for the prevention, diagnosis, and treatment of ADHD.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11479714/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142458360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
dHICA: a deep transformer-based model enables accurate histone imputation from chromatin accessibility. dHICA:一种基于深度转换器的模型,可根据染色质的可及性实现组蛋白的精确推算。
IF 6.8 2区 生物学
Briefings in bioinformatics Pub Date : 2024-09-23 DOI: 10.1093/bib/bbae459
Wen Wen, Jiaxin Zhong, Zhaoxi Zhang, Lijuan Jia, Tinyi Chu, Nating Wang, Charles G Danko, Zhong Wang
{"title":"dHICA: a deep transformer-based model enables accurate histone imputation from chromatin accessibility.","authors":"Wen Wen, Jiaxin Zhong, Zhaoxi Zhang, Lijuan Jia, Tinyi Chu, Nating Wang, Charles G Danko, Zhong Wang","doi":"10.1093/bib/bbae459","DOIUrl":"https://doi.org/10.1093/bib/bbae459","url":null,"abstract":"<p><p>Histone modifications (HMs) are pivotal in various biological processes, including transcription, replication, and DNA repair, significantly impacting chromatin structure. These modifications underpin the molecular mechanisms of cell-type-specific gene expression and complex diseases. However, annotating HMs across different cell types solely using experimental approaches is impractical due to cost and time constraints. Herein, we present dHICA (deep histone imputation using chromatin accessibility), a novel deep learning framework that integrates DNA sequences and chromatin accessibility data to predict multiple HM tracks. Employing the transformer architecture alongside dilated convolutions, dHICA boasts an extensive receptive field and captures more cell-type-specific information. dHICA outperforms state-of-the-art baselines and achieves superior performance in cell-type-specific loci and gene elements, aligning with biological expectations. Furthermore, dHICA's imputations hold significant potential for downstream applications, including chromatin state segmentation and elucidating the functional implications of SNPs (Single Nucleotide Polymorphisms). In conclusion, dHICA serves as a valuable tool for advancing the understanding of chromatin dynamics, offering enhanced predictive capabilities and interpretability.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11421843/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142341942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge Graphs for drug repurposing: a review of databases and methods. 用于药物再利用的知识图谱:数据库和方法综述。
IF 6.8 2区 生物学
Briefings in bioinformatics Pub Date : 2024-09-23 DOI: 10.1093/bib/bbae461
Pablo Perdomo-Quinteiro, Alberto Belmonte-Hernández
{"title":"Knowledge Graphs for drug repurposing: a review of databases and methods.","authors":"Pablo Perdomo-Quinteiro, Alberto Belmonte-Hernández","doi":"10.1093/bib/bbae461","DOIUrl":"https://doi.org/10.1093/bib/bbae461","url":null,"abstract":"<p><p>Drug repurposing has emerged as a effective and efficient strategy to identify new treatments for a variety of diseases. One of the most effective approaches for discovering potential new drug candidates involves the utilization of Knowledge Graphs (KGs). This review comprehensively explores some of the most prominent KGs, detailing their structure, data sources, and how they facilitate the repurposing of drugs. In addition to KGs, this paper delves into various artificial intelligence techniques that enhance the process of drug repurposing. These methods not only accelerate the identification of viable drug candidates but also improve the precision of predictions by leveraging complex datasets and advanced algorithms. Furthermore, the importance of explainability in drug repurposing is emphasized. Explainability methods are crucial as they provide insights into the reasoning behind AI-generated predictions, thereby increasing the trustworthiness and transparency of the repurposing process. We will discuss several techniques that can be employed to validate these predictions, ensuring that they are both reliable and understandable.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11426166/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142341945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Therapeutic peptides identification via kernel risk sensitive loss-based k-nearest neighbor model and multi-Laplacian regularization. 通过基于核风险敏感损失的 k-nearest neighbor 模型和多拉普拉斯正则化识别治疗肽。
IF 6.8 2区 生物学
Briefings in bioinformatics Pub Date : 2024-09-23 DOI: 10.1093/bib/bbae534
Wenyu Zhang, Yijie Ding, Leyi Wei, Xiaoyi Guo, Fengming Ni
{"title":"Therapeutic peptides identification via kernel risk sensitive loss-based k-nearest neighbor model and multi-Laplacian regularization.","authors":"Wenyu Zhang, Yijie Ding, Leyi Wei, Xiaoyi Guo, Fengming Ni","doi":"10.1093/bib/bbae534","DOIUrl":"https://doi.org/10.1093/bib/bbae534","url":null,"abstract":"<p><p>Therapeutic peptides are therapeutic agents synthesized from natural amino acids, which can be used as carriers for precisely transporting drugs and can activate the immune system for preventing and treating various diseases. However, screening therapeutic peptides using biochemical assays is expensive, time-consuming, and limited by experimental conditions and biological samples, and there may be ethical considerations in the clinical stage. In contrast, screening therapeutic peptides using machine learning and computational methods is efficient, automated, and can accurately predict potential therapeutic peptides. In this study, a k-nearest neighbor model based on multi-Laplacian and kernel risk sensitive loss was proposed, which introduces a kernel risk loss function derived from the K-local hyperplane distance nearest neighbor model as well as combining the Laplacian regularization method to predict therapeutic peptides. The findings indicated that the suggested approach achieved satisfactory results and could effectively predict therapeutic peptide sequences.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11495874/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142495368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BANDRP: a bilinear attention network for anti-cancer drug response prediction based on fingerprint and multi-omics. BANDRP:基于指纹和多组学的抗癌药物反应预测双线性注意网络。
IF 6.8 2区 生物学
Briefings in bioinformatics Pub Date : 2024-09-23 DOI: 10.1093/bib/bbae493
Cheng Cao, Haochen Zhao, Jianxin Wang
{"title":"BANDRP: a bilinear attention network for anti-cancer drug response prediction based on fingerprint and multi-omics.","authors":"Cheng Cao, Haochen Zhao, Jianxin Wang","doi":"10.1093/bib/bbae493","DOIUrl":"https://doi.org/10.1093/bib/bbae493","url":null,"abstract":"<p><p>Predicting anti-cancer drug response can help with personalized cancer treatment and is an important topic in modern oncology research. Although some methods have been used for anti-cancer drug response prediction, how to effectively integrate various features related to cancer cell lines, drugs, and their known responses is still affected by the redundant information of input features and the complex interactions between features. In this study, we propose a bilinear attention model, named BANDRP, based on multiple omics data of cancer cell lines and multiple molecular fingerprints of drugs to predict potential anti-cancer drug responses. Compared with existing models, BANDRP uses gene expression data to calculate pathway enrichment scores to enrich the features of cancer cell lines and can automatically learn the interactive information of cancer cell lines and drugs through bilinear attention networks. Benchmarking and independent tests demonstrate that BANDRP surpasses baseline models and exhibits robust generalization performance. Ablation experiments affirm the optimality of the current model architecture and feature selection scheme for our prediction task. Furthermore, analytical experiments and case studies on unknown anti-cancer drug response predictions underscore BANDRP's potential as a potent and reliable framework for predicting anti-cancer drug response.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11479717/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142458364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信