Vito Paolo Pastore, Riccardo Rorato, Larbi Touijer, Roberto Di Via, Francesca Odone, Lisa M Galli, Laura W Burrus, Simone Bianco
{"title":"FiloAnalyzer: a deep learning approach for cell filopodia segmentation.","authors":"Vito Paolo Pastore, Riccardo Rorato, Larbi Touijer, Roberto Di Via, Francesca Odone, Lisa M Galli, Laura W Burrus, Simone Bianco","doi":"10.1186/s12859-026-06437-9","DOIUrl":"https://doi.org/10.1186/s12859-026-06437-9","url":null,"abstract":"","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2026-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147761090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Protein and ligand novelty in drug-target interaction prediction: a dual-encoder fusion strategy for more interpretable and generalizable modeling.","authors":"Lina Abou-Abbas, Khadidja Henni","doi":"10.1186/s12859-026-06457-5","DOIUrl":"https://doi.org/10.1186/s12859-026-06457-5","url":null,"abstract":"<p><p>Predicting drug-target interactions (DTIs) is a fundamental task in computational drug discovery, where reliable generalization to novel compounds and protein targets is essential for practical virtual screening and drug repurposing. However, most deep learning models are evaluated using random or single-split settings that fail to reflect the ligand and protein novelty conditions encountered in real-world discovery pipelines. This work proposes a dual-encoder fusion framework for robust and interpretable DTI prediction. The model combines pretrained ESM protein embeddings with two complementary ligand representations: a ChemBERTa-based molecular language encoder and a graph-based structural encoder. Predictions from both branches are integrated through decision-level fusion. To rigorously evaluate generalization, we adopt a novelty-aware protocol that isolates ligand novelty, protein novelty, and their joint occurrence. A ligand-centered, gradientbased interpretability analysis is also employed to examine how molecular substructures contribute to binding predictions. Experiments on the largescale BindingDB dataset show that ligand novelty induces minimal performance degradation, with strong results under warm and cold-drug settings (F1 = 0.84-0.87; AUC = 0.87-0.89). In contrast, protein novelty emerges as the dominant generalization bottleneck, producing the largest performance drops under cold-protein and double-cold conditions. Across all scenarios, the fusion model exhibits the most stable behavior, achieving an AUC of 0.90 in the warm setting. External validation on the Davis and KIBA datasets further demonstrates consistent AUC values (0.60-0.64) and high recall despite substantial biochemical domain shift. These results highlight the importance of novelty-aware evaluation and show that decision-level fusion provides a practical pathway toward reliable and interpretable DTI prediction.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2026-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147761098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Genomica: linear mixed model based, multiple hypothesis testing corrected, ortholog functional enrichment analysis.","authors":"Salvatore Galgano","doi":"10.1186/s12859-026-06450-y","DOIUrl":"https://doi.org/10.1186/s12859-026-06450-y","url":null,"abstract":"<p><strong>Background: </strong>The analysis of ortholog genes derived from metagenomic experiments provides an invaluable opportunity to assess the functional role of microbial communities towards, for example, antimicrobial resistance or biochemical pathways under different experimental conditions. Nevertheless, the integration of the statistical analysis of these complex data sets and the enrichment of the derived significantly differential abundant orthologs is not currently facilitated by existing software. Genomica is an R package that, with minimal input from the user, allows to perform a double-step analysis of functional orthologs from the KEGG Orthology. The pipeline is carried out via combining false discovery rate corrected linear mixed models to functional enrichment analysis through integrating established R pipelines (i.e., lme4 and MicrobiomeProfiler).</p><p><strong>Results: </strong>Only two data frames are needed as input to run Genomica, which contain data and metadata, respectively. The fast pipeline integrated within the function Genomica allows to analyze 4000 orthologs in circa 3 min. The outputs are collected in a single directory, containing publication-ready results from the linear mixed model and from the enrichment analysis. The Benjamini & Hochberg correction is applied to the results from the linear mixed model, therefore only P adjusted significant comparisons are further included in the enrichment analysis.</p><p><strong>Conclusions: </strong>Genomica is a simple-to-use R package to analyze complex datasets, integrating a well-founded statistical analysis, accounting for the calculation of the type I error under repeated testing, with the enrichment analysis of the significantly differential abundant orthologs across experimental conditions, all with minimal input from the user.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2026-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147715758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reciprocal best matching: a new pipeline for scoring models with unknown stoichiometry in CASP experiments.","authors":"Rongqing Yuan, Jing Zhang, Qian Cong","doi":"10.1186/s12859-026-06439-7","DOIUrl":"https://doi.org/10.1186/s12859-026-06439-7","url":null,"abstract":"","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2026-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147715742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A novel IVN-entropy based distance-driven MARCOS framework for evaluating and ranking global green hydrogen-producing countries.","authors":"Venkata Prasanna Nagari, Vinoth Subbiah","doi":"10.1186/s12859-026-06425-z","DOIUrl":"https://doi.org/10.1186/s12859-026-06425-z","url":null,"abstract":"","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2026-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147697450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}