Louis Plyer, Alexey A Orlov, Tagir N Akhmetshin, Erik Yeghyan, Fanny Bonachera, Dragos Horvath, Alexandre Varnek
{"title":"Interpretable and Scalable Similarity Metrics for DNA-Encoded Library Design Using Generative Topographic Mapping.","authors":"Louis Plyer, Alexey A Orlov, Tagir N Akhmetshin, Erik Yeghyan, Fanny Bonachera, Dragos Horvath, Alexandre Varnek","doi":"10.1002/minf.70026","DOIUrl":"10.1002/minf.70026","url":null,"abstract":"<p><p>The growing number and size of DNA-encoded libraries (DELs), together with the vast space of possible DEL designs, demand interpretable and scalable criteria for selecting which libraries to construct and screen against a given target. An ideal target-focused DEL shows both strong similarity with an active reference compound collection and high intra-DEL diversity. Chemography with Generative Topographic Mapping (GTM) was shown to be a promising approach for selecting DELs, offering both intuitive visualization and fast quantitative analysis scalable to thousands of DEL designs. This is achieved by defining each library by a \"stand-alone\" vector, the comparison of which precludes costly pairwise inter-molecular similarity calculations. However, the extent to which such \"stand-alone\" (SA) approaches in general, and GTM-derived SA metrics in particular, recover DELs that are reference-proximal and chemically diverse as evaluated by conventional compound pair-matching (CP) metrics in the initial descriptor space remains insufficiently characterized. In this article, the comparative analysis of the Morgan count fingerprint-based chemical-library similarity versus GTM-derived metrics, using 100 diverse DEL subsets and a reference set of compounds tested against cyclin-dependent kinase 2 (CDK2) from ChEMBL, was performed. GTM-based SA metrics provide robust approximations for \"gold standard\" molecular descriptor space CP metrics for DEL selection: Spearman rank correlations fall in the 0.6-0.7 range. Our results demonstrate that GTM helps to identify DELs that best span the reference space according to same \"gold standard\" molecular descriptor space metrics: SA GTM-driven rankings of libraries achieve enrichment factors at 5% (EF5%) of 4-12 (in terms of finding \"gold standard\" top libraries within the 5% best ranked by GTM)-always picking 2 out of the top 3 libraries. The accompanying two-dimensional landscapes make intra- and interlibrary diversity visually accessible, supporting rapid, interpretable screening of alternative DEL designs. Collectively, these results position GTM as an efficient tool for chemical-library similarity assessment and target-focused DEL selection.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"45 3","pages":"e70026"},"PeriodicalIF":3.1,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13019124/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147513447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Adittya Pal, Rolf Fagerberg, Jakob Lykke Andersen, Peter Dittrich, Daniel Merkle
{"title":"Finding Pathways in Reaction Networks Guided by Energy Barriers Using Integer Linear Programing.","authors":"Adittya Pal, Rolf Fagerberg, Jakob Lykke Andersen, Peter Dittrich, Daniel Merkle","doi":"10.1002/minf.70021","DOIUrl":"https://doi.org/10.1002/minf.70021","url":null,"abstract":"<p><p>Analyzing synthesis pathways for target molecules in a chemical reaction network annotated with information on the kinetics of individual reactions is an area of active study. This work presents a computational methodology for searching for pathways in reaction networks which is based on integer linear programing and the modeling of reaction networks by directed hypergraphs. Often multiple pathways fit the given search criteria. To rank them, we develop an objective function based on physical arguments maximizing the probability of the pathway. We furthermore develop an automated pipeline to estimate the energy barriers of individual reactions in reaction networks. Combined, the methodology facilitates flexible and kinetically informed pathway investigations on large reaction networks by computational means, even for networks coming without kinetic annotation, such as those created via generative approaches for expanding molecular spaces. To demonstrate the methodology, we apply it on a chemical reaction network generated from 2-hydroxyethanenitrile, water, and ammonia, where we search for pathways to glycine and 2-hydroxyethanoic acid using the input molecules as precursors.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"45 3","pages":"e70021"},"PeriodicalIF":3.1,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147513395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lisa Lombardo, Francesco Agnello, Rosaria Gitto, Laura De Luca
{"title":"A Multistep Computational Approach to Achieve a Complete Human 5-Lipoxygenase Structure and Provide a Pharmacophore Model for Further Drug Design.","authors":"Lisa Lombardo, Francesco Agnello, Rosaria Gitto, Laura De Luca","doi":"10.1002/minf.70025","DOIUrl":"10.1002/minf.70025","url":null,"abstract":"<p><p>Human 5-lipoxygenase (5-LOX) plays a crucial role in the biosynthesis of leukotrienes (LTs). Therefore, 5-LOX inhibitors are designed as effective agents for the treatment of several diseases such as asthma, cardiovascular disorders, allergies, and cancer. Insights into crystal structures of several 5-LOX isoforms have revealed that this protein adopts two different conformations (open/closed) through modulation of its Hα2 and arched helix regions, which are conditioned by the presence or absence of ligand in the active site; moreover, these structures are incomplete in regions critical for ligand binding. To advance the design of 5-LOX inhibitors, we developed a computational procedure to reconstruct the first full-length open conformation structure of 5-LOX complexed with chelating inhibitor within the active site. Dynamic simulations and protein model validation confirmed the quality of our model, which was subsequently used for docking analyses and culminated in the development of a structure-based pharmacophore model. These computational studies might constitute powerful tools for rationally designing and identifying novel 5-LOX iron chelator inhibitors.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"45 3","pages":"e70025"},"PeriodicalIF":3.1,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13014066/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147513416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sania Saeed, Shahrukh Khan, Aneeqa Noor, Inga Zerr, Saima Zafar
{"title":"Therapeutic Potential of Amyloid-β Interactors in Rapidly Progressive Alzheimer's Disease-An In Silico Study.","authors":"Sania Saeed, Shahrukh Khan, Aneeqa Noor, Inga Zerr, Saima Zafar","doi":"10.1002/minf.70024","DOIUrl":"https://doi.org/10.1002/minf.70024","url":null,"abstract":"<p><p>Rapidly progressive Alzheimer's disease (rpAD) is a rare but severe form of Alzheimer's disease characterized by accelerated cognitive decline and limited therapeutic options. Conventional anti-amyloid-β interventions have shown little success due to poor target specificity, neurotoxicity, and lack of efficacy, underscoring the need for novel therapeutic strategies. This study aimed to identify and prioritize molecular targets associated with rpAD by investigating the protein interactome of amyloid-β (Aβ<sub>42</sub>) using integrative computational approaches. Functional enrichment, protein-protein interaction network analysis, and community clustering revealed that rpAD-specific Aβ<sub>42</sub> interactors were predominantly involved in mitochondrial bioenergetics, redox regulation, and cytoskeletal stability, pathways central to neuronal survival and synaptic function. Molecular docking identified fumarate hydratase, carbonyl reductase 1, and the F-actin capping protein as high-affinity interactors of Aβ<sub>42</sub>, linking these proteins to energy failure, oxidative stress, and synaptic dysfunction. Virtual screening of a therapeutic drug library against fumarate hydratase identified several compounds with strong binding affinities, among which quinestrol, estradiol benzoate, norethindrone, tamibarotene, drospirenone, and ketanserin emerged as lead candidates. Pharmacokinetic profiling, including ADMET modeling, confirmed their blood-brain barrier permeability and drug-likeness, supporting their potential as central nervous system active agents. Together, this work highlights key molecular targets in rpAD and proposes repurposed, pharmacologically diverse compounds with multitarget neuroprotective potential. By utilizing in silico analysis, the study provides a rational framework for target discovery and drug prioritization in rpAD, offering a foundation for future experimental validation and the development of translational research.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"45 2","pages":"e70024"},"PeriodicalIF":3.1,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146202169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Statistics and Ontology of Published Small Molecule Ring Systems.","authors":"Lutz Weber","doi":"10.1002/minf.70022","DOIUrl":"10.1002/minf.70022","url":null,"abstract":"<p><p>Diversity and properties of ring systems contained in small molecules are of high interest for applications such as drug discovery and materials science. In the present work, we extract, analyze, and classify ring systems found in small-molecule compounds of open-access databases such as PubChem, ChEMBL, DrugCentral, IUPHAR, and LOTUS. We also developed a classification taxonomy of frequently found ring systems using an Open Biomedical Ontologies (OBO)-format ontology. Open-access software was used to automate the classification of compounds into their respective ring system classes. As an example, the natural product compounds of the LOTUS database were classified and are available as an open-access ontology dataset.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"45 2","pages":"e70022"},"PeriodicalIF":3.1,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147307815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Infrared Spectral Descriptors for Reaction Yield Prediction: Toward Redefining Experimental Spaces.","authors":"Yuya Endo, Hiromasa Kaneko","doi":"10.1002/minf.70019","DOIUrl":"10.1002/minf.70019","url":null,"abstract":"<p><p>Yield prediction in catalytic reactions is essential for improving chemical process efficiency and product quality. Ligands significantly influence reactivity and selectivity, highlighting the need for descriptors that accurately capture their structural and electronic properties. In this study, we focus on infrared (IR) spectra, which reflects molecular vibrational modes, and propose novel descriptors based on wavenumber information. We evaluated the predictive performance of these descriptors using two datasets: direct Pd-catalyzed arylation and Suzuki-Miyaura coupling reactions. The wavenumber-based IR descriptors outperformed conventional molecular descriptors and structural fingerprints (one-hot encoding, Mordred, MACCS, Morgan fingerprint, RDKit, and density functional theory). Notably, descriptors limited to the fingerprint region (0-1700 cm<sup>-1</sup>) effectively captured key molecular features, contributing to both high prediction accuracy and improved chemical interpretability. Our results indicate that IR-based descriptors can achieve strong generalization performance even with small datasets. This approach offers a promising strategy for redefining reaction condition spaces and enhancing the interpretability of predictive models, thereby supporting more informed experimental design.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"45 2","pages":"e70019"},"PeriodicalIF":3.1,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12899324/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146166057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DeepMoDRP: A Multi-Omics-Based Deep Learning Framework for Drug Response Prediction in Brain Cancer.","authors":"Yuxuan Li, Xiumin Shi, Lu Wang, Lianzhong Zhang","doi":"10.1002/minf.70020","DOIUrl":"https://doi.org/10.1002/minf.70020","url":null,"abstract":"<p><p>Considering the limited efficacy of existing pharmacotherapies for brain tumors, the development of accurate predictive models is essential for advancing neuro-oncology treatment strategies. In this article, we introduce a drug response prediction model, DeepMoDRP, specifically designed for brain cancer. This model integrates genomic, transcriptomic, and epigenomic data from various brain tumor cell lines, including low-grade glioma, glioblastoma multiforme, and diffuse large B-cell lymphoma. To address the high-dimensional complexity inherent in gene expression and copy number variations within cell line data, we have integrated sparse autoencoders (AEs) and denoising AEs to reduce noise and redundancy. Meanwhile, one-dimensional convolutional neural networks are utilized to process the low-dimensional mutation and DNA methylation data. Additionally, a multiscale graph neural network is implemented to handle the drug-related data. Finally, fully connected networks are employed to generate predictions of drug responses. A series of experiments were conducted utilizing a brain tumor dataset that was extracted and curated from public databases. The experimental results demonstrate that the proposed DeepMoDRP outperforms the performance of state-of-the-art pan-cancer baseline models in predicting drug responses for brain tumors. The downstream analysis indicates that the DeepMoDRP holds significant promise for the treatment of brain tumors.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"45 2","pages":"e70020"},"PeriodicalIF":3.1,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146202110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Read-Across Structure-Property Relationship-Based Superior Prediction of Fraction Unbound in Plasma from Chemical Structure: Interpretable Models with Minimum Descriptors.","authors":"Indrasis Dasgupta, Samima Khatun, Shovanlal Gayen","doi":"10.1002/minf.70023","DOIUrl":"https://doi.org/10.1002/minf.70023","url":null,"abstract":"<p><p>Accurately predicting the fraction unbound in plasma (f<sub>up</sub>) from chemical structures is essential for understanding pharmacokinetic characteristics during the early stage of drug discovery. This prediction serves as a valuable tool for minimizing late-stage setbacks and refining subsequent screening processes. Conventional approaches often rely on complex computational methodologies that may require extensive descriptor sets, resulting in opaque models with limited interpretability. In this study, we applied the read-across strategy in combination with traditional quantitative structure-property relationship to predict f<sub>up</sub> while minimizing descriptor complexity. Our method employs interpretable models (regression and classification), facilitating insight into the underlying structure-property relationships governing plasma protein binding. Through comprehensive validation and comparison with different machine learning methods, we demonstrated the superior predictive performance of quantitative read-across structure-property relationship multiple linear regression and classification-based read-across structure-property relatonship respectively. support vector classifier models across diverse chemical compounds. This approach offers a valuable tool for predicting f<sub>up</sub> in the process of drug discovery. Overall, this study aims to advance the field of pharmacokinetic modeling by applying the read-across strategy that improves predictive power with interpretability. By elucidating the complex relationship between chemical structures and f<sub>up</sub>, our best models have the potential to formulate more rational drug design approaches, ultimately contributing to the development of more effective therapeutics.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"45 2","pages":"e70023"},"PeriodicalIF":3.1,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146202152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anastasia Rudik, Leonid Stolbov, Alexey Lagunin, Dmitry Filimonov, Vladimir Poroikov
{"title":"MetaStab-Analyzer: Classification and Regression Models for Metabolic Stability Prediction.","authors":"Anastasia Rudik, Leonid Stolbov, Alexey Lagunin, Dmitry Filimonov, Vladimir Poroikov","doi":"10.1002/minf.70018","DOIUrl":"https://doi.org/10.1002/minf.70018","url":null,"abstract":"<p><p>The pharmacokinetic profile of a potential drug is largely determined by its metabolic stability, which reflects its susceptibility to biotransformation. Metabolic stability data allow one to assess the therapeutic value of a compound and its toxicological risk. This assesment relies primarily on pharmacokinetic parameters, particularly half-life (t<sub>1/2</sub>) and clearance (CL), which are typically determined using in vitro systems including hepatocytes and liver microsomal fractions. Using the publicly available ChEMBL v. 35 and PubChem databases, we collected over 8000 chemical compounds with experimental intrinsic CL and/or half-life data from liver microsome assays obtained in mice, rats, and humans. Different thresholds were applied to differentiate the stable and unstable molecules. The Naive Bayesian classifier with MNA (Multilevel Neighborhoods of Atoms) descriptors and Self-Consistent Extreme Classifier (SCEC) with QNA (Quantitative Neighborhoods of Atoms) descriptors were used for creating classification models. The accuracy (AUC) of most classification models exceeded 0.85. Self-Consistent Regression was used to create quantitative models. The coefficient of determination of the regression models varied from 0.35 (rat, t<sub>1/2</sub>) to 0.7 (human, CL<sub>int</sub>). These models were integrated into the freely available web application MetaStab-Analyzer, which provides a unique combination of qualitative (stable/unstable/moderate) and quantitative predictions for three species. A key feature of the application is the providing of numerical metrics for each prediction, which increases its interpretability. This combination of innovative algorithms (SCR and SCEC), dual qualitative-quantitative assessment, and a user-friendly interface is not available in any existing tool. MetaStab-Analyzer is freely available at https://www.way2drug.com/metastab/.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"45 2","pages":"e70018"},"PeriodicalIF":3.1,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146119507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"11<sup>th</sup> National Conference of the French Chemoinformatics Society (SFCi).","authors":"Alban Lepailleur, Ronan Bureau","doi":"10.1002/minf.70017","DOIUrl":"10.1002/minf.70017","url":null,"abstract":"<p><p>The French Chemoinformatics Society (SFCi) is a learned society that unites French academics, students and industrial scientists in chemoinformatics, promoting this discipline at the interface of chemistry, computer science, data science and AI through conferences, training initiatives, networking and community-building.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"45 1","pages":"e70017"},"PeriodicalIF":3.1,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146065224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}