Molecular Informatics最新文献_第3页

An Integrated Fuzzy Neural Network and Topological Data Analysis for Molecular Graph Representation Learning and Property Forecasting. 基于模糊神经网络和拓扑数据分析的分子图表示学习和性质预测。

IF 2.8 4区医学

Molecular Informatics Pub Date : 2025-03-01 DOI: 10.1002/minf.202400335

Phu Pham

{"title":"An Integrated Fuzzy Neural Network and Topological Data Analysis for Molecular Graph Representation Learning and Property Forecasting.","authors":"Phu Pham","doi":"10.1002/minf.202400335","DOIUrl":"10.1002/minf.202400335","url":null,"abstract":"Within a recent decade, graph neural network (GNN) has emerged as a powerful neural architecture for various graph-structured data modelling and task-driven representation learning problems. Recent studies have highlighted the remarkable capabilities of GNNs in handling complex graph representation learning tasks, achieving state-of-the-art results in node/graph classification, regression, and generation. However, most traditional GNN-based architectures like GCN and GraphSAGE still faced several challenges related to the capability of preserving the multi-scaled topological structures. These models primarily focus on capturing local neighborhood information, often failing to retain global structural features essential for graph-level representation and classification tasks. Furthermore, their expressiveness is limited when learning topological structures in complex molecular graph datasets. To overcome these limitations, in this paper, we proposed a novel graph neural architecture which is an integration between neuro-fuzzy network and topological graph learning approach, naming as: FTPG. Specifically, within our proposed FTPG model, we introduce a novel approach to molecular graph representation and property prediction by integrating multi-scaled topological graph learning with advanced neural components. The architecture employs separate graph neural learning modules to effectively capture both local graph-based structures as well as global topological features. Moreover, to further address feature uncertainty in the global-view representation, a multi-layered neuro-fuzzy network is incorporated within our model to enhance the robustness and expressiveness of the learned molecular graph embeddings. This combinatorial approach can assist to leverage the strengths of multi-view and multi-modal neural learning, enabling FTPG to deliver superior performance in molecular graph tasks. Extensive experiments on real-world/benchmark molecular datasets demonstrate the effectiveness of our proposed FTPG model. It consistently outperforms state-of-the-art GNN-based baselines categorized in different approaches, including canonical local proximity message passing based, graph transformer-based, and topology-driven approaches.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"44 3","pages":"e202400335"},"PeriodicalIF":2.8,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143616256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Discovery of New HER2 Inhibitors via Computational Docking, Pharmacophore Modeling, and Machine Learning. 通过计算对接、药效团建模和机器学习发现新的HER2抑制剂。

IF 2.8 4区医学

Molecular Informatics Pub Date : 2025-02-01 DOI: 10.1002/minf.202400336

Aseel Yasin Matrouk, Haneen Mohammad, Safa Daoud, Mutasem Omar Taha

{"title":"Discovery of New HER2 Inhibitors via Computational Docking, Pharmacophore Modeling, and Machine Learning.","authors":"Aseel Yasin Matrouk, Haneen Mohammad, Safa Daoud, Mutasem Omar Taha","doi":"10.1002/minf.202400336","DOIUrl":"10.1002/minf.202400336","url":null,"abstract":"The human epidermal growth factor receptor 2 (HER2) is a critical oncogene implicated in the development of various aggressive cancers, particularly breast cancer. Discovering novel HER2 inhibitors is crucial for expanding therapeutic options for HER2-related malignancies. In this study, we present a computational workflow that focuses on generating pharmacophores derived from docked poses of a selected list of 15 diverse, potent HER2 inhibitors, utilizing flexible docking. The resulting pharmacophores, along with other physicochemical molecular descriptors, were then evaluated in a machine learning-quantitative structure-activity relationship (ML-QSAR) analysis against 1,272 HER2 inhibitors. Several machine learning methods were assessed, and a genetic function algorithm (GFA) was employed for feature selection. Ultimately, GFA combined with Bagging and J48Graft classifiers produced the best self-consistent and predictive models. These models highlighted the significance of two pharmacophores, Hypo_1 and Hypo_2, in distinguishing potent from less active inhibitors. The successful ML-QSAR models and their associated pharmacophores were used to screen the National Cancer Institute (NCI) database for novel HER2 inhibitors. Three promising anti-HER2 leads were identified, with the top-performing lead demonstrating an experimental anti-HER2 IC50 value of 3.85 μM. Notably, the three inhibitors exhibited distinct chemical scaffolds compared to existing HER2 inhibitors, as indicated by principal component analysis.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"44 2","pages":"e202400336"},"PeriodicalIF":2.8,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143458679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MAYA (Multiple ActivitY Analyzer): An Open Access Tool to Explore Structure-Multiple Activity Relationships in the Chemical Universe. MAYA（多活性分析仪）：一个开放访问工具，探索化学宇宙中的结构-多活性关系。

IF 2.8 4区医学

Molecular Informatics Pub Date : 2025-02-01 DOI: 10.1002/minf.202400306

J Israel Espinoza-Castañeda, José L Medina-Franco

引用次数: 0

An Attempt to Classify Elementary Reactions on the Basis of TS Motifs. 基于TS基序对元素反应进行分类的尝试。

IF 2.8 4区医学

Molecular Informatics Pub Date : 2025-02-01 DOI: 10.1002/minf.202400040

Kenji Hori, Yujiro Matsuo, Toru Yamaguchi, Kimito Funatsu

引用次数: 0

Predicting the Price of Molecules Using Their Predicted Synthetic Pathways. 利用预测的合成途径预测分子的价格。

IF 2.8 4区医学

Molecular Informatics Pub Date : 2025-02-01 DOI: 10.1002/minf.202400039

Massina Abderrahmane, Hamza Tajmouati, Vinicius Barros Ribeiro da Silva, Quentin Perron

{"title":"Predicting the Price of Molecules Using Their Predicted Synthetic Pathways.","authors":"Massina Abderrahmane, Hamza Tajmouati, Vinicius Barros Ribeiro da Silva, Quentin Perron","doi":"10.1002/minf.202400039","DOIUrl":"10.1002/minf.202400039","url":null,"abstract":"Currently, numerous metrics allow chemists and computational chemists to refine and filter libraries of virtual molecules in order to prioritize their synthesis. Some of the most commonly used metrics and models are QSAR models, docking scores, diverse druggability metrics, and synthetic feasibility scores to name only a few. To our knowledge, among the known metrics, a function which estimates the price of a novel virtual molecule and which takes into account the availability and price of starting materials has not been considered before in literature. Being able to make such a prediction could improve and accelerate the decision-making process related to the cost-of-goods. Taking advantage of recent advances in the field of Computer Aided Synthetic Planning (CASP), we decided to investigate if the predicted retrosynthetic pathways of a given molecule and the prices of its associated starting materials could be good features to predict the price of that compound. In this work, we present a deep learning model, RetroPriceNet, that predicts the price of molecules using their predicted synthetic pathways. On a holdout test set, the model achieves better performance than the state-of-the-art model. The developed approach takes into account the synthetic feasibility of molecules and the availability and prices of the starting materials.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"44 2","pages":"e202400039"},"PeriodicalIF":2.8,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143066819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Prediction of the Appropriate Temperature and Pressure for Polymer Dissolution Using Machine Learning Models. 使用机器学习模型预测聚合物溶解的适当温度和压力。

IF 2.8 4区医学

Molecular Informatics Pub Date : 2025-02-01 DOI: 10.1002/minf.202400193

Dorsa Dadashi, Marjan Kaedi, Parsa Dadashi, Suprakas Sinha Ray

{"title":"Prediction of the Appropriate Temperature and Pressure for Polymer Dissolution Using Machine Learning Models.","authors":"Dorsa Dadashi, Marjan Kaedi, Parsa Dadashi, Suprakas Sinha Ray","doi":"10.1002/minf.202400193","DOIUrl":"10.1002/minf.202400193","url":null,"abstract":"The widespread use of polymer solutions in the chemical industry poses a significant challenge in determining optimal dissolution conditions. Traditionally, researchers have relied on experimental methods to estimate the processing parameters needed to dissolve polymers, often requiring numerous iterations of testing different temperatures and pressures. This approach is both costly and time-consuming. In this study, for the first time, we present a machine learning-based approach to predict the minimum temperature and pressure required for polymer dissolution, correlating molecular weight and chemical structure of both the polymer and solvent and its weight percent. Using a dataset compiled from existing literature, which includes key factors influencing polymer dissolution, we also extracted chemical bond information from the molecular structures of polymer-solvent systems. Six different machine learning algorithms, including linear regression, k-nearest neighbors, regression trees, random forests, multilayer perceptron neural networks, and support vector regression, were employed to develop predictive models. Among these, the Random Forest model achieved the highest accuracy, with R2 values of 0.931 and 0.942 for temperature and pressure predictions, respectively. This novel approach eliminates the need for repetitive experimental testing, offering a more efficient pathway to determining dissolution conditions.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"44 2","pages":"e202400193"},"PeriodicalIF":2.8,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143391324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

KNIME Workflows for Chemoinformatic Characterization of Chemical Databases. 用于化学数据库化学信息学表征的KNIME工作流程。

IF 2.8 4区医学

Molecular Informatics Pub Date : 2025-02-01 DOI: 10.1002/minf.202400337

Carlos D Ramírez-Márquez, José L Medina-Franco

引用次数: 0

Exploration of the Global Minimum and Conical Intersection with Bayesian Optimization. 用贝叶斯优化方法探索全局最小和圆锥交问题。

IF 2.8 4区医学

Molecular Informatics Pub Date : 2025-02-01 DOI: 10.1002/minf.202400041

Riho Somaki, Taichi Inagaki, Miho Hatanaka

{"title":"Exploration of the Global Minimum and Conical Intersection with Bayesian Optimization.","authors":"Riho Somaki, Taichi Inagaki, Miho Hatanaka","doi":"10.1002/minf.202400041","DOIUrl":"10.1002/minf.202400041","url":null,"abstract":"Conventional molecular geometry searches on a potential energy surface (PES) utilize energy gradients from quantum chemical calculations. However, replacing energy calculations with noisy quantum computer measurements generates errors in the energies, which makes geometry optimization using the energy gradient difficult. One gradient-free optimization method that can potentially solve this problem is Bayesian optimization (BO). To use BO in geometry search, an acquisition function (AF), which involves an objective variable, must be defined suitably. In this study, we propose a strategy for geometry searches using BO and examine the appropriate AFs to explore two critical structures: the global minimum (GM) on the singlet ground state (S0) and the most stable conical intersection (CI) point between S0 and the singlet excited state. We applied our strategy to two molecules and located the GM and the most stable CI geometries with high accuracy for both molecules. We also succeeded in the geometry searches even when artificial random noises were added to the energies to simulate geometry optimization using noisy quantum computer measurements.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"44 2","pages":"e202400041"},"PeriodicalIF":2.8,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11781018/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143066818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Ultra-Large Virtual Screening: Definition, Recent Advances, and Challenges in Drug Design. 超大虚拟筛选：药物设计的定义、最新进展和挑战。

IF 2.8 4区医学

Molecular Informatics Pub Date : 2025-01-01 Epub Date: 2024-12-05 DOI: 10.1002/minf.202400305

Gabriel Corrêa Veríssimo, Rafaela Salgado Ferreira, Vinícius Gonçalves Maltarollo

{"title":"Ultra-Large Virtual Screening: Definition, Recent Advances, and Challenges in Drug Design.","authors":"Gabriel Corrêa Veríssimo, Rafaela Salgado Ferreira, Vinícius Gonçalves Maltarollo","doi":"10.1002/minf.202400305","DOIUrl":"10.1002/minf.202400305","url":null,"abstract":"Virtual screening (VS) in drug design employs computational methodologies to systematically rank molecules from a virtual compound library based on predicted features related to their biological activities or chemical properties. The recent expansion in commercially accessible compound libraries and the advancements in artificial intelligence (AI) and computational power - including enhanced central processing units (CPUs), graphics processing units (GPUs), high-performance computing (HPC), and cloud computing - have significantly expanded our capacity to screen libraries containing over 109 molecules. Herein, we review the concept of ultra-large virtual screening (ULVS), focusing on the various algorithms and methodologies employed for virtual screening at this scale. In this context, we present the software utilized, applications, and results of different approaches, such as brute force docking, reaction-based docking approaches, machine learning (ML) strategies applied to docking or other VS methods, and similarity/pharmacophore search-based techniques. These examples represent a paradigm shift in the drug discovery process, demonstrating not only the feasibility of billion-scale compound screening but also their potential to identify hit candidates and increase the structural diversity of novel compounds with biological activities.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":" ","pages":"e202400305"},"PeriodicalIF":2.8,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142780630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Simple User-Friendly Reaction Format. 简单的用户友好的反应格式。

IF 2.8 4区医学

Molecular Informatics Pub Date : 2025-01-01 DOI: 10.1002/minf.202400361

David F Nippa, Alex T Müller, Kenneth Atz, David B Konrad, Uwe Grether, Rainer E Martin, Gisbert Schneider

{"title":"Simple User-Friendly Reaction Format.","authors":"David F Nippa, Alex T Müller, Kenneth Atz, David B Konrad, Uwe Grether, Rainer E Martin, Gisbert Schneider","doi":"10.1002/minf.202400361","DOIUrl":"10.1002/minf.202400361","url":null,"abstract":"Utilizing the growing wealth of chemical reaction data can boost synthesis planning and increase success rates. Yet, the effectiveness of machine learning tools for retrosynthesis planning and forward reaction prediction relies on accessible, well-curated data presented in a structured format. Although some public and licensed reaction databases exist, they often lack essential information about reaction conditions. To address this issue and promote the principles of findable, accessible, interoperable, and reusable (FAIR) data reporting and sharing, we introduce the Simple User-Friendly Reaction Format (SURF). SURF standardizes the documentation of reaction data through a structured tabular format, requiring only a basic understanding of spreadsheets. This format enables chemists to record the synthesis of molecules in a format that is understandable by both humans and machines, which facilitates seamless sharing and integration directly into machine learning pipelines. SURF files are designed to be interoperable, easily imported into relational databases, and convertible into other formats. This complements existing initiatives like the Open Reaction Database (ORD) and Unified Data Model (UDM). At Roche, SURF plays a crucial role in democratizing FAIR reaction data sharing and expediting the chemical synthesis process.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"44 1","pages":"e202400361"},"PeriodicalIF":2.8,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11755691/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143024131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0