Digital discovery最新文献_第5页

Multireference error mitigation for quantum computation of chemistry 化学量子计算的多参考误差缓解

IF 6.2

Digital discovery Pub Date : 2025-07-29 DOI: 10.1039/D5DD00202H

Hang Zou, Erika Magnusson, Hampus Brunander, Werner Dobrautz and Martin Rahm

{"title":"Multireference error mitigation for quantum computation of chemistry","authors":"Hang Zou, Erika Magnusson, Hampus Brunander, Werner Dobrautz and Martin Rahm","doi":"10.1039/D5DD00202H","DOIUrl":"https://doi.org/10.1039/D5DD00202H","url":null,"abstract":"Quantum error mitigation (QEM) strategies are essential for improving the precision and reliability of quantum chemistry algorithms on noisy intermediate-scale quantum devices. Reference-state error mitigation (REM) is a cost-effective chemistry-inspired QEM method that performs well for weakly correlated problems. However, the effectiveness of REM is often limited when applied to strongly correlated systems. Here, we introduce multireference-state error mitigation (MREM), an extension of REM that systematically captures quantum hardware noise in strongly correlated ground states by utilizing multireference states. A pivotal aspect of MREM is using Givens rotations to efficiently construct quantum circuits to generate multireference states. To strike a balance between circuit expressivity and noise sensitivity, we employ compact wavefunctions composed of a few dominant Slater determinants. These truncated multireference states, engineered to exhibit substantial overlap with the target ground state, can effectively enhance error mitigation in variational quantum eigensolver experiments. We demonstrate the effectiveness of MREM through comprehensive simulations of molecular systems H2O, N2, and F2, underscoring its ability to realize significant improvements in computational accuracy compared to the original REM method. MREM broadens the scope of error mitigation to encompass a wider variety of molecular systems, including those exhibiting pronounced electron correlation.","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 9","pages":" 2521-2533"},"PeriodicalIF":6.2,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00202h?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multimodal learning in synthetic chemistry applications: gas chromatography retention time prediction and isomer separation optimization 合成化学应用中的多模态学习：气相色谱保留时间预测和同分异构体分离优化

IF 6.2

Digital discovery Pub Date : 2025-07-29 DOI: 10.1039/D4DD00369A

Jinglong Lin, Longyin Song, Yuntian Chen, Chengchun Liu, Shufeng Chen and Fanyang Mo

{"title":"Multimodal learning in synthetic chemistry applications: gas chromatography retention time prediction and isomer separation optimization","authors":"Jinglong Lin, Longyin Song, Yuntian Chen, Chengchun Liu, Shufeng Chen and Fanyang Mo","doi":"10.1039/D4DD00369A","DOIUrl":"https://doi.org/10.1039/D4DD00369A","url":null,"abstract":"Multimodal learning, a key machine learning (ML) approach, has been extensively applied in fields such as medical diagnostics and recommendation systems. The complexity of chemical data offers unique opportunities for multimodal learning, though its application in chemistry remains underexplored. Here, we propose an innovative multimodal framework for gas chromatography (GC) that integrates a geometry-enhanced graph isomorphism network and gated recurrent units. This framework predicts GC retention time across diverse molecular heating profiles with a test set R2 of 0.995, outperforming traditional ML methods. It effectively recommends optimal chromatographic conditions for separating positional isomers and cis/trans isomers, minimizing experimental iterations and significantly improving analytical efficiency. Moreover, the model provides insights into the separation challenges of various isomers, enhancing understanding of the relationship between molecular structure and chromatographic behavior. This approach could pave the way for broader applications of multimodal learning in chemistry.","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 9","pages":" 2465-2477"},"PeriodicalIF":6.2,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d4dd00369a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Toward sustainable polymer design: a molecular dynamics-informed machine learning approach for vitrimers 迈向可持续聚合物设计：一种基于分子动力学的玻璃体机器学习方法

IF 6.2

Digital discovery Pub Date : 2025-07-28 DOI: 10.1039/D5DD00239G

Yiwen Zheng, Agni K. Biswal, Yaqi Guo, Prakash Thakolkaran, Yash Kokane, Vikas Varshney, Siddhant Kumar and Aniruddh Vashisth

{"title":"Toward sustainable polymer design: a molecular dynamics-informed machine learning approach for vitrimers","authors":"Yiwen Zheng, Agni K. Biswal, Yaqi Guo, Prakash Thakolkaran, Yash Kokane, Vikas Varshney, Siddhant Kumar and Aniruddh Vashisth","doi":"10.1039/D5DD00239G","DOIUrl":"https://doi.org/10.1039/D5DD00239G","url":null,"abstract":"Vitrimers represent an emerging class of sustainable polymers with self-healing capabilities enabled by dynamic covalent adaptive networks. However, their limited molecular diversity constrains their property space and potential applications. Recent developments in machine learning (ML) techniques accelerate polymer design by predicting properties and virtually screening candidates, yet the scarcity of available experimental vitrimer data poses challenges in training ML models. To address this, we leverage molecular dynamics (MD) data generated by our previous work to train and benchmark seven ML models covering six feature representations for glass transition temperature (Tg) prediction. By averaging predicted Tg from different models, the model ensemble approach outperforms individual models, allowing for accurate and efficient property prediction on unlabeled datasets. Two novel vitrimers are identified and synthesized, exhibiting experimentally validated higher Tg than existing bifunctional transesterification vitrimers, along with demonstrated healability. This work explores the possibility of using MD data to train ML models in the absence of sufficient experimental data, enabling the discovery of novel, synthesizable polymer chemistries with a wide range of desirable properties. The integrated MD–ML approach offers polymer chemists an efficient tool for accurate property prediction and designing polymers tailored to diverse applications.","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 9","pages":" 2559-2569"},"PeriodicalIF":6.2,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00239g?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Quantum machine learning of molecular energies with hybrid quantum-neural wavefunction† 基于混合量子神经波函数的分子能量量子机器学习

IF 6.2

Digital discovery Pub Date : 2025-07-28 DOI: 10.1039/D5DD00222B

Weitang Li, Shi-Xin Zhang, Zirui Sheng, Cunxi Gong, Jianpeng Chen and Zhigang Shuai

引用次数: 0

Chemoenzymatic synthesis planning guided by synthetic potential scores 以合成电位评分为指导的化学酶合成计划

IF 6.2

Digital discovery Pub Date : 2025-07-28 DOI: 10.1039/D5DD00008D

Xuan Liu, Hongxiang Li and Huimin Zhao

{"title":"Chemoenzymatic synthesis planning guided by synthetic potential scores","authors":"Xuan Liu, Hongxiang Li and Huimin Zhao","doi":"10.1039/D5DD00008D","DOIUrl":"https://doi.org/10.1039/D5DD00008D","url":null,"abstract":"Computer-aided chemoenzymatic synthesis planning integrates the advantages of enzymatic and organic reactions to design efficient hybrid synthesis routes for a target molecule. Existing tools rely on either a step-by-step strategy or a bypass strategy. Here we introduce a synthetic potential score (SPScore) to unify these two strategies. This score is developed by training a multilayer perceptron on existing reaction databases to evaluate the potential of enzymatic or organic reactions for synthesis of a molecule. We systematically evaluate the effectiveness of the SPScore in both single-step and multi-step hybrid retrosynthesis, demonstrating its strong ability to prioritize promising reaction types. In benchmarking various chemoenzymatic retrosynthesis algorithms guided by the SPScore, we find that an asynchronous search algorithm named ACERetro yields higher efficiency and robustness that can find hybrid synthesis routes to 46% more molecules compared with the state-of-the-art tool using a test dataset consisting of 1001 molecules. We then apply ACERetro to design efficient chemoenzymatic synthesis routes for 4 FDA-approved drugs. We anticipate that the application of the SPScore will provide a new avenue for computer-aided chemoenzymatic synthesis planning, thereby advancing the synthesis of functional molecules.","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 9","pages":" 2534-2547"},"PeriodicalIF":6.2,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00008d?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A precise comparison of molecular target prediction methods 分子靶标预测方法的精确比较

IF 6.2

Digital discovery Pub Date : 2025-07-25 DOI: 10.1039/D5DD00199D

Tiantian He, Klaudia Caba and Pedro J. Ballester

{"title":"A precise comparison of molecular target prediction methods","authors":"Tiantian He, Klaudia Caba and Pedro J. Ballester","doi":"10.1039/D5DD00199D","DOIUrl":"https://doi.org/10.1039/D5DD00199D","url":null,"abstract":"Small-molecule drug discovery has transitioned from traditional phenotypic screening to more precise target-based approaches, with an increased focus on understanding mechanisms of action (MoA) and target identification. With more research on off-target effects of approved drugs and the discovery of new therapeutic targets, revealing hidden polypharmacology can reduce both time and costs in drug discovery through off-target drug repurposing. However, despite the potential of in silico target prediction, its reliability and consistency remain a challenge across different methods. This project systematically compares seven target prediction methods, including stand-alone codes and web servers (MolTarPred, PPB2, RF-QSAR, TargetNet, ChEMBL, CMTNN and SuperPred), using a shared benchmark dataset of FDA-approved drugs. We also explore model optimization strategies, such as high-confidence filtering, which reduces recall, making it less ideal for drug repurposing. Furthermore, for MolTarPred, Morgan fingerprints with Tanimoto scores outperform MACCS fingerprints with Dice scores. This analysis shows that MolTarPred is the most effective method. For practical applications, we introduce a programmatic pipeline for target prediction and MoA hypothesis generation. A case study on fenofibric acid shows its potential for drug repurposing as a THRB modulator for thyroid cancer treatment.","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 9","pages":" 2548-2558"},"PeriodicalIF":6.2,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00199d?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing automated drug substance impurity structure elucidation from tandem mass spectra through transfer learning and domain knowledge† 通过迁移学习和领域知识增强串联质谱中药物杂质结构的自动解析

IF 6.2

Digital discovery Pub Date : 2025-07-24 DOI: 10.1039/D5DD00115C

Emilio Dorigatti, Jonathan Groß, Jonas Kühlborn, Robert Möckel, Frank Maier and Julian Keupp

{"title":"Enhancing automated drug substance impurity structure elucidation from tandem mass spectra through transfer learning and domain knowledge†","authors":"Emilio Dorigatti, Jonathan Groß, Jonas Kühlborn, Robert Möckel, Frank Maier and Julian Keupp","doi":"10.1039/D5DD00115C","DOIUrl":"https://doi.org/10.1039/D5DD00115C","url":null,"abstract":"Liquid chromatography-tandem mass spectrometry (LC-MS/MS) is an essential analytical technique in the pharmaceutical industry, used particularly for elucidating the structure of unknown impurities in the synthesis of active pharmaceutical ingredients. However, the interpretation of mass spectra is challenging and time-consuming, requiring significant expertise. While recent computational tools aimed at automating this process have been developed, their accuracy in determining the chemical structure limits its use in practice. In this paper, we introduce a new method called SEISMiQ for elucidating unknown impurities from their MS/MS spectra. We are able to significantly improve elucidation accuracy by integrating domain experts' knowledge, specifically the impurity sum formula and known substructure, into the model's training and inference process. Further performance improvements can be achieved through transfer learning using simulated MS/MS spectra of impurities from an in-house database. Finally, the need for any experimental data collection for finetuning can be circumvented by simulating the entire drug substance synthesis process in silico via reaction templates.","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 9","pages":" 2454-2464"},"PeriodicalIF":6.2,"publicationDate":"2025-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00115c?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

PepMSND: integrating multi-level feature engineering and comprehensive databases to enhance in vitro/in vivo peptide blood stability prediction† PepMSND：整合多层次特征工程和综合数据库，提高体外/体内肽血稳定性预测†

IF 6.2

Digital discovery Pub Date : 2025-07-19 DOI: 10.1039/D5DD00118H

Haomeng Hu, Chengyun Zhang, Zhenyu Xu, Jingjing Guo, An Su, Chengxi Li and Hongliang Duan

{"title":"PepMSND: integrating multi-level feature engineering and comprehensive databases to enhance in vitro/in vivo peptide blood stability prediction†","authors":"Haomeng Hu, Chengyun Zhang, Zhenyu Xu, Jingjing Guo, An Su, Chengxi Li and Hongliang Duan","doi":"10.1039/D5DD00118H","DOIUrl":"https://doi.org/10.1039/D5DD00118H","url":null,"abstract":"Deep learning has emerged as a transformative tool for peptide drug discovery, yet predicting peptide blood stability—a critical determinant of bioavailability and therapeutic efficacy—remains a major challenge. While such a task can be accomplished through experiments, it requires much time and cost. Here, to address this challenge, we collect extensive experimental data on peptide stability in blood from public databases and the literature and construct a database of peptide blood stability that includes 635 samples. Based on this database, we develop a novel model called PepMSND, integrating KAN, Transformer, GAT, and SE(3)-Transformer to perform multi-level feature engineering for peptide blood stability prediction. Our model can achieve an ACC of 0.867 and an AUC of 0.912 on average and outperforms the baseline models. We also develop a user-friendly web interface for the PepMSND model, which is freely available at http://model.highslab.com/pepmsnd. This research is crucial for the development of novel peptides with strong blood stability, as the stability of peptide drugs directly determines their effectiveness and reliability in clinical applications.","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 9","pages":" 2478-2490"},"PeriodicalIF":6.2,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00118h?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Solid-state synthesizability predictions using positive-unlabeled learning from human-curated literature data† 固态综合预测使用正面无标签学习从人类策划的文献数据†

IF 6.2

Digital discovery Pub Date : 2025-07-19 DOI: 10.1039/D5DD00065C

Vincent Chung, Aron Walsh and David J. Payne

{"title":"Solid-state synthesizability predictions using positive-unlabeled learning from human-curated literature data†","authors":"Vincent Chung, Aron Walsh and David J. Payne","doi":"10.1039/D5DD00065C","DOIUrl":"https://doi.org/10.1039/D5DD00065C","url":null,"abstract":"The rate of materials discovery is limited by the experimental validation of promising candidate materials generated from high-throughput calculations. Although data-driven approaches, utilizing text-mined datasets, have shown some success in aiding synthesis planning and synthesizability prediction, they are limited by the quality of the underlying datasets. In this study, synthesis information of 4103 ternary oxides was extracted from the literature, including whether the oxide has been synthesized via solid-state reaction and the associated reaction conditions. This dataset provides an opportunity to supplement existing solid-state reaction models via reliable data and information from articles whose content and formats are challenging to extract automatically. A simple screening using this dataset identified 156 outliers from a subset of a text-mined dataset that contains 4800 entries, of which only 15% of the outliers were extracted correctly. Finally, this dataset was used to train a positive-unlabeled learning model to predict the solid-state synthesizability of new ternary oxides, where we predict 134 out of 4312 hypothetical compositions are likely to be synthesizable.","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 9","pages":" 2439-2453"},"PeriodicalIF":6.2,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00065c?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Programmable aerosol chemistry coupled to chemical imaging establishes a new arena for automated chemical synthesis and discovery† 可编程气溶胶化学耦合到化学成像建立了自动化化学合成和发现的新领域†

IF 6.2

Digital discovery Pub Date : 2025-07-18 DOI: 10.1039/D5DD00100E

Jakub D. Wosik, Chaoyi Zhu, Zehua Li and S. Hessam M. Mehr

{"title":"Programmable aerosol chemistry coupled to chemical imaging establishes a new arena for automated chemical synthesis and discovery†","authors":"Jakub D. Wosik, Chaoyi Zhu, Zehua Li and S. Hessam M. Mehr","doi":"10.1039/D5DD00100E","DOIUrl":"https://doi.org/10.1039/D5DD00100E","url":null,"abstract":"Aerosols have emerged as a massively parallel reaction medium promising accelerated reactivity and unanticipated reactivity outcomes, yet exploration of these properties has so far only been confined to specific reactions. Wider deployment in chemical synthesis and discovery is impeded by the lack of a general-purpose formalism for conceiving multi-step chemical transformations in the aerosol medium and standardised building blocks to enable adaptation of existing synthesis procedures to execution in the inherently stochastic and inhomogeneous aerosol phase. Here we propose a framework based on programmable timed release of reagents as atomised solutions that provides the minimum necessary building blocks for synthesis in an automated aerosol reactor. This framework both connects synthesis in traditional bulk media with aerosols and lays the foundation for massively parallel discovery in airborne microdroplets. To validate our proposed formalism with a concrete methodology, we demonstrate a prototype open hardware platform and three examples of automated procedures. Further, we propose chemical imaging as a category of analytical methodology tailored to interrogation of aerosols. As a proof-of-principle demonstration, we use optical microscopy to detect reactivity in the resulting microdroplets and study the spatial distribution of their compositions in response to changes in the synthesis program.","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 9","pages":" 2423-2430"},"PeriodicalIF":6.2,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00100e?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0