{"title":"基于QSAR的机器学习揭示了一种针对Ensitrelvir - Resistant SARS - CoV - 2主蛋白酶的双重功能G4配体","authors":"Napat Prompat, Panik Nadee, Aekkaraj Nualla‐ong","doi":"10.1002/adts.202501079","DOIUrl":null,"url":null,"abstract":"The emergence of drug‐resistant SARS‐CoV‐2 variants necessitates novel antiviral strategies targeting conserved viral components. This study integrates machine learning‐based quantitative structure‐activity relationship modeling and comprehensive computational approaches to identify dual‐function inhibitors against the main protease and RNA G‐quadruplex structures of SARS‐CoV‐2. A Random Forest classifier trained on 890 curated compounds achieves superior predictive performance (AUC = 0.9458) using CDK fingerprints, enabling virtual screening of 4,564 G‐quadruplex ligands from the G4LDB. Molecular docking reveals lead compound G4L2574 exhibits stronger binding affinity (−12.11 kcal mol<jats:sup>−1</jats:sup>) to the M49I mutant Mpro than clinical inhibitor ensitrelvir (−8.92 kcal mol<jats:sup>−1</jats:sup>), with molecular dynamics simulations demonstrating enhanced complex stability and persistent hydrogen bonding. MM/PBSA calculations confirm favorable binding free energy (−40.54 kcal mol<jats:sup>−1</jats:sup>) for G4L2574‐M49I, driven by robust electrostatic interactions. Structural analysis shows the M49I mutation induced steric hindrance compromising ensitrelvir binding, while G4L2574 maintained critical interactions with catalytic residues His41 and Cys145. Additionally, G4L2574 demonstrates superior RNA G‐quadruplex binding (−11.73 kcal mol<jats:sup>−1</jats:sup>) than RNA G‐quadruplex stabilizing ligand TMPyP4. This dual‐targeting mechanism, validated through machine learning and MD simulations, presents a promising strategy to circumvent resistance mutations while leveraging conserved viral replication targets. The integrated computational pipeline establishes a framework for rapid identification of broad‐spectrum antivirals against evolving coronaviruses.","PeriodicalId":7219,"journal":{"name":"Advanced Theory and Simulations","volume":"96 1","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"QSAR‐Based Machine Learning Reveals a Repurposed Dual‐Function G4 Ligand against Ensitrelvir‐Resistant SARS‐CoV‐2 Main Protease\",\"authors\":\"Napat Prompat, Panik Nadee, Aekkaraj Nualla‐ong\",\"doi\":\"10.1002/adts.202501079\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The emergence of drug‐resistant SARS‐CoV‐2 variants necessitates novel antiviral strategies targeting conserved viral components. This study integrates machine learning‐based quantitative structure‐activity relationship modeling and comprehensive computational approaches to identify dual‐function inhibitors against the main protease and RNA G‐quadruplex structures of SARS‐CoV‐2. A Random Forest classifier trained on 890 curated compounds achieves superior predictive performance (AUC = 0.9458) using CDK fingerprints, enabling virtual screening of 4,564 G‐quadruplex ligands from the G4LDB. Molecular docking reveals lead compound G4L2574 exhibits stronger binding affinity (−12.11 kcal mol<jats:sup>−1</jats:sup>) to the M49I mutant Mpro than clinical inhibitor ensitrelvir (−8.92 kcal mol<jats:sup>−1</jats:sup>), with molecular dynamics simulations demonstrating enhanced complex stability and persistent hydrogen bonding. MM/PBSA calculations confirm favorable binding free energy (−40.54 kcal mol<jats:sup>−1</jats:sup>) for G4L2574‐M49I, driven by robust electrostatic interactions. Structural analysis shows the M49I mutation induced steric hindrance compromising ensitrelvir binding, while G4L2574 maintained critical interactions with catalytic residues His41 and Cys145. Additionally, G4L2574 demonstrates superior RNA G‐quadruplex binding (−11.73 kcal mol<jats:sup>−1</jats:sup>) than RNA G‐quadruplex stabilizing ligand TMPyP4. This dual‐targeting mechanism, validated through machine learning and MD simulations, presents a promising strategy to circumvent resistance mutations while leveraging conserved viral replication targets. The integrated computational pipeline establishes a framework for rapid identification of broad‐spectrum antivirals against evolving coronaviruses.\",\"PeriodicalId\":7219,\"journal\":{\"name\":\"Advanced Theory and Simulations\",\"volume\":\"96 1\",\"pages\":\"\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Theory and Simulations\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1002/adts.202501079\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Theory and Simulations","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1002/adts.202501079","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
QSAR‐Based Machine Learning Reveals a Repurposed Dual‐Function G4 Ligand against Ensitrelvir‐Resistant SARS‐CoV‐2 Main Protease
The emergence of drug‐resistant SARS‐CoV‐2 variants necessitates novel antiviral strategies targeting conserved viral components. This study integrates machine learning‐based quantitative structure‐activity relationship modeling and comprehensive computational approaches to identify dual‐function inhibitors against the main protease and RNA G‐quadruplex structures of SARS‐CoV‐2. A Random Forest classifier trained on 890 curated compounds achieves superior predictive performance (AUC = 0.9458) using CDK fingerprints, enabling virtual screening of 4,564 G‐quadruplex ligands from the G4LDB. Molecular docking reveals lead compound G4L2574 exhibits stronger binding affinity (−12.11 kcal mol−1) to the M49I mutant Mpro than clinical inhibitor ensitrelvir (−8.92 kcal mol−1), with molecular dynamics simulations demonstrating enhanced complex stability and persistent hydrogen bonding. MM/PBSA calculations confirm favorable binding free energy (−40.54 kcal mol−1) for G4L2574‐M49I, driven by robust electrostatic interactions. Structural analysis shows the M49I mutation induced steric hindrance compromising ensitrelvir binding, while G4L2574 maintained critical interactions with catalytic residues His41 and Cys145. Additionally, G4L2574 demonstrates superior RNA G‐quadruplex binding (−11.73 kcal mol−1) than RNA G‐quadruplex stabilizing ligand TMPyP4. This dual‐targeting mechanism, validated through machine learning and MD simulations, presents a promising strategy to circumvent resistance mutations while leveraging conserved viral replication targets. The integrated computational pipeline establishes a framework for rapid identification of broad‐spectrum antivirals against evolving coronaviruses.
期刊介绍:
Advanced Theory and Simulations is an interdisciplinary, international, English-language journal that publishes high-quality scientific results focusing on the development and application of theoretical methods, modeling and simulation approaches in all natural science and medicine areas, including:
materials, chemistry, condensed matter physics
engineering, energy
life science, biology, medicine
atmospheric/environmental science, climate science
planetary science, astronomy, cosmology
method development, numerical methods, statistics