{"title":"知识增强和引导图对比学习在分子性质预测中的应用","authors":"Kunjie Dong, Yanhui Zhang, Xiaohui Lin","doi":"10.1016/j.eswa.2025.128190","DOIUrl":null,"url":null,"abstract":"<div><div>Molecular property prediction (MPP) lies at the core of fundamental tasks for AI-aided drug discovery. Recent studies have shown great promise in applying self-supervised learning (SSL) to cope with the data scarcity in MPP, and contrastive learning is one of the mainstream methods in SSL. However, current molecular graph contrastive learning methods suffer from two main challenges: molecular graph augmentation that preserves the molecular chemical semantics, and contrastive goal that captures the precise prior knowledge. To address these issues, we propose the <u>K</u>nowledge <u>E</u>nhanced and <u>G</u>uided <u>G</u>raph <u>C</u>ontrastive <u>L</u>earning (KEGGCL). KEGGCL adopts the chemical element domain knowledge to generate two enhanced molecular graphs without changing the molecular chemical structure, ensuring the preservation of the molecular semantics and structure. Next, KEGGCL uses the quantitative estimate of drug-likeness as the guideline to push away sample pairs constituted of different molecules discriminately, capturing the precise prior knowledge. Then, KEGGCL utilizes the well-trained encoders on the featured molecular graph and two element knowledge enhanced molecular graphs to decide the final prediction jointly. Experiments on the 10 benchmark datasets from MoleculeNet show the superiority of KEGGCL. It provides a new graph contrastive manner to learn the precise prior knowledge for better predicting molecular property.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"288 ","pages":"Article 128190"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Knowledge enhanced and guided graph contrastive learning for molecular property prediction\",\"authors\":\"Kunjie Dong, Yanhui Zhang, Xiaohui Lin\",\"doi\":\"10.1016/j.eswa.2025.128190\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Molecular property prediction (MPP) lies at the core of fundamental tasks for AI-aided drug discovery. Recent studies have shown great promise in applying self-supervised learning (SSL) to cope with the data scarcity in MPP, and contrastive learning is one of the mainstream methods in SSL. However, current molecular graph contrastive learning methods suffer from two main challenges: molecular graph augmentation that preserves the molecular chemical semantics, and contrastive goal that captures the precise prior knowledge. To address these issues, we propose the <u>K</u>nowledge <u>E</u>nhanced and <u>G</u>uided <u>G</u>raph <u>C</u>ontrastive <u>L</u>earning (KEGGCL). KEGGCL adopts the chemical element domain knowledge to generate two enhanced molecular graphs without changing the molecular chemical structure, ensuring the preservation of the molecular semantics and structure. Next, KEGGCL uses the quantitative estimate of drug-likeness as the guideline to push away sample pairs constituted of different molecules discriminately, capturing the precise prior knowledge. Then, KEGGCL utilizes the well-trained encoders on the featured molecular graph and two element knowledge enhanced molecular graphs to decide the final prediction jointly. Experiments on the 10 benchmark datasets from MoleculeNet show the superiority of KEGGCL. It provides a new graph contrastive manner to learn the precise prior knowledge for better predicting molecular property.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"288 \",\"pages\":\"Article 128190\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S095741742501810X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095741742501810X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Knowledge enhanced and guided graph contrastive learning for molecular property prediction
Molecular property prediction (MPP) lies at the core of fundamental tasks for AI-aided drug discovery. Recent studies have shown great promise in applying self-supervised learning (SSL) to cope with the data scarcity in MPP, and contrastive learning is one of the mainstream methods in SSL. However, current molecular graph contrastive learning methods suffer from two main challenges: molecular graph augmentation that preserves the molecular chemical semantics, and contrastive goal that captures the precise prior knowledge. To address these issues, we propose the Knowledge Enhanced and Guided Graph Contrastive Learning (KEGGCL). KEGGCL adopts the chemical element domain knowledge to generate two enhanced molecular graphs without changing the molecular chemical structure, ensuring the preservation of the molecular semantics and structure. Next, KEGGCL uses the quantitative estimate of drug-likeness as the guideline to push away sample pairs constituted of different molecules discriminately, capturing the precise prior knowledge. Then, KEGGCL utilizes the well-trained encoders on the featured molecular graph and two element knowledge enhanced molecular graphs to decide the final prediction jointly. Experiments on the 10 benchmark datasets from MoleculeNet show the superiority of KEGGCL. It provides a new graph contrastive manner to learn the precise prior knowledge for better predicting molecular property.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.