{"title":"The development of the generative adversarial supporting vector machine for molecular property generation","authors":"Qing Lu","doi":"10.1186/s13321-025-01052-x","DOIUrl":null,"url":null,"abstract":"The generative adversarial network (GAN) is a milestone technique in artificial intelligence, and it is widely used in image generation. However, it has a large hyper-parameter space, which makes it difficult for training. In this work, we propose a new generative model by introducing the supporting vector machine into the GAN architecture. Such modification reduces the hyper-parameter space by half, thus making the training more accessible. The formic acid dimer (FAD) system is studied to examine the generation capacity of the proposed model. The molecular structures, molecular energies and molecular dipole moments are combined as the feature vector to train the model. It is found that the proposed model can generate new feature vectors from scratch, and the generated data agrees well with the ab initio values. In addition, each generated feature vector is unique, so the mode collapse problem is avoided, which is often encountered in the GAN model. The proposed model is extensible to incorporate any molecular properties as the feature vector is established as the direct sum of corresponding component vectors; thus, it is expected that the proposed method will have a wide range of application scenarios. Scientific contribution statement: A generative adversarial algorithm combing supporting vector machine is proposed for the first time to predict molecular properties from scratch, which agrees well with ab initio values. The new model is more efficient than generative adversarial networks, and it is convenient to extend for application in different scenarios.","PeriodicalId":617,"journal":{"name":"Journal of Cheminformatics","volume":"81 1","pages":""},"PeriodicalIF":7.1000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cheminformatics","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1186/s13321-025-01052-x","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The generative adversarial network (GAN) is a milestone technique in artificial intelligence, and it is widely used in image generation. However, it has a large hyper-parameter space, which makes it difficult for training. In this work, we propose a new generative model by introducing the supporting vector machine into the GAN architecture. Such modification reduces the hyper-parameter space by half, thus making the training more accessible. The formic acid dimer (FAD) system is studied to examine the generation capacity of the proposed model. The molecular structures, molecular energies and molecular dipole moments are combined as the feature vector to train the model. It is found that the proposed model can generate new feature vectors from scratch, and the generated data agrees well with the ab initio values. In addition, each generated feature vector is unique, so the mode collapse problem is avoided, which is often encountered in the GAN model. The proposed model is extensible to incorporate any molecular properties as the feature vector is established as the direct sum of corresponding component vectors; thus, it is expected that the proposed method will have a wide range of application scenarios. Scientific contribution statement: A generative adversarial algorithm combing supporting vector machine is proposed for the first time to predict molecular properties from scratch, which agrees well with ab initio values. The new model is more efficient than generative adversarial networks, and it is convenient to extend for application in different scenarios.
期刊介绍:
Journal of Cheminformatics is an open access journal publishing original peer-reviewed research in all aspects of cheminformatics and molecular modelling.
Coverage includes, but is not limited to:
chemical information systems, software and databases, and molecular modelling,
chemical structure representations and their use in structure, substructure, and similarity searching of chemical substance and chemical reaction databases,
computer and molecular graphics, computer-aided molecular design, expert systems, QSAR, and data mining techniques.