{"title":"Hidden descriptors: Using statistical treatments to generate better descriptor sets","authors":"Lucía Morán-González , Feliu Maseras","doi":"10.1016/j.aichem.2024.100061","DOIUrl":null,"url":null,"abstract":"<div><p>The application of artificial intelligence to chemistry usually focuses on the identification of good correlations between descriptors and a given property of interest. The descriptors often come from arbitrary sets, with the implicit assumption that the evaluation of a sufficiently wide range of descriptors will lead to a satisfactory choice. Recent work in our group has focused on applying statistical analysis to large amounts of DFT results with the goal of finding optimal descriptor sets for a given property, which we label as hidden descriptors. This article briefly discusses this treatment and the chemical knowledge that has been gained through its application in two different domains: metal-ligand bond strength in transition metal complexes, and energy barriers in bimolecular nucleophilic substitution reactions.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000198/pdfft?md5=541c02174c39b94d0f3787f465d80154&pid=1-s2.0-S2949747724000198-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence chemistry","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949747724000198","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The application of artificial intelligence to chemistry usually focuses on the identification of good correlations between descriptors and a given property of interest. The descriptors often come from arbitrary sets, with the implicit assumption that the evaluation of a sufficiently wide range of descriptors will lead to a satisfactory choice. Recent work in our group has focused on applying statistical analysis to large amounts of DFT results with the goal of finding optimal descriptor sets for a given property, which we label as hidden descriptors. This article briefly discusses this treatment and the chemical knowledge that has been gained through its application in two different domains: metal-ligand bond strength in transition metal complexes, and energy barriers in bimolecular nucleophilic substitution reactions.