Mark Croxall, Reece Lawrence, Jiaqi Gong, M Cynthia Goh
{"title":"High-Throughput Photocatalysis for Generating Reliable Datasets Analyzed by Machine Learning.","authors":"Mark Croxall, Reece Lawrence, Jiaqi Gong, M Cynthia Goh","doi":"10.1002/cphc.202500039","DOIUrl":null,"url":null,"abstract":"<p><p>Photocatalysis is an environmentally conscious tool for removing contaminants from water. Novel photocatalytic materials are often measured on ability to degrade a small number of analytes, which may not be indicative of broader applicability. In this work, an experimental method dubbed high-throughput photocatalysis (HTP) is introduced to assay photocatalytic materials against a range of analytes in a time effective manner. HTP is modular; experimental parameters, including matrix, can be changed to fit a proposed application. The photodegradation of each analyte is attained in a consistent manner such that machine learning (ML) models can be applied to the obtained datasets. Three out of the box ML models-linear regression, random forest (RF), and neural network (NN)-are tasked with estimating the percentage removal as a function of irradiation time and molecular structure, as represented by Morgan fingerprints. Leave-out sets demonstrated that RF and NN models did not overfit the training data and reasonably estimated the degradation of unknown molecules. SHapley additive exPlanations values are utilized to correlate molecular substructures to the parent molecule's susceptibility to photocatalytic degradation. These correlations are used to generate heatmaps of estimated reactivity within molecules that corroborate reports in which dye degradation pathways were studied in detail.</p>","PeriodicalId":9819,"journal":{"name":"Chemphyschem","volume":" ","pages":"e202500039"},"PeriodicalIF":2.2000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemphyschem","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1002/cphc.202500039","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Photocatalysis is an environmentally conscious tool for removing contaminants from water. Novel photocatalytic materials are often measured on ability to degrade a small number of analytes, which may not be indicative of broader applicability. In this work, an experimental method dubbed high-throughput photocatalysis (HTP) is introduced to assay photocatalytic materials against a range of analytes in a time effective manner. HTP is modular; experimental parameters, including matrix, can be changed to fit a proposed application. The photodegradation of each analyte is attained in a consistent manner such that machine learning (ML) models can be applied to the obtained datasets. Three out of the box ML models-linear regression, random forest (RF), and neural network (NN)-are tasked with estimating the percentage removal as a function of irradiation time and molecular structure, as represented by Morgan fingerprints. Leave-out sets demonstrated that RF and NN models did not overfit the training data and reasonably estimated the degradation of unknown molecules. SHapley additive exPlanations values are utilized to correlate molecular substructures to the parent molecule's susceptibility to photocatalytic degradation. These correlations are used to generate heatmaps of estimated reactivity within molecules that corroborate reports in which dye degradation pathways were studied in detail.
期刊介绍:
ChemPhysChem is one of the leading chemistry/physics interdisciplinary journals (ISI Impact Factor 2018: 3.077) for physical chemistry and chemical physics. It is published on behalf of Chemistry Europe, an association of 16 European chemical societies.
ChemPhysChem is an international source for important primary and critical secondary information across the whole field of physical chemistry and chemical physics. It integrates this wide and flourishing field ranging from Solid State and Soft-Matter Research, Electro- and Photochemistry, Femtochemistry and Nanotechnology, Complex Systems, Single-Molecule Research, Clusters and Colloids, Catalysis and Surface Science, Biophysics and Physical Biochemistry, Atmospheric and Environmental Chemistry, and many more topics. ChemPhysChem is peer-reviewed.