Influence of descriptor database selection on modeling retention factors in capillary micellar and microemulsion electrokinetic chromatography using the solvation parameter model
{"title":"Influence of descriptor database selection on modeling retention factors in capillary micellar and microemulsion electrokinetic chromatography using the solvation parameter model","authors":"Sanka N. Atapattu , Colin F. Poole","doi":"10.1016/j.chroma.2025.465992","DOIUrl":null,"url":null,"abstract":"<div><div>Abraham's solvation parameter model has been widely used to model retention in capillary micellar and microemulsion electrokinetic chromatography systems. To fit or predict retention factors in separation systems experimentally determined compound descriptors are required. These are typically obtained from two sources: the Abraham and Wayne State University (WSU) compound descriptor databases. Alternatively, compound descriptors can be estimated from molecular structures using computational, machine learning, and group contribution approaches without having to resort to experimental methods for compounds without assigned values in either database. Here, we compare descriptor quality for the Abraham and WSU experimental compound descriptor databases as well as computational approaches using group contribution and machine learning approaches for modelling retention in capillary micellar and microemulsion electrokinetic chromatography systems. It is shown that the WSU compound descriptor database affords a more accurate description for the retention of varied compounds while descriptors based on a machine learning approach more accurately modeled retention factors than a group contribution approach. Model standard errors and coefficients of determination for WSU descriptors varied from 0.046 to 0.116 and 0.976 to 0.996, respectively, while Abraham descriptors varied from 0.048 to 0.166 and 0.953 to 0.995, respectively. Model standard errors and coefficients of determination for machine learning descriptors ranged from 0.086 to 0.116 and 0.979 to 0.981, respectively, whereas group contribution descriptors ranged from 0.090 to 0.181 and 0.942 to 0.979, respectively. Descriptor quality is important when modeling retention in capillary micellar and microemulsion electrokinetic chromatography systems.</div></div>","PeriodicalId":347,"journal":{"name":"Journal of Chromatography A","volume":"1753 ","pages":"Article 465992"},"PeriodicalIF":3.8000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chromatography A","FirstCategoryId":"1","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0021967325003401","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Abraham's solvation parameter model has been widely used to model retention in capillary micellar and microemulsion electrokinetic chromatography systems. To fit or predict retention factors in separation systems experimentally determined compound descriptors are required. These are typically obtained from two sources: the Abraham and Wayne State University (WSU) compound descriptor databases. Alternatively, compound descriptors can be estimated from molecular structures using computational, machine learning, and group contribution approaches without having to resort to experimental methods for compounds without assigned values in either database. Here, we compare descriptor quality for the Abraham and WSU experimental compound descriptor databases as well as computational approaches using group contribution and machine learning approaches for modelling retention in capillary micellar and microemulsion electrokinetic chromatography systems. It is shown that the WSU compound descriptor database affords a more accurate description for the retention of varied compounds while descriptors based on a machine learning approach more accurately modeled retention factors than a group contribution approach. Model standard errors and coefficients of determination for WSU descriptors varied from 0.046 to 0.116 and 0.976 to 0.996, respectively, while Abraham descriptors varied from 0.048 to 0.166 and 0.953 to 0.995, respectively. Model standard errors and coefficients of determination for machine learning descriptors ranged from 0.086 to 0.116 and 0.979 to 0.981, respectively, whereas group contribution descriptors ranged from 0.090 to 0.181 and 0.942 to 0.979, respectively. Descriptor quality is important when modeling retention in capillary micellar and microemulsion electrokinetic chromatography systems.
期刊介绍:
The Journal of Chromatography A provides a forum for the publication of original research and critical reviews on all aspects of fundamental and applied separation science. The scope of the journal includes chromatography and related techniques, electromigration techniques (e.g. electrophoresis, electrochromatography), hyphenated and other multi-dimensional techniques, sample preparation, and detection methods such as mass spectrometry. Contributions consist mainly of research papers dealing with the theory of separation methods, instrumental developments and analytical and preparative applications of general interest.