Shymaa S Soliman, Nisreen F Abo- Talib, Mohamed R Elghobashy, Mona A Abdel Rahman
{"title":"COVID-19共包装paxlovid的可持续分析:探索先进的采样技术和多变量处理工具。","authors":"Shymaa S Soliman, Nisreen F Abo- Talib, Mohamed R Elghobashy, Mona A Abdel Rahman","doi":"10.1186/s13065-025-01567-2","DOIUrl":null,"url":null,"abstract":"<p><p>The drawbacks of random sampling not only hinder the development of more reliable and efficient methods but also weaken their accuracy, predictive abilities, and validity across several domains. During the current study, a pioneering statistical technique namely, Latin Hypercube Sampling (LHS) was integrated with different multivariate chemometric models namely; Partial Least Squares (PLS), Genetic Algorithm‑Partial Least Squares (GA-PLS), Artificial Neural Networks (ANN), and Multivariate Curve Resolution‑Alternating Least Squares (MCR-ALS). This integration aimed to achieve full data coverage and thereby enhance the predictive powers of these models. Being of clinical significance, Paxlovid<sup>®</sup>, a newly co-packaged antiCOVID-19 drug containing ritonavir (RNV)-boosted nirmatrelvir (NMV), was utilized as a study subject to demonstrate the powerful potentials of LHS in enhancing models' robustness and predictive accuracy. The LHS technique was able to provide well-interpreted and informative samples by capturing essential variabilities across the input space without any increase in sample numbers. It was compared and outperformed the random sampling Monte Carlo technique. A comprehensive comparison between the developed models was held where the RMSEP was relatively reduced by 14.1%, 8.9%, 53.1%, and 34.6% for RNV and NMV, respectively using the ANN and MCR-ALS models. Various preprocessing techniques were employed to improve signal quality for PLS construction, yielding superior results (RMSEC of 0.19 for both RNV and NMV) compared to the original, unprocessed spectral data (RMSEC of 0.21 for both RNV and NMV). The Principal Component Analysis score plot was constructed, confirming the consistency of the dataset and the absence of systematic errors, enhancing confidence in the models' robustness. A new hybrid variable selection strategy (GA-ICOMP-PLS) was developed to enhance the robustness and parsimony of the GA-PLS model. Prediction error values of 0.15 and 0.14 were successfully achieved for RNV and NMV, respectively, indicating strong predictive power and generalization. Consistent with sustainability and eco-friendly goals, the current study pioneers the usage of green-blue-white alternatives to conventional analytical methods. A comprehensive assessment was conducted using the \"Sample Preparation Metric of Sustainability\", the \"Analytical Greenness metric for Sample Preparation\" and the \"Analytical Greenness metric\" alongside two solvent sustainability evaluation tools. These evaluations yielded promising results, with green quadrant classification and high scores of 5.89, 0.67, and 0.82 for each metric, respectively, as well as satisfactory t- and F-test values. Moreover, the models achieved outstanding results on the RGB12 metric and Blueness Applicability Grade Index, scoring 96.8% and 82.5, respectively, highlighting their broad applicability, high efficiency, and alignment with eco-friendly analytical practices.</p>","PeriodicalId":496,"journal":{"name":"BMC Chemistry","volume":"19 1","pages":"206"},"PeriodicalIF":4.3000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12243336/pdf/","citationCount":"0","resultStr":"{\"title\":\"Sustainable analysis of COVID-19 Co-packaged paxlovid: exploring advanced sampling techniques and multivariate processing tools.\",\"authors\":\"Shymaa S Soliman, Nisreen F Abo- Talib, Mohamed R Elghobashy, Mona A Abdel Rahman\",\"doi\":\"10.1186/s13065-025-01567-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The drawbacks of random sampling not only hinder the development of more reliable and efficient methods but also weaken their accuracy, predictive abilities, and validity across several domains. During the current study, a pioneering statistical technique namely, Latin Hypercube Sampling (LHS) was integrated with different multivariate chemometric models namely; Partial Least Squares (PLS), Genetic Algorithm‑Partial Least Squares (GA-PLS), Artificial Neural Networks (ANN), and Multivariate Curve Resolution‑Alternating Least Squares (MCR-ALS). This integration aimed to achieve full data coverage and thereby enhance the predictive powers of these models. Being of clinical significance, Paxlovid<sup>®</sup>, a newly co-packaged antiCOVID-19 drug containing ritonavir (RNV)-boosted nirmatrelvir (NMV), was utilized as a study subject to demonstrate the powerful potentials of LHS in enhancing models' robustness and predictive accuracy. The LHS technique was able to provide well-interpreted and informative samples by capturing essential variabilities across the input space without any increase in sample numbers. It was compared and outperformed the random sampling Monte Carlo technique. A comprehensive comparison between the developed models was held where the RMSEP was relatively reduced by 14.1%, 8.9%, 53.1%, and 34.6% for RNV and NMV, respectively using the ANN and MCR-ALS models. Various preprocessing techniques were employed to improve signal quality for PLS construction, yielding superior results (RMSEC of 0.19 for both RNV and NMV) compared to the original, unprocessed spectral data (RMSEC of 0.21 for both RNV and NMV). The Principal Component Analysis score plot was constructed, confirming the consistency of the dataset and the absence of systematic errors, enhancing confidence in the models' robustness. A new hybrid variable selection strategy (GA-ICOMP-PLS) was developed to enhance the robustness and parsimony of the GA-PLS model. Prediction error values of 0.15 and 0.14 were successfully achieved for RNV and NMV, respectively, indicating strong predictive power and generalization. Consistent with sustainability and eco-friendly goals, the current study pioneers the usage of green-blue-white alternatives to conventional analytical methods. A comprehensive assessment was conducted using the \\\"Sample Preparation Metric of Sustainability\\\", the \\\"Analytical Greenness metric for Sample Preparation\\\" and the \\\"Analytical Greenness metric\\\" alongside two solvent sustainability evaluation tools. These evaluations yielded promising results, with green quadrant classification and high scores of 5.89, 0.67, and 0.82 for each metric, respectively, as well as satisfactory t- and F-test values. Moreover, the models achieved outstanding results on the RGB12 metric and Blueness Applicability Grade Index, scoring 96.8% and 82.5, respectively, highlighting their broad applicability, high efficiency, and alignment with eco-friendly analytical practices.</p>\",\"PeriodicalId\":496,\"journal\":{\"name\":\"BMC Chemistry\",\"volume\":\"19 1\",\"pages\":\"206\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12243336/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1186/s13065-025-01567-2\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1186/s13065-025-01567-2","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Sustainable analysis of COVID-19 Co-packaged paxlovid: exploring advanced sampling techniques and multivariate processing tools.
The drawbacks of random sampling not only hinder the development of more reliable and efficient methods but also weaken their accuracy, predictive abilities, and validity across several domains. During the current study, a pioneering statistical technique namely, Latin Hypercube Sampling (LHS) was integrated with different multivariate chemometric models namely; Partial Least Squares (PLS), Genetic Algorithm‑Partial Least Squares (GA-PLS), Artificial Neural Networks (ANN), and Multivariate Curve Resolution‑Alternating Least Squares (MCR-ALS). This integration aimed to achieve full data coverage and thereby enhance the predictive powers of these models. Being of clinical significance, Paxlovid®, a newly co-packaged antiCOVID-19 drug containing ritonavir (RNV)-boosted nirmatrelvir (NMV), was utilized as a study subject to demonstrate the powerful potentials of LHS in enhancing models' robustness and predictive accuracy. The LHS technique was able to provide well-interpreted and informative samples by capturing essential variabilities across the input space without any increase in sample numbers. It was compared and outperformed the random sampling Monte Carlo technique. A comprehensive comparison between the developed models was held where the RMSEP was relatively reduced by 14.1%, 8.9%, 53.1%, and 34.6% for RNV and NMV, respectively using the ANN and MCR-ALS models. Various preprocessing techniques were employed to improve signal quality for PLS construction, yielding superior results (RMSEC of 0.19 for both RNV and NMV) compared to the original, unprocessed spectral data (RMSEC of 0.21 for both RNV and NMV). The Principal Component Analysis score plot was constructed, confirming the consistency of the dataset and the absence of systematic errors, enhancing confidence in the models' robustness. A new hybrid variable selection strategy (GA-ICOMP-PLS) was developed to enhance the robustness and parsimony of the GA-PLS model. Prediction error values of 0.15 and 0.14 were successfully achieved for RNV and NMV, respectively, indicating strong predictive power and generalization. Consistent with sustainability and eco-friendly goals, the current study pioneers the usage of green-blue-white alternatives to conventional analytical methods. A comprehensive assessment was conducted using the "Sample Preparation Metric of Sustainability", the "Analytical Greenness metric for Sample Preparation" and the "Analytical Greenness metric" alongside two solvent sustainability evaluation tools. These evaluations yielded promising results, with green quadrant classification and high scores of 5.89, 0.67, and 0.82 for each metric, respectively, as well as satisfactory t- and F-test values. Moreover, the models achieved outstanding results on the RGB12 metric and Blueness Applicability Grade Index, scoring 96.8% and 82.5, respectively, highlighting their broad applicability, high efficiency, and alignment with eco-friendly analytical practices.
期刊介绍:
BMC Chemistry, formerly known as Chemistry Central Journal, is now part of the BMC series journals family.
Chemistry Central Journal has served the chemistry community as a trusted open access resource for more than 10 years – and we are delighted to announce the next step on its journey. In January 2019 the journal has been renamed BMC Chemistry and now strengthens the BMC series footprint in the physical sciences by publishing quality articles and by pushing the boundaries of open chemistry.