COVID-19共包装paxlovid的可持续分析:探索先进的采样技术和多变量处理工具。

IF 4.3 2区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY
Shymaa S Soliman, Nisreen F Abo- Talib, Mohamed R Elghobashy, Mona A Abdel Rahman
{"title":"COVID-19共包装paxlovid的可持续分析:探索先进的采样技术和多变量处理工具。","authors":"Shymaa S Soliman, Nisreen F Abo- Talib, Mohamed R Elghobashy, Mona A Abdel Rahman","doi":"10.1186/s13065-025-01567-2","DOIUrl":null,"url":null,"abstract":"<p><p>The drawbacks of random sampling not only hinder the development of more reliable and efficient methods but also weaken their accuracy, predictive abilities, and validity across several domains. During the current study, a pioneering statistical technique namely, Latin Hypercube Sampling (LHS) was integrated with different multivariate chemometric models namely; Partial Least Squares (PLS), Genetic Algorithm‑Partial Least Squares (GA-PLS), Artificial Neural Networks (ANN), and Multivariate Curve Resolution‑Alternating Least Squares (MCR-ALS). This integration aimed to achieve full data coverage and thereby enhance the predictive powers of these models. Being of clinical significance, Paxlovid<sup>®</sup>, a newly co-packaged antiCOVID-19 drug containing ritonavir (RNV)-boosted nirmatrelvir (NMV), was utilized as a study subject to demonstrate the powerful potentials of LHS in enhancing models' robustness and predictive accuracy. The LHS technique was able to provide well-interpreted and informative samples by capturing essential variabilities across the input space without any increase in sample numbers. It was compared and outperformed the random sampling Monte Carlo technique. A comprehensive comparison between the developed models was held where the RMSEP was relatively reduced by 14.1%, 8.9%, 53.1%, and 34.6% for RNV and NMV, respectively using the ANN and MCR-ALS models. Various preprocessing techniques were employed to improve signal quality for PLS construction, yielding superior results (RMSEC of 0.19 for both RNV and NMV) compared to the original, unprocessed spectral data (RMSEC of 0.21 for both RNV and NMV). The Principal Component Analysis score plot was constructed, confirming the consistency of the dataset and the absence of systematic errors, enhancing confidence in the models' robustness. A new hybrid variable selection strategy (GA-ICOMP-PLS) was developed to enhance the robustness and parsimony of the GA-PLS model. Prediction error values of 0.15 and 0.14 were successfully achieved for RNV and NMV, respectively, indicating strong predictive power and generalization. Consistent with sustainability and eco-friendly goals, the current study pioneers the usage of green-blue-white alternatives to conventional analytical methods. A comprehensive assessment was conducted using the \"Sample Preparation Metric of Sustainability\", the \"Analytical Greenness metric for Sample Preparation\" and the \"Analytical Greenness metric\" alongside two solvent sustainability evaluation tools. These evaluations yielded promising results, with green quadrant classification and high scores of 5.89, 0.67, and 0.82 for each metric, respectively, as well as satisfactory t- and F-test values. Moreover, the models achieved outstanding results on the RGB12 metric and Blueness Applicability Grade Index, scoring 96.8% and 82.5, respectively, highlighting their broad applicability, high efficiency, and alignment with eco-friendly analytical practices.</p>","PeriodicalId":496,"journal":{"name":"BMC Chemistry","volume":"19 1","pages":"206"},"PeriodicalIF":4.3000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12243336/pdf/","citationCount":"0","resultStr":"{\"title\":\"Sustainable analysis of COVID-19 Co-packaged paxlovid: exploring advanced sampling techniques and multivariate processing tools.\",\"authors\":\"Shymaa S Soliman, Nisreen F Abo- Talib, Mohamed R Elghobashy, Mona A Abdel Rahman\",\"doi\":\"10.1186/s13065-025-01567-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The drawbacks of random sampling not only hinder the development of more reliable and efficient methods but also weaken their accuracy, predictive abilities, and validity across several domains. During the current study, a pioneering statistical technique namely, Latin Hypercube Sampling (LHS) was integrated with different multivariate chemometric models namely; Partial Least Squares (PLS), Genetic Algorithm‑Partial Least Squares (GA-PLS), Artificial Neural Networks (ANN), and Multivariate Curve Resolution‑Alternating Least Squares (MCR-ALS). This integration aimed to achieve full data coverage and thereby enhance the predictive powers of these models. Being of clinical significance, Paxlovid<sup>®</sup>, a newly co-packaged antiCOVID-19 drug containing ritonavir (RNV)-boosted nirmatrelvir (NMV), was utilized as a study subject to demonstrate the powerful potentials of LHS in enhancing models' robustness and predictive accuracy. The LHS technique was able to provide well-interpreted and informative samples by capturing essential variabilities across the input space without any increase in sample numbers. It was compared and outperformed the random sampling Monte Carlo technique. A comprehensive comparison between the developed models was held where the RMSEP was relatively reduced by 14.1%, 8.9%, 53.1%, and 34.6% for RNV and NMV, respectively using the ANN and MCR-ALS models. Various preprocessing techniques were employed to improve signal quality for PLS construction, yielding superior results (RMSEC of 0.19 for both RNV and NMV) compared to the original, unprocessed spectral data (RMSEC of 0.21 for both RNV and NMV). The Principal Component Analysis score plot was constructed, confirming the consistency of the dataset and the absence of systematic errors, enhancing confidence in the models' robustness. A new hybrid variable selection strategy (GA-ICOMP-PLS) was developed to enhance the robustness and parsimony of the GA-PLS model. Prediction error values of 0.15 and 0.14 were successfully achieved for RNV and NMV, respectively, indicating strong predictive power and generalization. Consistent with sustainability and eco-friendly goals, the current study pioneers the usage of green-blue-white alternatives to conventional analytical methods. A comprehensive assessment was conducted using the \\\"Sample Preparation Metric of Sustainability\\\", the \\\"Analytical Greenness metric for Sample Preparation\\\" and the \\\"Analytical Greenness metric\\\" alongside two solvent sustainability evaluation tools. These evaluations yielded promising results, with green quadrant classification and high scores of 5.89, 0.67, and 0.82 for each metric, respectively, as well as satisfactory t- and F-test values. Moreover, the models achieved outstanding results on the RGB12 metric and Blueness Applicability Grade Index, scoring 96.8% and 82.5, respectively, highlighting their broad applicability, high efficiency, and alignment with eco-friendly analytical practices.</p>\",\"PeriodicalId\":496,\"journal\":{\"name\":\"BMC Chemistry\",\"volume\":\"19 1\",\"pages\":\"206\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12243336/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1186/s13065-025-01567-2\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1186/s13065-025-01567-2","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

随机抽样的缺点不仅阻碍了更可靠、更有效的方法的发展,而且削弱了它们在多个领域的准确性、预测能力和有效性。在目前的研究中,一种开创性的统计技术,即拉丁超立方体采样(LHS)与不同的多元化学计量模型相结合,即;偏最小二乘(PLS),遗传算法-偏最小二乘(GA-PLS),人工神经网络(ANN)和多元曲线分辨率-交替最小二乘(MCR-ALS)。这种整合旨在实现全数据覆盖,从而增强这些模型的预测能力。Paxlovid®是一种新型共包装抗covid -19药物,含有利托那韦(RNV)增强的nirmatrelvir (NMV),具有临床意义,被用作研究对象,以证明LHS在提高模型稳健性和预测准确性方面的强大潜力。LHS技术能够在不增加样本数量的情况下捕获整个输入空间的基本变量,从而提供良好的解释和信息丰富的样本。与随机抽样蒙特卡罗方法进行了比较,结果表明该方法优于随机抽样蒙特卡罗方法。对已开发模型进行了综合比较,使用ANN和MCR-ALS模型,RNV和NMV的RMSEP分别相对降低了14.1%、8.9%、53.1%和34.6%。采用各种预处理技术来提高PLS构建的信号质量,与原始未处理的光谱数据(RNV和NMV的RMSEC均为0.19)相比,结果更优(RNV和NMV的RMSEC均为0.21)。构建了主成分分析评分图,证实了数据集的一致性和不存在系统误差,增强了对模型稳健性的信心。为了提高GA-PLS模型的鲁棒性和简便性,提出了一种新的混合变量选择策略GA-ICOMP-PLS。RNV和NMV的预测误差分别为0.15和0.14,具有较强的预测能力和泛化能力。与可持续发展和环保目标一致,目前的研究率先使用绿-蓝-白替代传统的分析方法。使用“可持续性样品制备指标”、“样品制备分析绿色指标”和“分析绿色指标”以及两种溶剂可持续性评估工具进行了全面评估。这些评估产生了令人满意的结果,绿色象限分类和每个指标的高分分别为5.89,0.67和0.82,以及令人满意的t和f检验值。此外,该模型在RGB12指标和蓝度适用性等级指数上取得了出色的成绩,分别为96.8%和82.5分,突出了其广泛的适用性、高效率和与环保分析实践的一致性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sustainable analysis of COVID-19 Co-packaged paxlovid: exploring advanced sampling techniques and multivariate processing tools.

The drawbacks of random sampling not only hinder the development of more reliable and efficient methods but also weaken their accuracy, predictive abilities, and validity across several domains. During the current study, a pioneering statistical technique namely, Latin Hypercube Sampling (LHS) was integrated with different multivariate chemometric models namely; Partial Least Squares (PLS), Genetic Algorithm‑Partial Least Squares (GA-PLS), Artificial Neural Networks (ANN), and Multivariate Curve Resolution‑Alternating Least Squares (MCR-ALS). This integration aimed to achieve full data coverage and thereby enhance the predictive powers of these models. Being of clinical significance, Paxlovid®, a newly co-packaged antiCOVID-19 drug containing ritonavir (RNV)-boosted nirmatrelvir (NMV), was utilized as a study subject to demonstrate the powerful potentials of LHS in enhancing models' robustness and predictive accuracy. The LHS technique was able to provide well-interpreted and informative samples by capturing essential variabilities across the input space without any increase in sample numbers. It was compared and outperformed the random sampling Monte Carlo technique. A comprehensive comparison between the developed models was held where the RMSEP was relatively reduced by 14.1%, 8.9%, 53.1%, and 34.6% for RNV and NMV, respectively using the ANN and MCR-ALS models. Various preprocessing techniques were employed to improve signal quality for PLS construction, yielding superior results (RMSEC of 0.19 for both RNV and NMV) compared to the original, unprocessed spectral data (RMSEC of 0.21 for both RNV and NMV). The Principal Component Analysis score plot was constructed, confirming the consistency of the dataset and the absence of systematic errors, enhancing confidence in the models' robustness. A new hybrid variable selection strategy (GA-ICOMP-PLS) was developed to enhance the robustness and parsimony of the GA-PLS model. Prediction error values of 0.15 and 0.14 were successfully achieved for RNV and NMV, respectively, indicating strong predictive power and generalization. Consistent with sustainability and eco-friendly goals, the current study pioneers the usage of green-blue-white alternatives to conventional analytical methods. A comprehensive assessment was conducted using the "Sample Preparation Metric of Sustainability", the "Analytical Greenness metric for Sample Preparation" and the "Analytical Greenness metric" alongside two solvent sustainability evaluation tools. These evaluations yielded promising results, with green quadrant classification and high scores of 5.89, 0.67, and 0.82 for each metric, respectively, as well as satisfactory t- and F-test values. Moreover, the models achieved outstanding results on the RGB12 metric and Blueness Applicability Grade Index, scoring 96.8% and 82.5, respectively, highlighting their broad applicability, high efficiency, and alignment with eco-friendly analytical practices.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Chemistry
BMC Chemistry Chemistry-General Chemistry
CiteScore
5.30
自引率
2.20%
发文量
92
审稿时长
27 weeks
期刊介绍: BMC Chemistry, formerly known as Chemistry Central Journal, is now part of the BMC series journals family. Chemistry Central Journal has served the chemistry community as a trusted open access resource for more than 10 years – and we are delighted to announce the next step on its journey. In January 2019 the journal has been renamed BMC Chemistry and now strengthens the BMC series footprint in the physical sciences by publishing quality articles and by pushing the boundaries of open chemistry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信