Decoding the synergy: unveiling gradient boosting regression model for multivariate quantitation of pioglitazone, alogliptin and glimepiride in pure and tablet dosage forms

IF 4.3 2区化学 Q2 CHEMISTRY, MULTIDISCIPLINARY

BMC Chemistry Pub Date : 2024-11-29 DOI:10.1186/s13065-024-01351-8

Mahmoud M. Elkhoudary, Aya A. Marie, Sherin F. Hammad, Mohamed M. Salim, Amira H. Kamal

{"title":"Decoding the synergy: unveiling gradient boosting regression model for multivariate quantitation of pioglitazone, alogliptin and glimepiride in pure and tablet dosage forms","authors":"Mahmoud M. Elkhoudary, Aya A. Marie, Sherin F. Hammad, Mohamed M. Salim, Amira H. Kamal","doi":"10.1186/s13065-024-01351-8","DOIUrl":null,"url":null,"abstract":"<div><p>This study represents a comparison among the performances of four multivariate procedures: partial least square (PLS) and artificial neural networks (ANN) in addition to support vector regression (SVR) and extreme gradient boosting (XG Boost) algorithm for the determination of the anti-diabetic mixture of pioglitazone (PIO), alogliptin (ALG) and glimepiride (GLM) in pharmaceutical formulations with aid of UV spectrometry. Key wavelengths were selected using knowledge-based variable selection and various preprocessing methods (e.g., mean centering, orthogonal scatter correction, and principal component analysis) to minimize noise and improve model precision. XG Boost effectively enhanced computing speed and accuracy by focusing on specific spectral features rather than the entire spectrum, demonstrating its advantages in resolving complex, overlapping spectral data. The independent test results of different models demonstrated that XG Boost outperformed other methods. XG Boost achieved the lowest root mean squared error of prediction (RMSEP) and standard deviation (SD) values across all compounds, indicating minimal prediction error and variability. For PIO, XG Boost recorded an RMSEP of 0.100 and SD of 0.369, significantly better than PLS and ANN. For ALG, XG Boost showed near-perfect performance with an RMSEP of 0.001 and SD of 0.005, outperforming SVR and PLS, which had higher error rates. In the case of GLM, XG Boost also excelled with an RMSEP of 0.001 and SD of 0.018, demonstrating superior precision compared to the much higher errors seen in PLS and ANN. These results highlight XG Boost’s exceptional ability to handle complex, overlapping spectral data, making it the most reliable and accurate model in this study.</p></div>","PeriodicalId":496,"journal":{"name":"BMC Chemistry","volume":"18 1","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bmcchem.biomedcentral.com/counter/pdf/10.1186/s13065-024-01351-8","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Chemistry","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1186/s13065-024-01351-8","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

This study represents a comparison among the performances of four multivariate procedures: partial least square (PLS) and artificial neural networks (ANN) in addition to support vector regression (SVR) and extreme gradient boosting (XG Boost) algorithm for the determination of the anti-diabetic mixture of pioglitazone (PIO), alogliptin (ALG) and glimepiride (GLM) in pharmaceutical formulations with aid of UV spectrometry. Key wavelengths were selected using knowledge-based variable selection and various preprocessing methods (e.g., mean centering, orthogonal scatter correction, and principal component analysis) to minimize noise and improve model precision. XG Boost effectively enhanced computing speed and accuracy by focusing on specific spectral features rather than the entire spectrum, demonstrating its advantages in resolving complex, overlapping spectral data. The independent test results of different models demonstrated that XG Boost outperformed other methods. XG Boost achieved the lowest root mean squared error of prediction (RMSEP) and standard deviation (SD) values across all compounds, indicating minimal prediction error and variability. For PIO, XG Boost recorded an RMSEP of 0.100 and SD of 0.369, significantly better than PLS and ANN. For ALG, XG Boost showed near-perfect performance with an RMSEP of 0.001 and SD of 0.005, outperforming SVR and PLS, which had higher error rates. In the case of GLM, XG Boost also excelled with an RMSEP of 0.001 and SD of 0.018, demonstrating superior precision compared to the much higher errors seen in PLS and ANN. These results highlight XG Boost’s exceptional ability to handle complex, overlapping spectral data, making it the most reliable and accurate model in this study.

查看原文本刊更多论文

解码协同作用：揭示梯度增强回归模型的多变量定量吡格列酮，阿格列汀和格列美脲在纯剂型和片剂形式

本研究比较了偏最小二乘法（PLS）和人工神经网络（ANN）、支持向量回归（SVR）和极端梯度增强（XG Boost）算法在紫外光谱法测定药物配方中吡格列酮（PIO）、阿格列汀（ALG）和格列吡脲（GLM）的性能。利用基于知识的变量选择和各种预处理方法（如均值定心、正交散射校正和主成分分析）选择关键波长，以最大限度地减少噪声，提高模型精度。XG Boost通过专注于特定的光谱特征而不是整个光谱，有效地提高了计算速度和精度，在解决复杂、重叠的光谱数据方面显示出优势。不同模型的独立测试结果表明，XG Boost优于其他方法。XG Boost在所有化合物中实现了最低的预测均方根误差（RMSEP）和标准差（SD）值，表明预测误差和可变性最小。对于PIO， XG Boost的RMSEP为0.100，SD为0.369，显著优于PLS和ANN。对于ALG， XG Boost表现出近乎完美的性能，RMSEP为0.001，SD为0.005，优于具有较高错误率的SVR和PLS。在GLM的情况下，XG Boost也以0.001的RMSEP和0.018的SD表现出色，与PLS和ANN中看到的更高的误差相比，显示出更高的精度。这些结果突出了XG Boost处理复杂、重叠光谱数据的卓越能力，使其成为本研究中最可靠、最准确的模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Chemistry Chemistry-General Chemistry

CiteScore

5.30

自引率

2.20%

发文量

审稿时长

27 weeks

期刊介绍： BMC Chemistry, formerly known as Chemistry Central Journal, is now part of the BMC series journals family. Chemistry Central Journal has served the chemistry community as a trusted open access resource for more than 10 years – and we are delighted to announce the next step on its journey. In January 2019 the journal has been renamed BMC Chemistry and now strengthens the BMC series footprint in the physical sciences by publishing quality articles and by pushing the boundaries of open chemistry.