Decoding the synergy: unveiling gradient boosting regression model for multivariate quantitation of pioglitazone, alogliptin and glimepiride in pure and tablet dosage forms
Mahmoud M. Elkhoudary, Aya A. Marie, Sherin F. Hammad, Mohamed M. Salim, Amira H. Kamal
{"title":"Decoding the synergy: unveiling gradient boosting regression model for multivariate quantitation of pioglitazone, alogliptin and glimepiride in pure and tablet dosage forms","authors":"Mahmoud M. Elkhoudary, Aya A. Marie, Sherin F. Hammad, Mohamed M. Salim, Amira H. Kamal","doi":"10.1186/s13065-024-01351-8","DOIUrl":null,"url":null,"abstract":"<div><p>This study represents a comparison among the performances of four multivariate procedures: partial least square (PLS) and artificial neural networks (ANN) in addition to support vector regression (SVR) and extreme gradient boosting (XG Boost) algorithm for the determination of the anti-diabetic mixture of pioglitazone (PIO), alogliptin (ALG) and glimepiride (GLM) in pharmaceutical formulations with aid of UV spectrometry. Key wavelengths were selected using knowledge-based variable selection and various preprocessing methods (e.g., mean centering, orthogonal scatter correction, and principal component analysis) to minimize noise and improve model precision. XG Boost effectively enhanced computing speed and accuracy by focusing on specific spectral features rather than the entire spectrum, demonstrating its advantages in resolving complex, overlapping spectral data. The independent test results of different models demonstrated that XG Boost outperformed other methods. XG Boost achieved the lowest root mean squared error of prediction (RMSEP) and standard deviation (SD) values across all compounds, indicating minimal prediction error and variability. For PIO, XG Boost recorded an RMSEP of 0.100 and SD of 0.369, significantly better than PLS and ANN. For ALG, XG Boost showed near-perfect performance with an RMSEP of 0.001 and SD of 0.005, outperforming SVR and PLS, which had higher error rates. In the case of GLM, XG Boost also excelled with an RMSEP of 0.001 and SD of 0.018, demonstrating superior precision compared to the much higher errors seen in PLS and ANN. These results highlight XG Boost’s exceptional ability to handle complex, overlapping spectral data, making it the most reliable and accurate model in this study.</p></div>","PeriodicalId":496,"journal":{"name":"BMC Chemistry","volume":"18 1","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bmcchem.biomedcentral.com/counter/pdf/10.1186/s13065-024-01351-8","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Chemistry","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1186/s13065-024-01351-8","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
This study represents a comparison among the performances of four multivariate procedures: partial least square (PLS) and artificial neural networks (ANN) in addition to support vector regression (SVR) and extreme gradient boosting (XG Boost) algorithm for the determination of the anti-diabetic mixture of pioglitazone (PIO), alogliptin (ALG) and glimepiride (GLM) in pharmaceutical formulations with aid of UV spectrometry. Key wavelengths were selected using knowledge-based variable selection and various preprocessing methods (e.g., mean centering, orthogonal scatter correction, and principal component analysis) to minimize noise and improve model precision. XG Boost effectively enhanced computing speed and accuracy by focusing on specific spectral features rather than the entire spectrum, demonstrating its advantages in resolving complex, overlapping spectral data. The independent test results of different models demonstrated that XG Boost outperformed other methods. XG Boost achieved the lowest root mean squared error of prediction (RMSEP) and standard deviation (SD) values across all compounds, indicating minimal prediction error and variability. For PIO, XG Boost recorded an RMSEP of 0.100 and SD of 0.369, significantly better than PLS and ANN. For ALG, XG Boost showed near-perfect performance with an RMSEP of 0.001 and SD of 0.005, outperforming SVR and PLS, which had higher error rates. In the case of GLM, XG Boost also excelled with an RMSEP of 0.001 and SD of 0.018, demonstrating superior precision compared to the much higher errors seen in PLS and ANN. These results highlight XG Boost’s exceptional ability to handle complex, overlapping spectral data, making it the most reliable and accurate model in this study.
期刊介绍:
BMC Chemistry, formerly known as Chemistry Central Journal, is now part of the BMC series journals family.
Chemistry Central Journal has served the chemistry community as a trusted open access resource for more than 10 years – and we are delighted to announce the next step on its journey. In January 2019 the journal has been renamed BMC Chemistry and now strengthens the BMC series footprint in the physical sciences by publishing quality articles and by pushing the boundaries of open chemistry.