Zhonghai He, Huilong Sheng, Yi Zhang, Xiaofang Zhang
{"title":"通过优化PLS建模中的Logistic损失函数来提高分级精度","authors":"Zhonghai He, Huilong Sheng, Yi Zhang, Xiaofang Zhang","doi":"10.1002/cem.70064","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The prediction results from Partial Least Squares (PLS) model are commonly used to assess whether a product meets quality standards, or whether adjustments are needed in production process parameters. It's easy to understand that misgrading is mostly occurred for marginal samples (samples near the threshold). We propose Logistic-Enhanced PLS (LE-PLS) model, which defines a logistic loss function and minimizes it via gradient descent to optimize the PLS projection vector. The prediction result of LE-PLS for marginal samples tends to be far away from the threshold value. This optimization enables LE-PLS to enhance grading capability while largely maintaining the regression accuracy of the PLS. LE-PLS was evaluated on two real-world datasets (bean pulp and corn gluten meal) and one simulated dataset, correcting 10 out of 19 misgraded samples, 6 out of 7, and 6 out of 12, respectively. Statistical analysis using paired <i>t</i>-tests confirmed that these improvements were significant. Although RMSEP increased slightly, the change remained within an acceptable range considering the substantial enhancement in grading performance. The algorithm has a computational complexity of <span></span><math>\n <semantics>\n <mrow>\n <mi>O</mi>\n <mfenced>\n <mrow>\n <mtext>iteration</mtext>\n <mo>*</mo>\n <mtext>samples</mtext>\n <mo>*</mo>\n <mtext>variables</mtext>\n </mrow>\n </mfenced>\n </mrow>\n <annotation>$$ \\mathrm{O}\\left({\\mathrm{iteration}}^{\\ast }{\\mathrm{samples}}^{\\ast}\\mathrm{variables}\\right) $$</annotation>\n </semantics></math> during modeling training. However, its prediction-phase complexity is only <span></span><math>\n <semantics>\n <mrow>\n <mi>O</mi>\n <mfenced>\n <mrow>\n <mtext>samples</mtext>\n <mo>*</mo>\n <mtext>variables</mtext>\n </mrow>\n </mfenced>\n </mrow>\n <annotation>$$ \\mathrm{O}\\left({\\mathrm{samples}}^{\\ast}\\mathrm{variables}\\right) $$</annotation>\n </semantics></math>. Given these advantages, LE-PLS is well-suited for practical applications in NIR-based quality grading of products.</p>\n </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 9","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving Grading Accuracy by Optimizing the Logistic Loss Function in PLS Modelling\",\"authors\":\"Zhonghai He, Huilong Sheng, Yi Zhang, Xiaofang Zhang\",\"doi\":\"10.1002/cem.70064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>The prediction results from Partial Least Squares (PLS) model are commonly used to assess whether a product meets quality standards, or whether adjustments are needed in production process parameters. It's easy to understand that misgrading is mostly occurred for marginal samples (samples near the threshold). We propose Logistic-Enhanced PLS (LE-PLS) model, which defines a logistic loss function and minimizes it via gradient descent to optimize the PLS projection vector. The prediction result of LE-PLS for marginal samples tends to be far away from the threshold value. This optimization enables LE-PLS to enhance grading capability while largely maintaining the regression accuracy of the PLS. LE-PLS was evaluated on two real-world datasets (bean pulp and corn gluten meal) and one simulated dataset, correcting 10 out of 19 misgraded samples, 6 out of 7, and 6 out of 12, respectively. Statistical analysis using paired <i>t</i>-tests confirmed that these improvements were significant. Although RMSEP increased slightly, the change remained within an acceptable range considering the substantial enhancement in grading performance. The algorithm has a computational complexity of <span></span><math>\\n <semantics>\\n <mrow>\\n <mi>O</mi>\\n <mfenced>\\n <mrow>\\n <mtext>iteration</mtext>\\n <mo>*</mo>\\n <mtext>samples</mtext>\\n <mo>*</mo>\\n <mtext>variables</mtext>\\n </mrow>\\n </mfenced>\\n </mrow>\\n <annotation>$$ \\\\mathrm{O}\\\\left({\\\\mathrm{iteration}}^{\\\\ast }{\\\\mathrm{samples}}^{\\\\ast}\\\\mathrm{variables}\\\\right) $$</annotation>\\n </semantics></math> during modeling training. However, its prediction-phase complexity is only <span></span><math>\\n <semantics>\\n <mrow>\\n <mi>O</mi>\\n <mfenced>\\n <mrow>\\n <mtext>samples</mtext>\\n <mo>*</mo>\\n <mtext>variables</mtext>\\n </mrow>\\n </mfenced>\\n </mrow>\\n <annotation>$$ \\\\mathrm{O}\\\\left({\\\\mathrm{samples}}^{\\\\ast}\\\\mathrm{variables}\\\\right) $$</annotation>\\n </semantics></math>. Given these advantages, LE-PLS is well-suited for practical applications in NIR-based quality grading of products.</p>\\n </div>\",\"PeriodicalId\":15274,\"journal\":{\"name\":\"Journal of Chemometrics\",\"volume\":\"39 9\",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemometrics\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/10.1002/cem.70064\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SOCIAL WORK\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemometrics","FirstCategoryId":"92","ListUrlMain":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/10.1002/cem.70064","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL WORK","Score":null,"Total":0}
Improving Grading Accuracy by Optimizing the Logistic Loss Function in PLS Modelling
The prediction results from Partial Least Squares (PLS) model are commonly used to assess whether a product meets quality standards, or whether adjustments are needed in production process parameters. It's easy to understand that misgrading is mostly occurred for marginal samples (samples near the threshold). We propose Logistic-Enhanced PLS (LE-PLS) model, which defines a logistic loss function and minimizes it via gradient descent to optimize the PLS projection vector. The prediction result of LE-PLS for marginal samples tends to be far away from the threshold value. This optimization enables LE-PLS to enhance grading capability while largely maintaining the regression accuracy of the PLS. LE-PLS was evaluated on two real-world datasets (bean pulp and corn gluten meal) and one simulated dataset, correcting 10 out of 19 misgraded samples, 6 out of 7, and 6 out of 12, respectively. Statistical analysis using paired t-tests confirmed that these improvements were significant. Although RMSEP increased slightly, the change remained within an acceptable range considering the substantial enhancement in grading performance. The algorithm has a computational complexity of during modeling training. However, its prediction-phase complexity is only . Given these advantages, LE-PLS is well-suited for practical applications in NIR-based quality grading of products.
期刊介绍:
The Journal of Chemometrics is devoted to the rapid publication of original scientific papers, reviews and short communications on fundamental and applied aspects of chemometrics. It also provides a forum for the exchange of information on meetings and other news relevant to the growing community of scientists who are interested in chemometrics and its applications. Short, critical review papers are a particularly important feature of the journal, in view of the multidisciplinary readership at which it is aimed.