Shyam Panjwani, Alice Almazan, Hao Wei, Konstantinos Spetsieris
{"title":"A comparative evaluation of the enhanced PLS-Tree algorithm with multiple latent score vectors","authors":"Shyam Panjwani, Alice Almazan, Hao Wei, Konstantinos Spetsieris","doi":"10.1002/amp2.70004","DOIUrl":null,"url":null,"abstract":"<p>Multicollinearity and heterogeneity are prevalent challenges in the analysis of process industry datasets, necessitating algorithms capable of addressing both simultaneously. The partial least squares (PLS)-Tree algorithm, which integrates PLS regression with decision tree methodologies, stands out by concurrently addressing data heterogeneity and improving predictive performance. However, the PLS-Tree algorithm remains underexplored compared to other machine learning approaches. This study delves into the intricacies of the PLS-Tree algorithm, utilizing synthetic data that mirrors the complexity of real-world process industry scenarios characterized by high collinearity and clustering. This paper further enhances the original PLS-Tree framework by introducing multiple latent score vectors, with the objective of refining the clustering process and boosting predictive accuracy beyond that of standard PLS and regression tree algorithms. Additionally, a comparative analysis is presented, evaluating the performance of the enhanced PLS-Tree against regular PLS and regression tree, highlighting its potential for sophisticated data analysis in the process industries.</p>","PeriodicalId":87290,"journal":{"name":"Journal of advanced manufacturing and processing","volume":"7 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/amp2.70004","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of advanced manufacturing and processing","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/amp2.70004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Multicollinearity and heterogeneity are prevalent challenges in the analysis of process industry datasets, necessitating algorithms capable of addressing both simultaneously. The partial least squares (PLS)-Tree algorithm, which integrates PLS regression with decision tree methodologies, stands out by concurrently addressing data heterogeneity and improving predictive performance. However, the PLS-Tree algorithm remains underexplored compared to other machine learning approaches. This study delves into the intricacies of the PLS-Tree algorithm, utilizing synthetic data that mirrors the complexity of real-world process industry scenarios characterized by high collinearity and clustering. This paper further enhances the original PLS-Tree framework by introducing multiple latent score vectors, with the objective of refining the clustering process and boosting predictive accuracy beyond that of standard PLS and regression tree algorithms. Additionally, a comparative analysis is presented, evaluating the performance of the enhanced PLS-Tree against regular PLS and regression tree, highlighting its potential for sophisticated data analysis in the process industries.