Shyam Panjwani, Alice Almazan, Hao Wei, Konstantinos Spetsieris
{"title":"具有多个潜在评分向量的增强型PLS-Tree算法的比较评价","authors":"Shyam Panjwani, Alice Almazan, Hao Wei, Konstantinos Spetsieris","doi":"10.1002/amp2.70004","DOIUrl":null,"url":null,"abstract":"<p>Multicollinearity and heterogeneity are prevalent challenges in the analysis of process industry datasets, necessitating algorithms capable of addressing both simultaneously. The partial least squares (PLS)-Tree algorithm, which integrates PLS regression with decision tree methodologies, stands out by concurrently addressing data heterogeneity and improving predictive performance. However, the PLS-Tree algorithm remains underexplored compared to other machine learning approaches. This study delves into the intricacies of the PLS-Tree algorithm, utilizing synthetic data that mirrors the complexity of real-world process industry scenarios characterized by high collinearity and clustering. This paper further enhances the original PLS-Tree framework by introducing multiple latent score vectors, with the objective of refining the clustering process and boosting predictive accuracy beyond that of standard PLS and regression tree algorithms. Additionally, a comparative analysis is presented, evaluating the performance of the enhanced PLS-Tree against regular PLS and regression tree, highlighting its potential for sophisticated data analysis in the process industries.</p>","PeriodicalId":87290,"journal":{"name":"Journal of advanced manufacturing and processing","volume":"7 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/amp2.70004","citationCount":"0","resultStr":"{\"title\":\"A comparative evaluation of the enhanced PLS-Tree algorithm with multiple latent score vectors\",\"authors\":\"Shyam Panjwani, Alice Almazan, Hao Wei, Konstantinos Spetsieris\",\"doi\":\"10.1002/amp2.70004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Multicollinearity and heterogeneity are prevalent challenges in the analysis of process industry datasets, necessitating algorithms capable of addressing both simultaneously. The partial least squares (PLS)-Tree algorithm, which integrates PLS regression with decision tree methodologies, stands out by concurrently addressing data heterogeneity and improving predictive performance. However, the PLS-Tree algorithm remains underexplored compared to other machine learning approaches. This study delves into the intricacies of the PLS-Tree algorithm, utilizing synthetic data that mirrors the complexity of real-world process industry scenarios characterized by high collinearity and clustering. This paper further enhances the original PLS-Tree framework by introducing multiple latent score vectors, with the objective of refining the clustering process and boosting predictive accuracy beyond that of standard PLS and regression tree algorithms. Additionally, a comparative analysis is presented, evaluating the performance of the enhanced PLS-Tree against regular PLS and regression tree, highlighting its potential for sophisticated data analysis in the process industries.</p>\",\"PeriodicalId\":87290,\"journal\":{\"name\":\"Journal of advanced manufacturing and processing\",\"volume\":\"7 3\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/amp2.70004\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of advanced manufacturing and processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://aiche.onlinelibrary.wiley.com/doi/10.1002/amp2.70004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of advanced manufacturing and processing","FirstCategoryId":"1085","ListUrlMain":"https://aiche.onlinelibrary.wiley.com/doi/10.1002/amp2.70004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A comparative evaluation of the enhanced PLS-Tree algorithm with multiple latent score vectors
Multicollinearity and heterogeneity are prevalent challenges in the analysis of process industry datasets, necessitating algorithms capable of addressing both simultaneously. The partial least squares (PLS)-Tree algorithm, which integrates PLS regression with decision tree methodologies, stands out by concurrently addressing data heterogeneity and improving predictive performance. However, the PLS-Tree algorithm remains underexplored compared to other machine learning approaches. This study delves into the intricacies of the PLS-Tree algorithm, utilizing synthetic data that mirrors the complexity of real-world process industry scenarios characterized by high collinearity and clustering. This paper further enhances the original PLS-Tree framework by introducing multiple latent score vectors, with the objective of refining the clustering process and boosting predictive accuracy beyond that of standard PLS and regression tree algorithms. Additionally, a comparative analysis is presented, evaluating the performance of the enhanced PLS-Tree against regular PLS and regression tree, highlighting its potential for sophisticated data analysis in the process industries.