具有多个潜在评分向量的增强型PLS-Tree算法的比较评价

Shyam Panjwani, Alice Almazan, Hao Wei, Konstantinos Spetsieris
{"title":"具有多个潜在评分向量的增强型PLS-Tree算法的比较评价","authors":"Shyam Panjwani,&nbsp;Alice Almazan,&nbsp;Hao Wei,&nbsp;Konstantinos Spetsieris","doi":"10.1002/amp2.70004","DOIUrl":null,"url":null,"abstract":"<p>Multicollinearity and heterogeneity are prevalent challenges in the analysis of process industry datasets, necessitating algorithms capable of addressing both simultaneously. The partial least squares (PLS)-Tree algorithm, which integrates PLS regression with decision tree methodologies, stands out by concurrently addressing data heterogeneity and improving predictive performance. However, the PLS-Tree algorithm remains underexplored compared to other machine learning approaches. This study delves into the intricacies of the PLS-Tree algorithm, utilizing synthetic data that mirrors the complexity of real-world process industry scenarios characterized by high collinearity and clustering. This paper further enhances the original PLS-Tree framework by introducing multiple latent score vectors, with the objective of refining the clustering process and boosting predictive accuracy beyond that of standard PLS and regression tree algorithms. Additionally, a comparative analysis is presented, evaluating the performance of the enhanced PLS-Tree against regular PLS and regression tree, highlighting its potential for sophisticated data analysis in the process industries.</p>","PeriodicalId":87290,"journal":{"name":"Journal of advanced manufacturing and processing","volume":"7 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/amp2.70004","citationCount":"0","resultStr":"{\"title\":\"A comparative evaluation of the enhanced PLS-Tree algorithm with multiple latent score vectors\",\"authors\":\"Shyam Panjwani,&nbsp;Alice Almazan,&nbsp;Hao Wei,&nbsp;Konstantinos Spetsieris\",\"doi\":\"10.1002/amp2.70004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Multicollinearity and heterogeneity are prevalent challenges in the analysis of process industry datasets, necessitating algorithms capable of addressing both simultaneously. The partial least squares (PLS)-Tree algorithm, which integrates PLS regression with decision tree methodologies, stands out by concurrently addressing data heterogeneity and improving predictive performance. However, the PLS-Tree algorithm remains underexplored compared to other machine learning approaches. This study delves into the intricacies of the PLS-Tree algorithm, utilizing synthetic data that mirrors the complexity of real-world process industry scenarios characterized by high collinearity and clustering. This paper further enhances the original PLS-Tree framework by introducing multiple latent score vectors, with the objective of refining the clustering process and boosting predictive accuracy beyond that of standard PLS and regression tree algorithms. Additionally, a comparative analysis is presented, evaluating the performance of the enhanced PLS-Tree against regular PLS and regression tree, highlighting its potential for sophisticated data analysis in the process industries.</p>\",\"PeriodicalId\":87290,\"journal\":{\"name\":\"Journal of advanced manufacturing and processing\",\"volume\":\"7 3\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/amp2.70004\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of advanced manufacturing and processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://aiche.onlinelibrary.wiley.com/doi/10.1002/amp2.70004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of advanced manufacturing and processing","FirstCategoryId":"1085","ListUrlMain":"https://aiche.onlinelibrary.wiley.com/doi/10.1002/amp2.70004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

多重共线性和异质性是过程工业数据集分析中普遍存在的挑战,需要能够同时解决这两个问题的算法。偏最小二乘(PLS)-树算法将PLS回归与决策树方法相结合,通过同时解决数据异质性和提高预测性能而脱颖而出。然而,与其他机器学习方法相比,PLS-Tree算法仍未得到充分探索。本研究深入研究了PLS-Tree算法的复杂性,利用合成数据,反映了现实世界过程工业场景的复杂性,其特点是高共线性和聚类。本文通过引入多个潜在评分向量进一步增强了原始PLS- tree框架,目的是改进聚类过程,提高预测精度,超过标准PLS和回归树算法。此外,还提出了一项比较分析,评估了增强型PLS- tree与常规PLS和回归树的性能,突出了其在过程工业中复杂数据分析的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

A comparative evaluation of the enhanced PLS-Tree algorithm with multiple latent score vectors

A comparative evaluation of the enhanced PLS-Tree algorithm with multiple latent score vectors

A comparative evaluation of the enhanced PLS-Tree algorithm with multiple latent score vectors

A comparative evaluation of the enhanced PLS-Tree algorithm with multiple latent score vectors

A comparative evaluation of the enhanced PLS-Tree algorithm with multiple latent score vectors

Multicollinearity and heterogeneity are prevalent challenges in the analysis of process industry datasets, necessitating algorithms capable of addressing both simultaneously. The partial least squares (PLS)-Tree algorithm, which integrates PLS regression with decision tree methodologies, stands out by concurrently addressing data heterogeneity and improving predictive performance. However, the PLS-Tree algorithm remains underexplored compared to other machine learning approaches. This study delves into the intricacies of the PLS-Tree algorithm, utilizing synthetic data that mirrors the complexity of real-world process industry scenarios characterized by high collinearity and clustering. This paper further enhances the original PLS-Tree framework by introducing multiple latent score vectors, with the objective of refining the clustering process and boosting predictive accuracy beyond that of standard PLS and regression tree algorithms. Additionally, a comparative analysis is presented, evaluating the performance of the enhanced PLS-Tree against regular PLS and regression tree, highlighting its potential for sophisticated data analysis in the process industries.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.50
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信