A systematic data-driven modelling framework for nonlinear distillation processes incorporating data intervals clustering and new integrated learning algorithm
{"title":"A systematic data-driven modelling framework for nonlinear distillation processes incorporating data intervals clustering and new integrated learning algorithm","authors":"Zhe Wang , Renchu He , Jian Long","doi":"10.1016/j.cjche.2025.02.013","DOIUrl":null,"url":null,"abstract":"<div><div>The distillation process is an important chemical process, and the application of data-driven modelling approach has the potential to reduce model complexity compared to mechanistic modelling, thus improving the efficiency of process optimization or monitoring studies. However, the distillation process is highly nonlinear and has multiple uncertainty perturbation intervals, which brings challenges to accurate data-driven modelling of distillation processes. This paper proposes a systematic data-driven modelling framework to solve these problems. Firstly, data segment variance was introduced into the K-means algorithm to form K-means data interval (KMDI) clustering in order to cluster the data into perturbed and steady state intervals for steady-state data extraction. Secondly, maximal information coefficient (MIC) was employed to calculate the nonlinear correlation between variables for removing redundant features. Finally, extreme gradient boosting (XGBoost) was integrated as the basic learner into adaptive boosting (AdaBoost) with the error threshold (ET) set to improve weights update strategy to construct the new integrated learning algorithm, XGBoost-AdaBoost-ET. The superiority of the proposed framework is verified by applying this data-driven modelling framework to a real industrial process of propylene distillation.</div></div>","PeriodicalId":9966,"journal":{"name":"Chinese Journal of Chemical Engineering","volume":"81 ","pages":"Pages 182-199"},"PeriodicalIF":3.7000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Journal of Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1004954125000953","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0
Abstract
The distillation process is an important chemical process, and the application of data-driven modelling approach has the potential to reduce model complexity compared to mechanistic modelling, thus improving the efficiency of process optimization or monitoring studies. However, the distillation process is highly nonlinear and has multiple uncertainty perturbation intervals, which brings challenges to accurate data-driven modelling of distillation processes. This paper proposes a systematic data-driven modelling framework to solve these problems. Firstly, data segment variance was introduced into the K-means algorithm to form K-means data interval (KMDI) clustering in order to cluster the data into perturbed and steady state intervals for steady-state data extraction. Secondly, maximal information coefficient (MIC) was employed to calculate the nonlinear correlation between variables for removing redundant features. Finally, extreme gradient boosting (XGBoost) was integrated as the basic learner into adaptive boosting (AdaBoost) with the error threshold (ET) set to improve weights update strategy to construct the new integrated learning algorithm, XGBoost-AdaBoost-ET. The superiority of the proposed framework is verified by applying this data-driven modelling framework to a real industrial process of propylene distillation.
期刊介绍:
The Chinese Journal of Chemical Engineering (Monthly, started in 1982) is the official journal of the Chemical Industry and Engineering Society of China and published by the Chemical Industry Press Co. Ltd. The aim of the journal is to develop the international exchange of scientific and technical information in the field of chemical engineering. It publishes original research papers that cover the major advancements and achievements in chemical engineering in China as well as some articles from overseas contributors.
The topics of journal include chemical engineering, chemical technology, biochemical engineering, energy and environmental engineering and other relevant fields. Papers are published on the basis of their relevance to theoretical research, practical application or potential uses in the industry as Research Papers, Communications, Reviews and Perspectives. Prominent domestic and overseas chemical experts and scholars have been invited to form an International Advisory Board and the Editorial Committee. It enjoys recognition among Chinese academia and industry as a reliable source of information of what is going on in chemical engineering research, both domestic and abroad.