Zhibo Gao, Jie Wang, Song Liu, Mingyang Zhao, Fusheng Ouyang
{"title":"基于 AE 和 PCA 算法的 S Zorb 装置原料油特征提取和聚类","authors":"Zhibo Gao, Jie Wang, Song Liu, Mingyang Zhao, Fusheng Ouyang","doi":"10.1134/S0965544124010109","DOIUrl":null,"url":null,"abstract":"<p>Based on the 5-year data on the feed oil characteristics obtained from the S Zorb unit, the outliers in the data were detected using the boxplot and LOF methods, and 536 modeling samples were obtained. Combining MIC with the Pearson correlation coefficient, six characteristics of feed oil including RON, sulfur content, olefin content, aromatic content, density, and vapor pressure were chosen as input variables for the clustering model. Two features were extracted from the six variables by the autoencoder (AE) characterized by the 6-32-2-32-6 neural network structure and PCA algorithm for clustering. Three clustering models were built using AE+K-means, PCA+K-means, and K-means. The results of evaluation showed that the optimal clustering number in these models was three, and the AE+K-means model provided the best clustering effect. According to the clustering centers and the property distribution, the dividing boundaries between three types of feed oils are obvious indicating that the AE+K-means model is available to classify feed oils from the S Zorb unit. On this basis, prediction models for the RON of refined gasoline were built for different types of feed oils to get the optimal operation conditions for the reduction of RON losses of refined gasoline in the S Zorb unit.</p>","PeriodicalId":725,"journal":{"name":"Petroleum Chemistry","volume":"64 3","pages":"385 - 395"},"PeriodicalIF":1.3000,"publicationDate":"2024-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature Extraction and Clustering of Feed Oil from a S Zorb Unit Based on AE and PCA Algorithms\",\"authors\":\"Zhibo Gao, Jie Wang, Song Liu, Mingyang Zhao, Fusheng Ouyang\",\"doi\":\"10.1134/S0965544124010109\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Based on the 5-year data on the feed oil characteristics obtained from the S Zorb unit, the outliers in the data were detected using the boxplot and LOF methods, and 536 modeling samples were obtained. Combining MIC with the Pearson correlation coefficient, six characteristics of feed oil including RON, sulfur content, olefin content, aromatic content, density, and vapor pressure were chosen as input variables for the clustering model. Two features were extracted from the six variables by the autoencoder (AE) characterized by the 6-32-2-32-6 neural network structure and PCA algorithm for clustering. Three clustering models were built using AE+K-means, PCA+K-means, and K-means. The results of evaluation showed that the optimal clustering number in these models was three, and the AE+K-means model provided the best clustering effect. According to the clustering centers and the property distribution, the dividing boundaries between three types of feed oils are obvious indicating that the AE+K-means model is available to classify feed oils from the S Zorb unit. On this basis, prediction models for the RON of refined gasoline were built for different types of feed oils to get the optimal operation conditions for the reduction of RON losses of refined gasoline in the S Zorb unit.</p>\",\"PeriodicalId\":725,\"journal\":{\"name\":\"Petroleum Chemistry\",\"volume\":\"64 3\",\"pages\":\"385 - 395\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-04-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Petroleum Chemistry\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1134/S0965544124010109\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CHEMISTRY, ORGANIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Petroleum Chemistry","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1134/S0965544124010109","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, ORGANIC","Score":null,"Total":0}
引用次数: 0
摘要
摘要基于S Zorb装置5年来的原料油特性数据,采用箱形图法和LOF法检测数据中的异常值,得到536个建模样本。将 MIC 与皮尔逊相关系数相结合,选择 RON、硫含量、烯烃含量、芳烃含量、密度和蒸汽压等六个原料油特征作为聚类模型的输入变量。采用 6-32-2-32-6 神经网络结构的自动编码器(AE)和 PCA 算法从六个变量中提取两个特征进行聚类。利用 AE+K-means、PCA+K-means 和 K-means 建立了三种聚类模型。评估结果表明,这些模型的最佳聚类数为 3,其中 AE+K-means 模型的聚类效果最好。根据聚类中心和属性分布,三类原料油的划分界限明显,表明 AE+K-means 模型可用于 S Zorb 装置原料油的分类。在此基础上,建立了不同类型进料油的精制汽油 RON 值预测模型,以获得 S Zorb 装置减少精制汽油 RON 损失的最佳操作条件。
Feature Extraction and Clustering of Feed Oil from a S Zorb Unit Based on AE and PCA Algorithms
Based on the 5-year data on the feed oil characteristics obtained from the S Zorb unit, the outliers in the data were detected using the boxplot and LOF methods, and 536 modeling samples were obtained. Combining MIC with the Pearson correlation coefficient, six characteristics of feed oil including RON, sulfur content, olefin content, aromatic content, density, and vapor pressure were chosen as input variables for the clustering model. Two features were extracted from the six variables by the autoencoder (AE) characterized by the 6-32-2-32-6 neural network structure and PCA algorithm for clustering. Three clustering models were built using AE+K-means, PCA+K-means, and K-means. The results of evaluation showed that the optimal clustering number in these models was three, and the AE+K-means model provided the best clustering effect. According to the clustering centers and the property distribution, the dividing boundaries between three types of feed oils are obvious indicating that the AE+K-means model is available to classify feed oils from the S Zorb unit. On this basis, prediction models for the RON of refined gasoline were built for different types of feed oils to get the optimal operation conditions for the reduction of RON losses of refined gasoline in the S Zorb unit.
期刊介绍:
Petroleum Chemistry (Neftekhimiya), founded in 1961, offers original papers on and reviews of theoretical and experimental studies concerned with current problems of petroleum chemistry and processing such as chemical composition of crude oils and natural gas liquids; petroleum refining (cracking, hydrocracking, and catalytic reforming); catalysts for petrochemical processes (hydrogenation, isomerization, oxidation, hydroformylation, etc.); activation and catalytic transformation of hydrocarbons and other components of petroleum, natural gas, and other complex organic mixtures; new petrochemicals including lubricants and additives; environmental problems; and information on scientific meetings relevant to these areas.
Petroleum Chemistry publishes articles on these topics from members of the scientific community of the former Soviet Union.