Anode Effect prediction based on Expectation Maximization and XGBoost model
Zhixin Zhang, Gaofeng Xu, Hongting Wang, Kaibo Zhou
2018 IEEE 7th Data Driven Control and Learning Systems Conference (DDCLS), pp. 560-564
Published: 2018-05-01
DOI: https://doi.org/10.1109/DDCLS.2018.8516046
Citations: 4
Abstract
Anode effect prediction has attracted considerable research interest because of its value in reducing energy consumption and improving the efficiency of aluminum electrolysis. However, existing work often neglects the large number of missing values in the data collected from aluminum reduction cells, which degrades prediction accuracy and generalization ability. To address this problem, a combined model of Expectation Maximization and XGBoost (EM-XGBoost) is proposed. First, the incomplete samples collected from the aluminum cells are recovered with the Expectation Maximization (EM) algorithm. The XGBoost model is then trained on the recovered data and used to predict the result for new samples. Accuracy and F1 score are adopted as more comprehensive evaluation metrics. Experimental results show that, when forecasting 30 minutes in advance, the proposed model improves accuracy to 99.7% and achieves an F1 score of 99.8%. The proposed model thus offers both high prediction accuracy and strong generalization ability.
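
The abstract does not say which EM formulation is used to recover the missing values. As a minimal sketch of the recovery step, the code below assumes that the samples follow a single multivariate Gaussian and that missing entries are marked as NaN; the function name `em_impute` and its parameters are hypothetical and are not taken from the paper. Each E-step replaces a missing entry with its conditional mean given the observed entries of the same row, and each M-step re-estimates the mean and covariance from the completed data.

```python
import numpy as np

def em_impute(X, n_iter=50, tol=1e-6, ridge=1e-6):
    """Fill NaN entries of X by EM under a single-Gaussian assumption.

    Assumes every column has at least one observed value.
    """
    X = np.asarray(X, dtype=float)
    n, d = X.shape
    missing = np.isnan(X)

    # Initialise: column means of observed entries, covariance of mean-filled data.
    mu = np.nanmean(X, axis=0)
    filled = np.where(missing, mu, X)
    sigma = np.cov(filled, rowvar=False).reshape(d, d) + ridge * np.eye(d)

    for _ in range(n_iter):
        prev = filled.copy()
        cov_corr = np.zeros((d, d))  # conditional covariance of the missing parts

        # E-step: conditional mean of missing entries given observed ones, row by row.
        for i in range(n):
            m = missing[i]
            if not m.any():
                continue
            o = ~m
            if not o.any():
                # Nothing observed in this row: fall back to the current mean.
                filled[i] = mu
                cov_corr += sigma
                continue
            s_oo = sigma[np.ix_(o, o)]
            s_mo = sigma[np.ix_(m, o)]
            # k = s_mo @ inv(s_oo), computed with a linear solve (s_oo is symmetric).
            k = np.linalg.solve(s_oo, s_mo.T).T
            filled[i, m] = mu[m] + k @ (X[i, o] - mu[o])
            cov_corr[np.ix_(m, m)] += sigma[np.ix_(m, m)] - k @ s_mo.T

        # M-step: update mean and covariance from the completed data.
        mu = filled.mean(axis=0)
        centered = filled - mu
        sigma = (centered.T @ centered + cov_corr) / n + ridge * np.eye(d)

        # Stop once the imputed values no longer change noticeably.
        if np.max(np.abs(filled - prev)) < tol:
            break

    return filled
```

In a pipeline like the one described, the array returned by `em_impute` would then be used to train a gradient-boosted classifier (for example `xgboost.XGBClassifier`), and accuracy and F1 score would be computed on held-out samples. That part is omitted here to keep the sketch dependent on NumPy only.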