Clement Lartey , Richmond K. Asamoah , Christopher Greet , Massimiliano Zanin , Jixue Liu
{"title":"An interpretable and generalised machine learning model for predicting flotation performance","authors":"Clement Lartey , Richmond K. Asamoah , Christopher Greet , Massimiliano Zanin , Jixue Liu","doi":"10.1016/j.mineng.2025.109492","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advances in sensor technology and machine learning (ML) have created new opportunities to predict and improve flotation performance in mineral processing. However, existing flotation performance prediction models lack interpretability and are mostly poorly generalised, making drawing insights from model outcomes difficult. This study addresses these gaps by developing a balanced flotation model using an Extreme Gradient Boosting (XGBoost) algorithm. To interpret the model, we visualise the decision-making process using a tree plot and analyse the prediction path of the model. We employed SHAP (SHapely Addictive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) to quantify the contribution of each input variable to the model’s predictions. Sensitivity analysis of the input variables revealed patterns that were consistent with the interpretability results, providing additional validation of the model’s decision-making process. The prediction results showed that XGBoost (Extreme Gradient Boosting) model demonstrated superior performance, achieving coefficient of determination (<span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span>) values of 0.97 and 0.92 for training and testing datasets respectively. This performance surpassed comparative models including Gaussian Process Regression (GPR) with <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> values of 0.91 and 0.88 and Support Vector Regression (SVR) yielding the lowest <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> values of 0.76 and 0.75 on training and testing datasets respectively. This study enhances flotation performance prediction while providing clear insights into the model prediction outcomes.</div></div>","PeriodicalId":18594,"journal":{"name":"Minerals Engineering","volume":"232 ","pages":"Article 109492"},"PeriodicalIF":4.9000,"publicationDate":"2025-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Minerals Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0892687525003206","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Recent advances in sensor technology and machine learning (ML) have created new opportunities to predict and improve flotation performance in mineral processing. However, existing flotation performance prediction models lack interpretability and are mostly poorly generalised, making drawing insights from model outcomes difficult. This study addresses these gaps by developing a balanced flotation model using an Extreme Gradient Boosting (XGBoost) algorithm. To interpret the model, we visualise the decision-making process using a tree plot and analyse the prediction path of the model. We employed SHAP (SHapely Addictive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) to quantify the contribution of each input variable to the model’s predictions. Sensitivity analysis of the input variables revealed patterns that were consistent with the interpretability results, providing additional validation of the model’s decision-making process. The prediction results showed that XGBoost (Extreme Gradient Boosting) model demonstrated superior performance, achieving coefficient of determination () values of 0.97 and 0.92 for training and testing datasets respectively. This performance surpassed comparative models including Gaussian Process Regression (GPR) with values of 0.91 and 0.88 and Support Vector Regression (SVR) yielding the lowest values of 0.76 and 0.75 on training and testing datasets respectively. This study enhances flotation performance prediction while providing clear insights into the model prediction outcomes.
期刊介绍:
The purpose of the journal is to provide for the rapid publication of topical papers featuring the latest developments in the allied fields of mineral processing and extractive metallurgy. Its wide ranging coverage of research and practical (operating) topics includes physical separation methods, such as comminution, flotation concentration and dewatering, chemical methods such as bio-, hydro-, and electro-metallurgy, analytical techniques, process control, simulation and instrumentation, and mineralogical aspects of processing. Environmental issues, particularly those pertaining to sustainable development, will also be strongly covered.