Mostafa Hosseini , Richard Boudreault , Yuri Leonenko
{"title":"Insights into predicting equilibrium conditions of clathrate hydrates of methane + water-soluble hydrate former","authors":"Mostafa Hosseini , Richard Boudreault , Yuri Leonenko","doi":"10.1016/j.fuel.2024.133945","DOIUrl":null,"url":null,"abstract":"<div><div>This study aims to improve the prediction of equilibrium conditions in methane hydrate systems by incorporating diverse water-soluble hydrate formers and applying advanced machine learning techniques. Methane hydrates, which naturally form under high pressure and low temperature, can be more efficiently formed or dissociated by altering thermodynamic conditions using these hydrate formers. Accurate prediction of these conditions is crucial for optimizing gas storage and energy applications. In this research, molecular descriptors and operational parameters, such as mole fraction and pressure, are used as input variables to predict equilibrium temperature. Machine learning methods, including Decision Trees (DT), Random Forests (RF), Support Vector Machines (SVM), and Multi-Layer Perceptron (MLP), were employed with a novel data-splitting approach based on hydrate formers rather than traditional sample-based methods. Among these models, the RF achieved the highest performance, with a coefficient of determination (R<sup>2</sup>) of 0.930, a root mean square error (RMSE) of 1.71, and an average absolute relative deviation (AARD) of 0.48%. Feature selection, preprocessing, and Shapley Additive Explanations (SHAP) provided valuable insights into the influence of specific variables on model predictions. Additionally, a supplementary examination, termed the reduced model, highlights the critical role of proper feature selection, with certain features regarded as less important yet essential for the functionality of distance-based models, particularly for models like SVM and MLP. This work advances methane hydrate research by offering a more accurate and interpretable framework for predicting hydrate equilibrium, addressing key gaps in previous studies, and extending its applicability to a broader range of systems. Moreover, the introduction of a former-based data-splitting method improves generalization across different hydrate formers, while the use of SHAP values for model interpretability offers deeper insights into the relationships between molecular descriptors and hydrate equilibrium conditions. This study paves the way for improved selection of hydrate formers in hydrate systems.</div></div>","PeriodicalId":325,"journal":{"name":"Fuel","volume":"384 ","pages":"Article 133945"},"PeriodicalIF":6.7000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuel","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016236124030953","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
This study aims to improve the prediction of equilibrium conditions in methane hydrate systems by incorporating diverse water-soluble hydrate formers and applying advanced machine learning techniques. Methane hydrates, which naturally form under high pressure and low temperature, can be more efficiently formed or dissociated by altering thermodynamic conditions using these hydrate formers. Accurate prediction of these conditions is crucial for optimizing gas storage and energy applications. In this research, molecular descriptors and operational parameters, such as mole fraction and pressure, are used as input variables to predict equilibrium temperature. Machine learning methods, including Decision Trees (DT), Random Forests (RF), Support Vector Machines (SVM), and Multi-Layer Perceptron (MLP), were employed with a novel data-splitting approach based on hydrate formers rather than traditional sample-based methods. Among these models, the RF achieved the highest performance, with a coefficient of determination (R2) of 0.930, a root mean square error (RMSE) of 1.71, and an average absolute relative deviation (AARD) of 0.48%. Feature selection, preprocessing, and Shapley Additive Explanations (SHAP) provided valuable insights into the influence of specific variables on model predictions. Additionally, a supplementary examination, termed the reduced model, highlights the critical role of proper feature selection, with certain features regarded as less important yet essential for the functionality of distance-based models, particularly for models like SVM and MLP. This work advances methane hydrate research by offering a more accurate and interpretable framework for predicting hydrate equilibrium, addressing key gaps in previous studies, and extending its applicability to a broader range of systems. Moreover, the introduction of a former-based data-splitting method improves generalization across different hydrate formers, while the use of SHAP values for model interpretability offers deeper insights into the relationships between molecular descriptors and hydrate equilibrium conditions. This study paves the way for improved selection of hydrate formers in hydrate systems.
期刊介绍:
The exploration of energy sources remains a critical matter of study. For the past nine decades, fuel has consistently held the forefront in primary research efforts within the field of energy science. This area of investigation encompasses a wide range of subjects, with a particular emphasis on emerging concerns like environmental factors and pollution.