Mohammad Rasool Dehghani , Moein Kafi , Mehdi Maleki , Maryam Aghel , Yousef Kazemzadeh , Ali Ranjbar
{"title":"A data-driven study on viscosity estimation of hydrogen-containing gas mixtures using machine learning","authors":"Mohammad Rasool Dehghani , Moein Kafi , Mehdi Maleki , Maryam Aghel , Yousef Kazemzadeh , Ali Ranjbar","doi":"10.1016/j.ijhydene.2025.05.156","DOIUrl":null,"url":null,"abstract":"<div><div>Hydrogen has increasingly gained attention as an energy carrier due to its clean nature and high efficiency. One crucial factor in its production, storage, and transportation is viscosity. However, none of the previous studies have explored the use of machine learning models to estimate the viscosity of hydrogen-based mixtures using laboratory data. To address this gap, we compiled a dataset of 1624 viscosity measurements for hydrogen-based gas mixtures from past experimental studies. Six machine learning techniques were employed for modeling: k-nearest neighbors (KNN), support vector regression (SVR), regression tree, categorical boosting (CatBoost), extra trees, and extreme gradient boosting (XGBoost). The dataset was split into 70 % for training and 30 % for testing, with a 5-fold cross-validation approach applied to validate the models during training. To assess model performance, we used cross plots, residual error plots, error metrics, and absolute error frequency plots. Among all methods, the extra trees model demonstrated the highest accuracy, achieving an R<sup>2</sup> value of 0.9983. It was followed closely by XGBoost (0.9976), CatBoost (0.9974), KNN (0.9923), regression tree (0.9917), and SVR (0.9735). Sensitivity analysis revealed that temperature had the most significant impact on viscosity, whereas methane mole fraction had the least. Additionally, at low pressures, the mole fractions of carbon dioxide and methane exhibited an inverse relationship with viscosity, while the hydrogen mole fraction showed a direct correlation. To define the applicability range of the extra trees model, a William's plot was used, indicating that 1562 data points (96 % of the dataset) were valid. Given the direct impact of viscosity on flow behavior and system efficiency, these findings can be instrumental in optimizing hydrogen production, transportation, and storage processes.</div></div>","PeriodicalId":337,"journal":{"name":"International Journal of Hydrogen Energy","volume":"138 ","pages":"Pages 331-343"},"PeriodicalIF":8.1000,"publicationDate":"2025-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Hydrogen Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360319925024243","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Hydrogen has increasingly gained attention as an energy carrier due to its clean nature and high efficiency. One crucial factor in its production, storage, and transportation is viscosity. However, none of the previous studies have explored the use of machine learning models to estimate the viscosity of hydrogen-based mixtures using laboratory data. To address this gap, we compiled a dataset of 1624 viscosity measurements for hydrogen-based gas mixtures from past experimental studies. Six machine learning techniques were employed for modeling: k-nearest neighbors (KNN), support vector regression (SVR), regression tree, categorical boosting (CatBoost), extra trees, and extreme gradient boosting (XGBoost). The dataset was split into 70 % for training and 30 % for testing, with a 5-fold cross-validation approach applied to validate the models during training. To assess model performance, we used cross plots, residual error plots, error metrics, and absolute error frequency plots. Among all methods, the extra trees model demonstrated the highest accuracy, achieving an R2 value of 0.9983. It was followed closely by XGBoost (0.9976), CatBoost (0.9974), KNN (0.9923), regression tree (0.9917), and SVR (0.9735). Sensitivity analysis revealed that temperature had the most significant impact on viscosity, whereas methane mole fraction had the least. Additionally, at low pressures, the mole fractions of carbon dioxide and methane exhibited an inverse relationship with viscosity, while the hydrogen mole fraction showed a direct correlation. To define the applicability range of the extra trees model, a William's plot was used, indicating that 1562 data points (96 % of the dataset) were valid. Given the direct impact of viscosity on flow behavior and system efficiency, these findings can be instrumental in optimizing hydrogen production, transportation, and storage processes.
期刊介绍:
The objective of the International Journal of Hydrogen Energy is to facilitate the exchange of new ideas, technological advancements, and research findings in the field of Hydrogen Energy among scientists and engineers worldwide. This journal showcases original research, both analytical and experimental, covering various aspects of Hydrogen Energy. These include production, storage, transmission, utilization, enabling technologies, environmental impact, economic considerations, and global perspectives on hydrogen and its carriers such as NH3, CH4, alcohols, etc.
The utilization aspect encompasses various methods such as thermochemical (combustion), photochemical, electrochemical (fuel cells), and nuclear conversion of hydrogen, hydrogen isotopes, and hydrogen carriers into thermal, mechanical, and electrical energies. The applications of these energies can be found in transportation (including aerospace), industrial, commercial, and residential sectors.