A data-driven study on viscosity estimation of hydrogen-containing gas mixtures using machine learning

IF 8.1 2区 工程技术 Q1 CHEMISTRY, PHYSICAL
Mohammad Rasool Dehghani , Moein Kafi , Mehdi Maleki , Maryam Aghel , Yousef Kazemzadeh , Ali Ranjbar
{"title":"A data-driven study on viscosity estimation of hydrogen-containing gas mixtures using machine learning","authors":"Mohammad Rasool Dehghani ,&nbsp;Moein Kafi ,&nbsp;Mehdi Maleki ,&nbsp;Maryam Aghel ,&nbsp;Yousef Kazemzadeh ,&nbsp;Ali Ranjbar","doi":"10.1016/j.ijhydene.2025.05.156","DOIUrl":null,"url":null,"abstract":"<div><div>Hydrogen has increasingly gained attention as an energy carrier due to its clean nature and high efficiency. One crucial factor in its production, storage, and transportation is viscosity. However, none of the previous studies have explored the use of machine learning models to estimate the viscosity of hydrogen-based mixtures using laboratory data. To address this gap, we compiled a dataset of 1624 viscosity measurements for hydrogen-based gas mixtures from past experimental studies. Six machine learning techniques were employed for modeling: k-nearest neighbors (KNN), support vector regression (SVR), regression tree, categorical boosting (CatBoost), extra trees, and extreme gradient boosting (XGBoost). The dataset was split into 70 % for training and 30 % for testing, with a 5-fold cross-validation approach applied to validate the models during training. To assess model performance, we used cross plots, residual error plots, error metrics, and absolute error frequency plots. Among all methods, the extra trees model demonstrated the highest accuracy, achieving an R<sup>2</sup> value of 0.9983. It was followed closely by XGBoost (0.9976), CatBoost (0.9974), KNN (0.9923), regression tree (0.9917), and SVR (0.9735). Sensitivity analysis revealed that temperature had the most significant impact on viscosity, whereas methane mole fraction had the least. Additionally, at low pressures, the mole fractions of carbon dioxide and methane exhibited an inverse relationship with viscosity, while the hydrogen mole fraction showed a direct correlation. To define the applicability range of the extra trees model, a William's plot was used, indicating that 1562 data points (96 % of the dataset) were valid. Given the direct impact of viscosity on flow behavior and system efficiency, these findings can be instrumental in optimizing hydrogen production, transportation, and storage processes.</div></div>","PeriodicalId":337,"journal":{"name":"International Journal of Hydrogen Energy","volume":"138 ","pages":"Pages 331-343"},"PeriodicalIF":8.1000,"publicationDate":"2025-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Hydrogen Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360319925024243","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Hydrogen has increasingly gained attention as an energy carrier due to its clean nature and high efficiency. One crucial factor in its production, storage, and transportation is viscosity. However, none of the previous studies have explored the use of machine learning models to estimate the viscosity of hydrogen-based mixtures using laboratory data. To address this gap, we compiled a dataset of 1624 viscosity measurements for hydrogen-based gas mixtures from past experimental studies. Six machine learning techniques were employed for modeling: k-nearest neighbors (KNN), support vector regression (SVR), regression tree, categorical boosting (CatBoost), extra trees, and extreme gradient boosting (XGBoost). The dataset was split into 70 % for training and 30 % for testing, with a 5-fold cross-validation approach applied to validate the models during training. To assess model performance, we used cross plots, residual error plots, error metrics, and absolute error frequency plots. Among all methods, the extra trees model demonstrated the highest accuracy, achieving an R2 value of 0.9983. It was followed closely by XGBoost (0.9976), CatBoost (0.9974), KNN (0.9923), regression tree (0.9917), and SVR (0.9735). Sensitivity analysis revealed that temperature had the most significant impact on viscosity, whereas methane mole fraction had the least. Additionally, at low pressures, the mole fractions of carbon dioxide and methane exhibited an inverse relationship with viscosity, while the hydrogen mole fraction showed a direct correlation. To define the applicability range of the extra trees model, a William's plot was used, indicating that 1562 data points (96 % of the dataset) were valid. Given the direct impact of viscosity on flow behavior and system efficiency, these findings can be instrumental in optimizing hydrogen production, transportation, and storage processes.
用机器学习对含氢气体混合物粘度估计的数据驱动研究
氢作为一种能源载体,由于其清洁、高效的特点,越来越受到人们的重视。其生产、储存和运输的一个关键因素是粘度。然而,之前的研究都没有探索使用机器学习模型来使用实验室数据估计氢基混合物的粘度。为了解决这一差距,我们从过去的实验研究中编制了1624个氢基气体混合物粘度测量数据集。采用了六种机器学习技术进行建模:k近邻(KNN)、支持向量回归(SVR)、回归树、分类增强(CatBoost)、额外树和极端梯度增强(XGBoost)。数据集被分成70%用于训练,30%用于测试,在训练期间应用5倍交叉验证方法来验证模型。为了评估模型的性能,我们使用了交叉图、残差图、误差度量和绝对误差频率图。在所有方法中,额外树模型的准确率最高,R2值为0.9983。其次是XGBoost(0.9976)、CatBoost(0.9974)、KNN(0.9923)、回归树(0.9917)和SVR(0.9735)。灵敏度分析表明,温度对粘度的影响最为显著,而甲烷摩尔分数对粘度的影响最小。此外,在低压下,二氧化碳和甲烷的摩尔分数与粘度呈反比关系,而氢的摩尔分数与粘度呈正相关关系。为了定义额外树模型的适用范围,使用了William’s plot,表明1562个数据点(数据集的96%)是有效的。考虑到粘度对流动行为和系统效率的直接影响,这些发现有助于优化氢气的生产、运输和储存过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Hydrogen Energy
International Journal of Hydrogen Energy 工程技术-环境科学
CiteScore
13.50
自引率
25.00%
发文量
3502
审稿时长
60 days
期刊介绍: The objective of the International Journal of Hydrogen Energy is to facilitate the exchange of new ideas, technological advancements, and research findings in the field of Hydrogen Energy among scientists and engineers worldwide. This journal showcases original research, both analytical and experimental, covering various aspects of Hydrogen Energy. These include production, storage, transmission, utilization, enabling technologies, environmental impact, economic considerations, and global perspectives on hydrogen and its carriers such as NH3, CH4, alcohols, etc. The utilization aspect encompasses various methods such as thermochemical (combustion), photochemical, electrochemical (fuel cells), and nuclear conversion of hydrogen, hydrogen isotopes, and hydrogen carriers into thermal, mechanical, and electrical energies. The applications of these energies can be found in transportation (including aerospace), industrial, commercial, and residential sectors.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信