In silico estimation of density, vapor pressure, distillation curve, and octane numbers of gasoline fuels using machine learning

IF 7.5 1区 工程技术 Q2 ENERGY & FUELS
Fuel Pub Date : 2025-05-15 DOI:10.1016/j.fuel.2025.135640
Balaji Mohan , Fatema Alsaleh , Abdullah S. AlRamadan , Saud Almbdal , Alya Al-Ammari , Zainab Saihati , Alexander Voice , Jihad Badra
{"title":"In silico estimation of density, vapor pressure, distillation curve, and octane numbers of gasoline fuels using machine learning","authors":"Balaji Mohan ,&nbsp;Fatema Alsaleh ,&nbsp;Abdullah S. AlRamadan ,&nbsp;Saud Almbdal ,&nbsp;Alya Al-Ammari ,&nbsp;Zainab Saihati ,&nbsp;Alexander Voice ,&nbsp;Jihad Badra","doi":"10.1016/j.fuel.2025.135640","DOIUrl":null,"url":null,"abstract":"<div><div>This study aimed to develop a machine learning model for accurately estimating gasoline’s mixture density, vapor pressure, distillation curve, and research and motor octane numbers (RON and MON) using detailed hydrocarbon analysis (DHA) data to design sustainable fuels. DHA data from over 2200 gasoline samples collected globally were used for model training. The DHA component names were converted into Simplified molecular-input line-entry system (SMILES) representations, a molecular formula language. These SMILES strings were further transformed into molecular descriptors, which capture various physicochemical properties of the molecules. The molecular descriptors were then weighted by the corresponding volume fractions of the DHA components and fed into the model. The model was a SuperLearner ensemble, combining the top five best-performing learners (out of 40 base learners) identified through a data-driven selection process. The model’s performance was assessed using Mean Absolute Error (MAE) to evaluate its accuracy on unseen test data. The MAE achieved for mixture density was 0.001 g/cc, vapor pressure was 0.371 psi, distillation curve ranged from 1.110 to 2.764 <span><math><msup><mspace></mspace><mrow><mo>∘</mo></mrow></msup></math></span>C, RON was 0.635, and MON was 0.922. Notably, the MAE achieved by the model for the distillation curve and octane numbers was within the established reproducibility tolerances.</div></div>","PeriodicalId":325,"journal":{"name":"Fuel","volume":"399 ","pages":"Article 135640"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuel","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016236125013651","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

Abstract

This study aimed to develop a machine learning model for accurately estimating gasoline’s mixture density, vapor pressure, distillation curve, and research and motor octane numbers (RON and MON) using detailed hydrocarbon analysis (DHA) data to design sustainable fuels. DHA data from over 2200 gasoline samples collected globally were used for model training. The DHA component names were converted into Simplified molecular-input line-entry system (SMILES) representations, a molecular formula language. These SMILES strings were further transformed into molecular descriptors, which capture various physicochemical properties of the molecules. The molecular descriptors were then weighted by the corresponding volume fractions of the DHA components and fed into the model. The model was a SuperLearner ensemble, combining the top five best-performing learners (out of 40 base learners) identified through a data-driven selection process. The model’s performance was assessed using Mean Absolute Error (MAE) to evaluate its accuracy on unseen test data. The MAE achieved for mixture density was 0.001 g/cc, vapor pressure was 0.371 psi, distillation curve ranged from 1.110 to 2.764 C, RON was 0.635, and MON was 0.922. Notably, the MAE achieved by the model for the distillation curve and octane numbers was within the established reproducibility tolerances.
使用机器学习对汽油燃料的密度、蒸汽压、蒸馏曲线和辛烷值进行计算机估计
本研究旨在开发一种机器学习模型,利用详细的碳氢化合物分析(DHA)数据,准确估算汽油的混合物密度、蒸汽压、蒸馏曲线以及研究和发动机辛烷值(RON和MON),以设计可持续燃料。从全球收集的2200多个汽油样本中收集的DHA数据用于模型训练。DHA组分名称被转换成简化分子输入行输入系统(SMILES)表示,一种分子式语言。这些SMILES字符串进一步转化为分子描述符,捕获分子的各种物理化学性质。然后将分子描述符按DHA组分的相应体积分数加权,并将其输入模型。该模型是一个超级学习者集合,结合了通过数据驱动的选择过程确定的前五个表现最好的学习者(从40个基本学习者中)。使用平均绝对误差(MAE)来评估模型在未知测试数据上的准确性。混合物密度为0.001 g/cc,蒸汽压为0.371 psi,蒸馏曲线范围为1.110 ~ 2.764°C, RON为0.635,MON为0.922。值得注意的是,该模型获得的蒸馏曲线和辛烷值的MAE在既定的再现性公差范围内。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Fuel
Fuel 工程技术-工程:化工
CiteScore
12.80
自引率
20.30%
发文量
3506
审稿时长
64 days
期刊介绍: The exploration of energy sources remains a critical matter of study. For the past nine decades, fuel has consistently held the forefront in primary research efforts within the field of energy science. This area of investigation encompasses a wide range of subjects, with a particular emphasis on emerging concerns like environmental factors and pollution.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信