Balaji Mohan , Fatema Alsaleh , Abdullah S. AlRamadan , Saud Almbdal , Alya Al-Ammari , Zainab Saihati , Alexander Voice , Jihad Badra
{"title":"In silico estimation of density, vapor pressure, distillation curve, and octane numbers of gasoline fuels using machine learning","authors":"Balaji Mohan , Fatema Alsaleh , Abdullah S. AlRamadan , Saud Almbdal , Alya Al-Ammari , Zainab Saihati , Alexander Voice , Jihad Badra","doi":"10.1016/j.fuel.2025.135640","DOIUrl":null,"url":null,"abstract":"<div><div>This study aimed to develop a machine learning model for accurately estimating gasoline’s mixture density, vapor pressure, distillation curve, and research and motor octane numbers (RON and MON) using detailed hydrocarbon analysis (DHA) data to design sustainable fuels. DHA data from over 2200 gasoline samples collected globally were used for model training. The DHA component names were converted into Simplified molecular-input line-entry system (SMILES) representations, a molecular formula language. These SMILES strings were further transformed into molecular descriptors, which capture various physicochemical properties of the molecules. The molecular descriptors were then weighted by the corresponding volume fractions of the DHA components and fed into the model. The model was a SuperLearner ensemble, combining the top five best-performing learners (out of 40 base learners) identified through a data-driven selection process. The model’s performance was assessed using Mean Absolute Error (MAE) to evaluate its accuracy on unseen test data. The MAE achieved for mixture density was 0.001 g/cc, vapor pressure was 0.371 psi, distillation curve ranged from 1.110 to 2.764 <span><math><msup><mspace></mspace><mrow><mo>∘</mo></mrow></msup></math></span>C, RON was 0.635, and MON was 0.922. Notably, the MAE achieved by the model for the distillation curve and octane numbers was within the established reproducibility tolerances.</div></div>","PeriodicalId":325,"journal":{"name":"Fuel","volume":"399 ","pages":"Article 135640"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuel","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016236125013651","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
This study aimed to develop a machine learning model for accurately estimating gasoline’s mixture density, vapor pressure, distillation curve, and research and motor octane numbers (RON and MON) using detailed hydrocarbon analysis (DHA) data to design sustainable fuels. DHA data from over 2200 gasoline samples collected globally were used for model training. The DHA component names were converted into Simplified molecular-input line-entry system (SMILES) representations, a molecular formula language. These SMILES strings were further transformed into molecular descriptors, which capture various physicochemical properties of the molecules. The molecular descriptors were then weighted by the corresponding volume fractions of the DHA components and fed into the model. The model was a SuperLearner ensemble, combining the top five best-performing learners (out of 40 base learners) identified through a data-driven selection process. The model’s performance was assessed using Mean Absolute Error (MAE) to evaluate its accuracy on unseen test data. The MAE achieved for mixture density was 0.001 g/cc, vapor pressure was 0.371 psi, distillation curve ranged from 1.110 to 2.764 C, RON was 0.635, and MON was 0.922. Notably, the MAE achieved by the model for the distillation curve and octane numbers was within the established reproducibility tolerances.
期刊介绍:
The exploration of energy sources remains a critical matter of study. For the past nine decades, fuel has consistently held the forefront in primary research efforts within the field of energy science. This area of investigation encompasses a wide range of subjects, with a particular emphasis on emerging concerns like environmental factors and pollution.