深度学习燃料性能预测模型的不确定性量化

IF 5 Q2 ENERGY & FUELS

Applications in Energy and Combustion Science Pub Date : 2023-09-24 DOI:10.1016/j.jaecs.2023.100211

Kiran K. Yalamanchi , Sahil Kommalapati , Pinaki Pal , Nursulu Kuzhagaliyeva , Abdullah S AlRamadan , Balaji Mohan , Yuanjiang Pei , S. Mani Sarathy , Emre Cenker , Jihad Badra

{"title":"深度学习燃料性能预测模型的不确定性量化","authors":"Kiran K. Yalamanchi , Sahil Kommalapati , Pinaki Pal , Nursulu Kuzhagaliyeva , Abdullah S AlRamadan , Balaji Mohan , Yuanjiang Pei , S. Mani Sarathy , Emre Cenker , Jihad Badra","doi":"10.1016/j.jaecs.2023.100211","DOIUrl":null,"url":null,"abstract":"<div><p>Deep learning models are being widely used in the field of combustion. Given the black-box nature of typical neural network based models, uncertainty quantification (UQ) is critical to ensure the reliability of predictions as well as the training datasets, and for a principled quantification of noise and its various sources. Deep learning surrogate models for predicting properties of chemical compounds and mixtures have been recently shown to be promising for enabling data-driven fuel design and optimization, with the ultimate goal of improving efficiency and lowering emissions from combustion engines. In this study, UQ is performed for a multi-task deep learning model that simultaneously predicts the research octane number (RON), Motor Octane Number (MON), and Yield Sooting Index (YSI) of pure components and multicomponent blends. The deep learning model is comprised of three smaller networks: Extractor 1, Extractor 2, and Predictor, and a mixing operator. The molecular fingerprints of individual components are encoded via Extractor 1 and Extractor 2, the mixing operator generates fingerprints for mixtures/blends based on linear mixing operation, and the predictor maps the fingerprint to the target properties. Two different classes of UQ methods, Monte Carlo ensemble methods and Bayesian neural networks (BNNs), are employed for quantifying the epistemic uncertainty. Combinations of Bernoulli and Gaussian distributions with DropConnect and DropOut techniques are explored as ensemble methods. All the DropConnect, DropOut and Bayesian layers are applied to the predictor network. Aleatoric uncertainty is modeled by assuming that each data point has an independent uncertainty associated with it. The results of the UQ study are further analyzed to compare the performance of BNN and ensemble methods. Although this study is confined to UQ of fuel property prediction, the methodologies are applicable to other deep learning frameworks that are being widely used in the combustion community.</p></div>","PeriodicalId":100104,"journal":{"name":"Applications in Energy and Combustion Science","volume":"16 ","pages":"Article 100211"},"PeriodicalIF":5.0000,"publicationDate":"2023-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Uncertainty quantification of a deep learning fuel property prediction model\",\"authors\":\"Kiran K. Yalamanchi , Sahil Kommalapati , Pinaki Pal , Nursulu Kuzhagaliyeva , Abdullah S AlRamadan , Balaji Mohan , Yuanjiang Pei , S. Mani Sarathy , Emre Cenker , Jihad Badra\",\"doi\":\"10.1016/j.jaecs.2023.100211\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Deep learning models are being widely used in the field of combustion. Given the black-box nature of typical neural network based models, uncertainty quantification (UQ) is critical to ensure the reliability of predictions as well as the training datasets, and for a principled quantification of noise and its various sources. Deep learning surrogate models for predicting properties of chemical compounds and mixtures have been recently shown to be promising for enabling data-driven fuel design and optimization, with the ultimate goal of improving efficiency and lowering emissions from combustion engines. In this study, UQ is performed for a multi-task deep learning model that simultaneously predicts the research octane number (RON), Motor Octane Number (MON), and Yield Sooting Index (YSI) of pure components and multicomponent blends. The deep learning model is comprised of three smaller networks: Extractor 1, Extractor 2, and Predictor, and a mixing operator. The molecular fingerprints of individual components are encoded via Extractor 1 and Extractor 2, the mixing operator generates fingerprints for mixtures/blends based on linear mixing operation, and the predictor maps the fingerprint to the target properties. Two different classes of UQ methods, Monte Carlo ensemble methods and Bayesian neural networks (BNNs), are employed for quantifying the epistemic uncertainty. Combinations of Bernoulli and Gaussian distributions with DropConnect and DropOut techniques are explored as ensemble methods. All the DropConnect, DropOut and Bayesian layers are applied to the predictor network. Aleatoric uncertainty is modeled by assuming that each data point has an independent uncertainty associated with it. The results of the UQ study are further analyzed to compare the performance of BNN and ensemble methods. Although this study is confined to UQ of fuel property prediction, the methodologies are applicable to other deep learning frameworks that are being widely used in the combustion community.</p></div>\",\"PeriodicalId\":100104,\"journal\":{\"name\":\"Applications in Energy and Combustion Science\",\"volume\":\"16 \",\"pages\":\"Article 100211\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2023-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applications in Energy and Combustion Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666352X23001000\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applications in Energy and Combustion Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666352X23001000","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}

引用次数: 0

摘要

深度学习模型正在燃烧领域得到广泛应用。考虑到典型的基于神经网络的模型的黑匣子性质，不确定性量化（UQ）对于确保预测和训练数据集的可靠性以及噪声及其各种来源的原则性量化至关重要。用于预测化合物和混合物性质的深度学习替代模型最近被证明有希望实现数据驱动的燃料设计和优化，最终目标是提高效率和降低内燃机的排放。在本研究中，对一个多任务深度学习模型进行了UQ，该模型同时预测了纯组分和多组分混合物的研究辛烷值（RON）、马达辛烷值（MON）和收率烟化指数（YSI）。深度学习模型由三个较小的网络组成：提取器1、提取器2和预测器，以及一个混合算子。单个组分的分子指纹通过提取器1和提取器2进行编码，混合算子基于线性混合运算生成混合物/混合物的指纹，预测器将指纹映射到目标特性。采用两类不同的UQ方法，蒙特卡罗集成方法和贝叶斯神经网络（BNN）来量化认识不确定性。将伯努利分布和高斯分布与DropConnect和DropOut技术相结合作为集成方法进行了探索。所有DropConnect、DropOut和贝叶斯层都应用于预测网络。假设每个数据点都有一个独立的不确定性，从而对算术不确定性进行建模。进一步分析了UQ研究的结果，以比较BNN和集成方法的性能。尽管本研究仅限于燃料特性预测的UQ，但该方法适用于燃烧界广泛使用的其他深度学习框架。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Uncertainty quantification of a deep learning fuel property prediction model

Deep learning models are being widely used in the field of combustion. Given the black-box nature of typical neural network based models, uncertainty quantification (UQ) is critical to ensure the reliability of predictions as well as the training datasets, and for a principled quantification of noise and its various sources. Deep learning surrogate models for predicting properties of chemical compounds and mixtures have been recently shown to be promising for enabling data-driven fuel design and optimization, with the ultimate goal of improving efficiency and lowering emissions from combustion engines. In this study, UQ is performed for a multi-task deep learning model that simultaneously predicts the research octane number (RON), Motor Octane Number (MON), and Yield Sooting Index (YSI) of pure components and multicomponent blends. The deep learning model is comprised of three smaller networks: Extractor 1, Extractor 2, and Predictor, and a mixing operator. The molecular fingerprints of individual components are encoded via Extractor 1 and Extractor 2, the mixing operator generates fingerprints for mixtures/blends based on linear mixing operation, and the predictor maps the fingerprint to the target properties. Two different classes of UQ methods, Monte Carlo ensemble methods and Bayesian neural networks (BNNs), are employed for quantifying the epistemic uncertainty. Combinations of Bernoulli and Gaussian distributions with DropConnect and DropOut techniques are explored as ensemble methods. All the DropConnect, DropOut and Bayesian layers are applied to the predictor network. Aleatoric uncertainty is modeled by assuming that each data point has an independent uncertainty associated with it. The results of the UQ study are further analyzed to compare the performance of BNN and ensemble methods. Although this study is confined to UQ of fuel property prediction, the methodologies are applicable to other deep learning frameworks that are being widely used in the combustion community.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applications in Energy and Combustion Science

CiteScore

4.20

自引率

0.00%

发文量