Hui Du , Tianyu Wang , Haogang Wei , Guy Y. Cornejo Maceda , Bernd R. Noack , Lei Zhou
{"title":"以氨氢火焰层流燃烧速度的拓扑一致回归模型为例","authors":"Hui Du , Tianyu Wang , Haogang Wei , Guy Y. Cornejo Maceda , Bernd R. Noack , Lei Zhou","doi":"10.1016/j.egyai.2024.100456","DOIUrl":null,"url":null,"abstract":"<div><div>Data-driven regression models are generally calibrated by minimizing a representation error. However, optimizing the model accuracy may create nonphysical wiggles. In this study, we propose topological consistency as a new metric to mitigate these wiggles. The key enabler is Persistent Data Topology (PDT) which extracts a topological skeleton from discrete scalar field data. PDT identifies the extrema of the model based on a neighborhood analysis. The topological error is defined as the mismatch of extrema between the data and the model. The methodology is exemplified for the modeling of the Laminar Burning Velocity (<span><math><mrow><mi>L</mi><mi>B</mi><mi>V</mi></mrow></math></span>) of ammonia-hydrogen flames. Four regression models, Multi-layer Perceptron (MLP), eXtreme Gradient Boosting (XGBoost), Random Forest (RF), and Light Gradient Boosting Machine (Light GBM), are trained using the data generated by a modified GRI3.0 mechanism. In comparison, MLP builds a model that achieves the highest accuracy and preserves the topological structure of the data. We expect that the proposed topologically consistent regression modeling will enjoy many more applications in model calibration, model selection and optimization algorithms.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"19 ","pages":"Article 100456"},"PeriodicalIF":9.6000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Topologically consistent regression modeling exemplified for laminar burning velocity of ammonia-hydrogen flames\",\"authors\":\"Hui Du , Tianyu Wang , Haogang Wei , Guy Y. Cornejo Maceda , Bernd R. Noack , Lei Zhou\",\"doi\":\"10.1016/j.egyai.2024.100456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Data-driven regression models are generally calibrated by minimizing a representation error. However, optimizing the model accuracy may create nonphysical wiggles. In this study, we propose topological consistency as a new metric to mitigate these wiggles. The key enabler is Persistent Data Topology (PDT) which extracts a topological skeleton from discrete scalar field data. PDT identifies the extrema of the model based on a neighborhood analysis. The topological error is defined as the mismatch of extrema between the data and the model. The methodology is exemplified for the modeling of the Laminar Burning Velocity (<span><math><mrow><mi>L</mi><mi>B</mi><mi>V</mi></mrow></math></span>) of ammonia-hydrogen flames. Four regression models, Multi-layer Perceptron (MLP), eXtreme Gradient Boosting (XGBoost), Random Forest (RF), and Light Gradient Boosting Machine (Light GBM), are trained using the data generated by a modified GRI3.0 mechanism. In comparison, MLP builds a model that achieves the highest accuracy and preserves the topological structure of the data. We expect that the proposed topologically consistent regression modeling will enjoy many more applications in model calibration, model selection and optimization algorithms.</div></div>\",\"PeriodicalId\":34138,\"journal\":{\"name\":\"Energy and AI\",\"volume\":\"19 \",\"pages\":\"Article 100456\"},\"PeriodicalIF\":9.6000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Energy and AI\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666546824001228\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and AI","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666546824001228","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
数据驱动的回归模型通常通过最小化表示误差来校准。然而,优化模型精度可能会产生非物理波动。在这项研究中,我们提出拓扑一致性作为一个新的度量来减轻这些波动。关键的推动者是持久性数据拓扑(Persistent Data Topology, PDT),它从离散标量字段数据中提取拓扑骨架。PDT基于邻域分析识别模型的极值。拓扑误差定义为数据与模型极值的不匹配。以氨氢火焰层流燃烧速度(LBV)的数值模拟为例。利用改进的GRI3.0机制生成的数据,对多层感知器(MLP)、极端梯度增强(XGBoost)、随机森林(RF)和光梯度增强机(Light GBM)四个回归模型进行训练。相比之下,MLP构建的模型达到了最高的精度,并保留了数据的拓扑结构。我们期望所提出的拓扑一致回归建模将在模型校准、模型选择和优化算法中得到更多的应用。
Topologically consistent regression modeling exemplified for laminar burning velocity of ammonia-hydrogen flames
Data-driven regression models are generally calibrated by minimizing a representation error. However, optimizing the model accuracy may create nonphysical wiggles. In this study, we propose topological consistency as a new metric to mitigate these wiggles. The key enabler is Persistent Data Topology (PDT) which extracts a topological skeleton from discrete scalar field data. PDT identifies the extrema of the model based on a neighborhood analysis. The topological error is defined as the mismatch of extrema between the data and the model. The methodology is exemplified for the modeling of the Laminar Burning Velocity () of ammonia-hydrogen flames. Four regression models, Multi-layer Perceptron (MLP), eXtreme Gradient Boosting (XGBoost), Random Forest (RF), and Light Gradient Boosting Machine (Light GBM), are trained using the data generated by a modified GRI3.0 mechanism. In comparison, MLP builds a model that achieves the highest accuracy and preserves the topological structure of the data. We expect that the proposed topologically consistent regression modeling will enjoy many more applications in model calibration, model selection and optimization algorithms.