Hui Du , Tianyu Wang , Haogang Wei , Guy Y. Cornejo Maceda , Bernd R. Noack , Lei Zhou
{"title":"Topologically consistent regression modeling exemplified for laminar burning velocity of ammonia-hydrogen flames","authors":"Hui Du , Tianyu Wang , Haogang Wei , Guy Y. Cornejo Maceda , Bernd R. Noack , Lei Zhou","doi":"10.1016/j.egyai.2024.100456","DOIUrl":null,"url":null,"abstract":"<div><div>Data-driven regression models are generally calibrated by minimizing a representation error. However, optimizing the model accuracy may create nonphysical wiggles. In this study, we propose topological consistency as a new metric to mitigate these wiggles. The key enabler is Persistent Data Topology (PDT) which extracts a topological skeleton from discrete scalar field data. PDT identifies the extrema of the model based on a neighborhood analysis. The topological error is defined as the mismatch of extrema between the data and the model. The methodology is exemplified for the modeling of the Laminar Burning Velocity (<span><math><mrow><mi>L</mi><mi>B</mi><mi>V</mi></mrow></math></span>) of ammonia-hydrogen flames. Four regression models, Multi-layer Perceptron (MLP), eXtreme Gradient Boosting (XGBoost), Random Forest (RF), and Light Gradient Boosting Machine (Light GBM), are trained using the data generated by a modified GRI3.0 mechanism. In comparison, MLP builds a model that achieves the highest accuracy and preserves the topological structure of the data. We expect that the proposed topologically consistent regression modeling will enjoy many more applications in model calibration, model selection and optimization algorithms.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"19 ","pages":"Article 100456"},"PeriodicalIF":9.6000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and AI","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666546824001228","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Data-driven regression models are generally calibrated by minimizing a representation error. However, optimizing the model accuracy may create nonphysical wiggles. In this study, we propose topological consistency as a new metric to mitigate these wiggles. The key enabler is Persistent Data Topology (PDT) which extracts a topological skeleton from discrete scalar field data. PDT identifies the extrema of the model based on a neighborhood analysis. The topological error is defined as the mismatch of extrema between the data and the model. The methodology is exemplified for the modeling of the Laminar Burning Velocity () of ammonia-hydrogen flames. Four regression models, Multi-layer Perceptron (MLP), eXtreme Gradient Boosting (XGBoost), Random Forest (RF), and Light Gradient Boosting Machine (Light GBM), are trained using the data generated by a modified GRI3.0 mechanism. In comparison, MLP builds a model that achieves the highest accuracy and preserves the topological structure of the data. We expect that the proposed topologically consistent regression modeling will enjoy many more applications in model calibration, model selection and optimization algorithms.