Siddharth Prabhu, Nick Kosir, Mayuresh V. Kothare, Srinivas Rangarajan
{"title":"Derivative-Free Domain-Informed Data-Driven Discovery of Sparse Kinetic Models","authors":"Siddharth Prabhu, Nick Kosir, Mayuresh V. Kothare, Srinivas Rangarajan","doi":"10.1021/acs.iecr.4c02981","DOIUrl":null,"url":null,"abstract":"Developing data-driven kinetic models from reaction data is valuable for inferring the underlying reactions and designing reactive processes without needing first-principles models. However, recently developed techniques to learn interpretable dynamical models from data are susceptible to inherent experimental noise, especially in reaction kinetics data. Here, we address these issues by (1) employing a new derivative-free technique for sparse identification of dynamical equations that approximates the integral rather than the derivative (which we call as <i>DF-SINDy</i>) and (2) including domain information such as mass balance and chemistry information. We demonstrate this using retrospective examples to recover the true (known) governing equations from synthetic data under varying noise levels, sampling frequencies, and number of experiments. We observe that (1) models discovered from <i>DF-SINDy</i> have lower errors than those discovered from <i>SINDy</i> ( <cite><i>Proc. Natl.\nAcad. Sci. U.S.A.</i></cite> <span>2016</span>, <em>113</em>, 3932−3937, DOI: 10.1073/pnas.1517384113) and (2) adding domain knowledge further helps recover correct terms, thereby improving the reliability of the interpretations obtained from these models. This work is chemistry agnostic and represents a step toward developing domain-informed interpretable kinetic models for complex reaction networks.","PeriodicalId":39,"journal":{"name":"Industrial & Engineering Chemistry Research","volume":"9 1","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Industrial & Engineering Chemistry Research","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1021/acs.iecr.4c02981","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Developing data-driven kinetic models from reaction data is valuable for inferring the underlying reactions and designing reactive processes without needing first-principles models. However, recently developed techniques to learn interpretable dynamical models from data are susceptible to inherent experimental noise, especially in reaction kinetics data. Here, we address these issues by (1) employing a new derivative-free technique for sparse identification of dynamical equations that approximates the integral rather than the derivative (which we call as DF-SINDy) and (2) including domain information such as mass balance and chemistry information. We demonstrate this using retrospective examples to recover the true (known) governing equations from synthetic data under varying noise levels, sampling frequencies, and number of experiments. We observe that (1) models discovered from DF-SINDy have lower errors than those discovered from SINDy ( Proc. Natl.
Acad. Sci. U.S.A.2016, 113, 3932−3937, DOI: 10.1073/pnas.1517384113) and (2) adding domain knowledge further helps recover correct terms, thereby improving the reliability of the interpretations obtained from these models. This work is chemistry agnostic and represents a step toward developing domain-informed interpretable kinetic models for complex reaction networks.
期刊介绍:
ndustrial & Engineering Chemistry, with variations in title and format, has been published since 1909 by the American Chemical Society. Industrial & Engineering Chemistry Research is a weekly publication that reports industrial and academic research in the broad fields of applied chemistry and chemical engineering with special focus on fundamentals, processes, and products.