Constraining Genetic Symbolic Regression via Semantic Backpropagation

Maximilian Reissmann, Yuan Fang, Andrew Ooi, Richard Sandberg
{"title":"Constraining Genetic Symbolic Regression via Semantic Backpropagation","authors":"Maximilian Reissmann, Yuan Fang, Andrew Ooi, Richard Sandberg","doi":"arxiv-2409.07369","DOIUrl":null,"url":null,"abstract":"Evolutionary symbolic regression approaches are powerful tools that can\napproximate an explicit mapping between input features and observation for\nvarious problems. However, ensuring that explored expressions maintain\nconsistency with domain-specific constraints remains a crucial challenge. While\nneural networks are able to employ additional information like conservation\nlaws to achieve more appropriate and robust approximations, the potential\nremains unrealized within genetic algorithms. This disparity is rooted in the\ninherent discrete randomness of recombining and mutating to generate new\nmapping expressions, making it challenging to maintain and preserve inferred\nconstraints or restrictions in the course of the exploration. To address this\nlimitation, we propose an approach centered on semantic backpropagation\nincorporated into the Gene Expression Programming (GEP), which integrates\ndomain-specific properties in a vector representation as corrective feedback\nduring the evolutionary process. By creating backward rules akin to algorithmic\ndifferentiation and leveraging pre-computed subsolutions, the mechanism allows\nthe enforcement of any constraint within an expression tree by determining the\nmisalignment and propagating desired changes back. To illustrate the\neffectiveness of constraining GEP through semantic backpropagation, we take the\nconstraint of physical dimension as an example. This framework is applied to\ndiscovering physical equations from the Feynman lectures. Results have shown\nnot only an increased likelihood of recovering the original equation but also\nnotable robustness in the presence of noisy data.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - MATH - Optimization and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07369","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Evolutionary symbolic regression approaches are powerful tools that can approximate an explicit mapping between input features and observation for various problems. However, ensuring that explored expressions maintain consistency with domain-specific constraints remains a crucial challenge. While neural networks are able to employ additional information like conservation laws to achieve more appropriate and robust approximations, the potential remains unrealized within genetic algorithms. This disparity is rooted in the inherent discrete randomness of recombining and mutating to generate new mapping expressions, making it challenging to maintain and preserve inferred constraints or restrictions in the course of the exploration. To address this limitation, we propose an approach centered on semantic backpropagation incorporated into the Gene Expression Programming (GEP), which integrates domain-specific properties in a vector representation as corrective feedback during the evolutionary process. By creating backward rules akin to algorithmic differentiation and leveraging pre-computed subsolutions, the mechanism allows the enforcement of any constraint within an expression tree by determining the misalignment and propagating desired changes back. To illustrate the effectiveness of constraining GEP through semantic backpropagation, we take the constraint of physical dimension as an example. This framework is applied to discovering physical equations from the Feynman lectures. Results have shown not only an increased likelihood of recovering the original equation but also notable robustness in the presence of noisy data.
通过语义反向传播限制遗传符号回归
进化符号回归方法是一种功能强大的工具,它可以为各种问题提供输入特征与观测结果之间的近似显式映射。然而,如何确保探索出的表达式与特定领域的约束条件保持一致,仍然是一个重要的挑战。虽然神经网络能够利用额外的信息(如守恒定律)来实现更合适、更稳健的近似,但遗传算法还没有发挥出这种潜力。这种差异源于重组和变异产生新映射表达式时固有的离散随机性,这使得在探索过程中保持和维护推断的约束或限制具有挑战性。为了解决这一限制,我们提出了一种以纳入基因表达编程(GEP)的语义反向传播为中心的方法,它将特定领域的属性整合到向量表示中,作为进化过程中的校正反馈。通过创建类似于算法差异化的后向规则并利用预先计算的子解决方案,该机制可以通过确定表达式树的对齐方式并将所需的变化传播回去,从而在表达式树中执行任何约束。为了说明通过语义反向传播约束 GEP 的效果,我们以物理维度约束为例。我们将这一框架应用于从费曼讲座中发现物理方程。结果表明,不仅恢复原始方程的可能性增加了,而且在有噪声数据的情况下也具有很强的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信