Jingyi Liu , Min Wu , Lina Yu , Weijun Li , Wenqiang Li , Yanjie Li , Meilan Hao , Yusong Deng , Shu Wei
{"title":"CaMo: Capturing the modularity by end-to-end models for Symbolic Regression","authors":"Jingyi Liu , Min Wu , Lina Yu , Weijun Li , Wenqiang Li , Yanjie Li , Meilan Hao , Yusong Deng , Shu Wei","doi":"10.1016/j.knosys.2024.112747","DOIUrl":null,"url":null,"abstract":"<div><div>Modularity is a ubiquitous principle that permeates various aspects of nature, society, and human endeavors, from biological systems to organizational structures and beyond. In the context of Symbolic Regression, which aims to find the explicit expressions from observed data, modularity could be viewed as a type of knowledge to capture the salient substructure to achieve higher fitting results. Symbolic Regression is essentially a composition optimization problem thus remaining valuable sub-structures can provide efficiency to the subsequent search. In this paper, we propose to acquire modularity in a search process and use the term <em>module</em> indicating the useful sub-structure. Specifically, the end-to-end model is chosen to incorporate the module into the search procedure for its scalability and generalization ability. Modules are considered high-order knowledge and act as fundamental operators, expanding the search library of Symbolic Regression. The proposed algorithm enables self-learning or self-evolution of modules as part of the learning component. Additionally, a module extraction strategy generates modules hierarchically from the expression tree, along with a module update mechanism designed to eliminate unnecessary modules while incorporating new useful ones effectively. Experiments were conducted to evaluate the effectiveness of each component.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"309 ","pages":"Article 112747"},"PeriodicalIF":7.2000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124013819","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Modularity is a ubiquitous principle that permeates various aspects of nature, society, and human endeavors, from biological systems to organizational structures and beyond. In the context of Symbolic Regression, which aims to find the explicit expressions from observed data, modularity could be viewed as a type of knowledge to capture the salient substructure to achieve higher fitting results. Symbolic Regression is essentially a composition optimization problem thus remaining valuable sub-structures can provide efficiency to the subsequent search. In this paper, we propose to acquire modularity in a search process and use the term module indicating the useful sub-structure. Specifically, the end-to-end model is chosen to incorporate the module into the search procedure for its scalability and generalization ability. Modules are considered high-order knowledge and act as fundamental operators, expanding the search library of Symbolic Regression. The proposed algorithm enables self-learning or self-evolution of modules as part of the learning component. Additionally, a module extraction strategy generates modules hierarchically from the expression tree, along with a module update mechanism designed to eliminate unnecessary modules while incorporating new useful ones effectively. Experiments were conducted to evaluate the effectiveness of each component.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.