{"title":"Automated physical model building from literature sources: Combining equations based on four predefined requirements","authors":"Shota Kato, Manabu Kano","doi":"10.1016/j.compchemeng.2025.109147","DOIUrl":null,"url":null,"abstract":"<div><div>Constructing physical models from equations extracted from scientific literature databases is a challenging task due to the presence of redundant, intermediate, or inconsistent equations. This study formalizes model building as the combination of equations to form desired models that satisfy criteria such as input–output completeness and consistency. To address this problem, we propose a refined gradual method, an efficient algorithm that iteratively refines candidate equation groups while ensuring perfect recall and avoiding unnecessary computations. Evaluation of the proposed method on eight case studies, including noisy datasets, complex systems, and diverse equation structures, demonstrated that the refined gradual method reduced computational time compared to existing methods and successfully constructed all desired models. The study also identifies limitations of the proposed method and suggests improvements to enhance efficiency and adaptability. By providing a general framework for solving equation combination problems, this study advances automated model-building techniques and offers a robust approach for handling complex and noisy datasets in scientific and engineering disciplines.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"200 ","pages":"Article 109147"},"PeriodicalIF":3.9000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425001516","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Constructing physical models from equations extracted from scientific literature databases is a challenging task due to the presence of redundant, intermediate, or inconsistent equations. This study formalizes model building as the combination of equations to form desired models that satisfy criteria such as input–output completeness and consistency. To address this problem, we propose a refined gradual method, an efficient algorithm that iteratively refines candidate equation groups while ensuring perfect recall and avoiding unnecessary computations. Evaluation of the proposed method on eight case studies, including noisy datasets, complex systems, and diverse equation structures, demonstrated that the refined gradual method reduced computational time compared to existing methods and successfully constructed all desired models. The study also identifies limitations of the proposed method and suggests improvements to enhance efficiency and adaptability. By providing a general framework for solving equation combination problems, this study advances automated model-building techniques and offers a robust approach for handling complex and noisy datasets in scientific and engineering disciplines.
期刊介绍:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.