{"title":"On ReLU neural networks as piecewise linear surrogate models","authors":"Amirhossein Hosseini , Martin Guay , Xiang Li","doi":"10.1016/j.compchemeng.2025.109208","DOIUrl":null,"url":null,"abstract":"<div><div>Continuous piecewise linear (CPWL) surrogate models are increasingly used in process systems engineering to represent complex, nonlinear relationships. Neural networks with ReLU activation functions (ReLU-NN) have become a common method to represent CPWL models. However, the structure and behavior of the linear partitions formed by rectifier networks have not been fully examined. In this study, we propose exact mathematical expressions for linear functions and linear regions of small rectifier networks. Moreover, we analyze the performance of the rectifier networks from a polyhedral perspective and introduce the three major challenges associated with these models: redundancy, degeneracy, and low efficiency. Furthermore, we assess difference-of-convex continuous piecewise linear (DC-CPWL) function as an alternative representation of CPWL relationships and compare it to ReLU-based shallow and deep Neural Networks across four industrial case studies. Our findings demonstrate that the DC-CPWL representation consistently yields highly efficient models while the ReLU-NN representation generates less efficient ones.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"201 ","pages":"Article 109208"},"PeriodicalIF":3.9000,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425002121","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Continuous piecewise linear (CPWL) surrogate models are increasingly used in process systems engineering to represent complex, nonlinear relationships. Neural networks with ReLU activation functions (ReLU-NN) have become a common method to represent CPWL models. However, the structure and behavior of the linear partitions formed by rectifier networks have not been fully examined. In this study, we propose exact mathematical expressions for linear functions and linear regions of small rectifier networks. Moreover, we analyze the performance of the rectifier networks from a polyhedral perspective and introduce the three major challenges associated with these models: redundancy, degeneracy, and low efficiency. Furthermore, we assess difference-of-convex continuous piecewise linear (DC-CPWL) function as an alternative representation of CPWL relationships and compare it to ReLU-based shallow and deep Neural Networks across four industrial case studies. Our findings demonstrate that the DC-CPWL representation consistently yields highly efficient models while the ReLU-NN representation generates less efficient ones.
期刊介绍:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.