ReLU神经网络作为分段线性代理模型的研究

IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Amirhossein Hosseini , Martin Guay , Xiang Li
{"title":"ReLU神经网络作为分段线性代理模型的研究","authors":"Amirhossein Hosseini ,&nbsp;Martin Guay ,&nbsp;Xiang Li","doi":"10.1016/j.compchemeng.2025.109208","DOIUrl":null,"url":null,"abstract":"<div><div>Continuous piecewise linear (CPWL) surrogate models are increasingly used in process systems engineering to represent complex, nonlinear relationships. Neural networks with ReLU activation functions (ReLU-NN) have become a common method to represent CPWL models. However, the structure and behavior of the linear partitions formed by rectifier networks have not been fully examined. In this study, we propose exact mathematical expressions for linear functions and linear regions of small rectifier networks. Moreover, we analyze the performance of the rectifier networks from a polyhedral perspective and introduce the three major challenges associated with these models: redundancy, degeneracy, and low efficiency. Furthermore, we assess difference-of-convex continuous piecewise linear (DC-CPWL) function as an alternative representation of CPWL relationships and compare it to ReLU-based shallow and deep Neural Networks across four industrial case studies. Our findings demonstrate that the DC-CPWL representation consistently yields highly efficient models while the ReLU-NN representation generates less efficient ones.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"201 ","pages":"Article 109208"},"PeriodicalIF":3.9000,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On ReLU neural networks as piecewise linear surrogate models\",\"authors\":\"Amirhossein Hosseini ,&nbsp;Martin Guay ,&nbsp;Xiang Li\",\"doi\":\"10.1016/j.compchemeng.2025.109208\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Continuous piecewise linear (CPWL) surrogate models are increasingly used in process systems engineering to represent complex, nonlinear relationships. Neural networks with ReLU activation functions (ReLU-NN) have become a common method to represent CPWL models. However, the structure and behavior of the linear partitions formed by rectifier networks have not been fully examined. In this study, we propose exact mathematical expressions for linear functions and linear regions of small rectifier networks. Moreover, we analyze the performance of the rectifier networks from a polyhedral perspective and introduce the three major challenges associated with these models: redundancy, degeneracy, and low efficiency. Furthermore, we assess difference-of-convex continuous piecewise linear (DC-CPWL) function as an alternative representation of CPWL relationships and compare it to ReLU-based shallow and deep Neural Networks across four industrial case studies. Our findings demonstrate that the DC-CPWL representation consistently yields highly efficient models while the ReLU-NN representation generates less efficient ones.</div></div>\",\"PeriodicalId\":286,\"journal\":{\"name\":\"Computers & Chemical Engineering\",\"volume\":\"201 \",\"pages\":\"Article 109208\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Chemical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0098135425002121\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425002121","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

连续分段线性(CPWL)代理模型在过程系统工程中越来越多地用于表示复杂的非线性关系。带有ReLU激活函数的神经网络(ReLU- nn)已经成为表示CPWL模型的常用方法。然而,由整流网络形成的线性分区的结构和行为尚未得到充分的研究。在本研究中,我们提出了小型整流网络线性函数和线性区域的精确数学表达式。此外,我们从多面体的角度分析了整流网络的性能,并介绍了与这些模型相关的三个主要挑战:冗余、退化和低效率。此外,我们评估了凸差分连续分段线性(DC-CPWL)函数作为CPWL关系的替代表示,并在四个工业案例研究中将其与基于relu的浅层和深层神经网络进行了比较。我们的研究结果表明,DC-CPWL表示始终生成高效的模型,而ReLU-NN表示生成效率较低的模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
On ReLU neural networks as piecewise linear surrogate models
Continuous piecewise linear (CPWL) surrogate models are increasingly used in process systems engineering to represent complex, nonlinear relationships. Neural networks with ReLU activation functions (ReLU-NN) have become a common method to represent CPWL models. However, the structure and behavior of the linear partitions formed by rectifier networks have not been fully examined. In this study, we propose exact mathematical expressions for linear functions and linear regions of small rectifier networks. Moreover, we analyze the performance of the rectifier networks from a polyhedral perspective and introduce the three major challenges associated with these models: redundancy, degeneracy, and low efficiency. Furthermore, we assess difference-of-convex continuous piecewise linear (DC-CPWL) function as an alternative representation of CPWL relationships and compare it to ReLU-based shallow and deep Neural Networks across four industrial case studies. Our findings demonstrate that the DC-CPWL representation consistently yields highly efficient models while the ReLU-NN representation generates less efficient ones.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers & Chemical Engineering
Computers & Chemical Engineering 工程技术-工程:化工
CiteScore
8.70
自引率
14.00%
发文量
374
审稿时长
70 days
期刊介绍: Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信