Joachim Schaeffer , Jinwook Rhyu , Robin Droop , Rolf Findeisen , Richard D. Braatz
{"title":"Interpretation of high-dimensional regression coefficients by comparison with linearized compressing features","authors":"Joachim Schaeffer , Jinwook Rhyu , Robin Droop , Rolf Findeisen , Richard D. Braatz","doi":"10.1016/j.compchemeng.2025.109099","DOIUrl":null,"url":null,"abstract":"<div><div>Linear regression is often deemed inherently interpretable; however, challenges arise for high-dimensional data. We focus on further understanding how linear regression approximates nonlinear responses from high-dimensional functional data. We develop a linearization method to derive feature coefficients, which we compare with the closest regression coefficients of the path of regression solutions. We showcase the methods on battery data case studies where a single nonlinear compressing feature, <span><math><mrow><mi>g</mi><mo>:</mo><msup><mrow><mi>R</mi></mrow><mrow><mi>p</mi></mrow></msup><mo>→</mo><mi>R</mi></mrow></math></span>, is used to construct a synthetic response, <span><math><mrow><mi>y</mi><mo>∈</mo><mi>R</mi></mrow></math></span>. This unifying view of linear regression and compressing features for high-dimensional functional data helps to understand (1) how regression coefficients are shaped in the highly regularized domain, (2) how regression coefficients relate to linearized feature coefficients, and (3) how the shape of regression coefficients changes as a function of regularization to approximate nonlinear responses by exploiting local structures.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"198 ","pages":"Article 109099"},"PeriodicalIF":3.9000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425001036","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Linear regression is often deemed inherently interpretable; however, challenges arise for high-dimensional data. We focus on further understanding how linear regression approximates nonlinear responses from high-dimensional functional data. We develop a linearization method to derive feature coefficients, which we compare with the closest regression coefficients of the path of regression solutions. We showcase the methods on battery data case studies where a single nonlinear compressing feature, , is used to construct a synthetic response, . This unifying view of linear regression and compressing features for high-dimensional functional data helps to understand (1) how regression coefficients are shaped in the highly regularized domain, (2) how regression coefficients relate to linearized feature coefficients, and (3) how the shape of regression coefficients changes as a function of regularization to approximate nonlinear responses by exploiting local structures.
期刊介绍:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.